Home All Groups Group Topic Archive Search About

open source .NET search engine?

Author
17 Oct 2006 6:37 PM
susiedba@hotmail.com
does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?

anyone want to trade notes?

I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.

Does anyone want to help?

-Susie

Author
17 Oct 2006 9:11 PM
Chris
Susie,

As for your subject: open source .NET search engine try
http://www.dotlucene.net/

As for spidering, there are many website copiers out there, try HTTrack
http://www.httrack.com/.

I am not condoning unethical copyright infringement. However, I'm so sure
that you will be unable to sell a competitor's website to a large company
like Lowes that I give this information. Besides, I'm sure they have many
talented people in their IT department that could get them this info in no
time at all.

As far as building something more focused than google--good luck. Google,
just like Microsoft, has top computer scientist working on some crazy stuff
i.e. natural language processing, query analysis, best bets, controlled
vocabularies and a little artificial intelligence. Most of which involves a
fair amount of some pretty advanced mathematics. I'm not trying to
discourage you to not build something better; there are lots of brilliant
people in this world. Just use your brilliance for something good!

As far as making money, why not use the Amazon E-commerce Web Service. It
gives you access to all of Amazon's products, images, reviews, pricing, and
a remote shopping cart system. You can just mark up the prices a bit to make
money for not doing much of anything besides coding a website.
http://aws.amazon.com
http://www.google.com/search?hl=en&q=amazon+web+service

Chris


<susie***@hotmail.com> wrote in message
Show quoteHide quote
news:1161110236.872569.205930@i3g2000cwc.googlegroups.com...
> does anyone know of a framework; or tools; or something-- that
> describes an open source VB.net search engine / spider?
>
> anyone want to trade notes?
>
> I want to build something a lot more focused than google; for example;
> I want to spider Home Depot websites and sell it to Lowes.
>
> Does anyone want to help?
>
> -Susie
>
Author
17 Oct 2006 10:03 PM
susiedba@hotmail.com
thanks guys; does anyone else have any ideas??


Chris wrote:
Show quoteHide quote
> Susie,
>
> As for your subject: open source .NET search engine try
> http://www.dotlucene.net/
>
> As for spidering, there are many website copiers out there, try HTTrack
> http://www.httrack.com/.
>
> I am not condoning unethical copyright infringement. However, I'm so sure
> that you will be unable to sell a competitor's website to a large company
> like Lowes that I give this information. Besides, I'm sure they have many
> talented people in their IT department that could get them this info in no
> time at all.
>
> As far as building something more focused than google--good luck. Google,
> just like Microsoft, has top computer scientist working on some crazy stuff
> i.e. natural language processing, query analysis, best bets, controlled
> vocabularies and a little artificial intelligence. Most of which involves a
> fair amount of some pretty advanced mathematics. I'm not trying to
> discourage you to not build something better; there are lots of brilliant
> people in this world. Just use your brilliance for something good!
>
> As far as making money, why not use the Amazon E-commerce Web Service. It
> gives you access to all of Amazon's products, images, reviews, pricing, and
> a remote shopping cart system. You can just mark up the prices a bit to make
> money for not doing much of anything besides coding a website.
> http://aws.amazon.com
> http://www.google.com/search?hl=en&q=amazon+web+service
>
> Chris
>
>
> <susie***@hotmail.com> wrote in message
> news:1161110236.872569.205930@i3g2000cwc.googlegroups.com...
> > does anyone know of a framework; or tools; or something-- that
> > describes an open source VB.net search engine / spider?
> >
> > anyone want to trade notes?
> >
> > I want to build something a lot more focused than google; for example;
> > I want to spider Home Depot websites and sell it to Lowes.
> >
> > Does anyone want to help?
> >
> > -Susie
> >
Author
17 Oct 2006 10:12 PM
susiedba@hotmail.com
and it goes without saying that I think that Microsoft is completely
and utterly incompetent.

they're pulling a napoleon also.. they need to sell their Xbox and MSN
division and fold it back into their core competencies.

supposedly they have 5,000 developers and testers working on vista AND
office 2007.

what the hell are the other 60,000 employees doing?




susie***@hotmail.com wrote:
Show quoteHide quote
> thanks guys; does anyone else have any ideas??
>
>
> Chris wrote:
> > Susie,
> >
> > As for your subject: open source .NET search engine try
> > http://www.dotlucene.net/
> >
> > As for spidering, there are many website copiers out there, try HTTrack
> > http://www.httrack.com/.
> >
> > I am not condoning unethical copyright infringement. However, I'm so sure
> > that you will be unable to sell a competitor's website to a large company
> > like Lowes that I give this information. Besides, I'm sure they have many
> > talented people in their IT department that could get them this info in no
> > time at all.
> >
> > As far as building something more focused than google--good luck. Google,
> > just like Microsoft, has top computer scientist working on some crazy stuff
> > i.e. natural language processing, query analysis, best bets, controlled
> > vocabularies and a little artificial intelligence. Most of which involves a
> > fair amount of some pretty advanced mathematics. I'm not trying to
> > discourage you to not build something better; there are lots of brilliant
> > people in this world. Just use your brilliance for something good!
> >
> > As far as making money, why not use the Amazon E-commerce Web Service. It
> > gives you access to all of Amazon's products, images, reviews, pricing, and
> > a remote shopping cart system. You can just mark up the prices a bit to make
> > money for not doing much of anything besides coding a website.
> > http://aws.amazon.com
> > http://www.google.com/search?hl=en&q=amazon+web+service
> >
> > Chris
> >
> >
> > <susie***@hotmail.com> wrote in message
> > news:1161110236.872569.205930@i3g2000cwc.googlegroups.com...
> > > does anyone know of a framework; or tools; or something-- that
> > > describes an open source VB.net search engine / spider?
> > >
> > > anyone want to trade notes?
> > >
> > > I want to build something a lot more focused than google; for example;
> > > I want to spider Home Depot websites and sell it to Lowes.
> > >
> > > Does anyone want to help?
> > >
> > > -Susie
> > >
Author
17 Oct 2006 11:24 PM
Spam Catcher
"susie***@hotmail.com" <susie***@hotmail.com> wrote in
news:1161123151.901347.83510@e3g2000cwe.googlegroups.com:

> supposedly they have 5,000 developers and testers working on vista AND
> office 2007.
>
> what the hell are the other 60,000 employees doing?

Microsoft does have more than 2 products. .NET, VS.NET, SQL Server :-)
Author
17 Oct 2006 10:10 PM
susiedba@hotmail.com
I really do think that there is room for a new service.
I just am going to have some sort of scope to my project-- instead of
pulling a napoleon- like google does-- and try to enter EVERY MARKET at
the same time.

moving from search engines to spreadsheets; IM; Email; Usenet; Books;
eCommerce-- I just dont think that google is 'big enough' to be
successful in any of these new markets.

which means that there is room for innovation.

I've had many customers ask me to pull XYZ off of site ABC

I personally think that a simple search engine should consist of a
couple of Olap Servers and a couple of relational boxes.. and a couple
of crawlers... not too complex at all.

-Susie




Chris wrote:
Show quoteHide quote
> Susie,
>
> As for your subject: open source .NET search engine try
> http://www.dotlucene.net/
>
> As for spidering, there are many website copiers out there, try HTTrack
> http://www.httrack.com/.
>
> I am not condoning unethical copyright infringement. However, I'm so sure
> that you will be unable to sell a competitor's website to a large company
> like Lowes that I give this information. Besides, I'm sure they have many
> talented people in their IT department that could get them this info in no
> time at all.
>
> As far as building something more focused than google--good luck. Google,
> just like Microsoft, has top computer scientist working on some crazy stuff
> i.e. natural language processing, query analysis, best bets, controlled
> vocabularies and a little artificial intelligence. Most of which involves a
> fair amount of some pretty advanced mathematics. I'm not trying to
> discourage you to not build something better; there are lots of brilliant
> people in this world. Just use your brilliance for something good!
>
> As far as making money, why not use the Amazon E-commerce Web Service. It
> gives you access to all of Amazon's products, images, reviews, pricing, and
> a remote shopping cart system. You can just mark up the prices a bit to make
> money for not doing much of anything besides coding a website.
> http://aws.amazon.com
> http://www.google.com/search?hl=en&q=amazon+web+service
>
> Chris
>
>
> <susie***@hotmail.com> wrote in message
> news:1161110236.872569.205930@i3g2000cwc.googlegroups.com...
> > does anyone know of a framework; or tools; or something-- that
> > describes an open source VB.net search engine / spider?
> >
> > anyone want to trade notes?
> >
> > I want to build something a lot more focused than google; for example;
> > I want to spider Home Depot websites and sell it to Lowes.
> >
> > Does anyone want to help?
> >
> > -Susie
> >
Author
17 Oct 2006 11:25 PM
Spam Catcher
"susie***@hotmail.com" <susie***@hotmail.com> wrote in
news:1161123049.727124.78350@e3g2000cwe.googlegroups.com:

> I personally think that a simple search engine should consist of a
> couple of Olap Servers and a couple of relational boxes.. and a couple
> of crawlers... not too complex at all.

Good luck if you don't think it's complex - there's a reason why Google
hires a lot of PhDs!
Author
17 Oct 2006 9:12 PM
Spam Catcher
"susie***@hotmail.com" <susie***@hotmail.com> wrote in
news:1161110236.872569.205930@i3g2000cwc.googlegroups.com:

> does anyone know of a framework; or tools; or something-- that
> describes an open source VB.net search engine / spider?
>
> anyone want to trade notes?
>
> I want to build something a lot more focused than google; for example;
> I want to spider Home Depot websites and sell it to Lowes.


Lucene.NET or Microsoft Index Server or SQL Server Full Text Search Engine.