Search Engines for Your Intranet or Small Business? 29
coreboarder asks: "Google recently revamped their nifty little Google Mini. It now does 100,000 documents of 220 different formats, makes your bed, and pours your beer. Where I work we have a reasonably large amount of technical data files (~80,000) of varying formats stored on a number of Windows 2000 and 2003 servers. File access is handled by permissions on the containing folder(s). Over time duplication has crept in because people cannot find what they need where they expect it to be. The $3,000 price point on the Google Mini is very attractive but is their a better way of making files and their content easily findable on a 1000 node network while still retaining their security? We also use ht://dig but it cannot handle all the file formats that would be involved here."
In that same vein, Gneral Tsao asks: "As an IT worker for a small research business, I'm trying to find a good text search engine for our subscriber facing publications. After much searching, I've found a few prospects such as Mnogosearch (which we currently use), Nutch, and Swish-e, but really no discussion about or comparison between them. This seems like a job for the Slashdot community. An ideal solution for me would be able to handle 20,000 or so pages, have a customizable PHP frontend, and allow for some amount of control over categorization." Any suggestions?
Boutell (Score:3, Interesting)
Re:Boutell (Score:3, Informative)
I've looked into the Google Mini for work but have some concerns.
1) The Mini doesn't handle access controls.
2) The yearly costs for all the Google search
appliances are, IMO, too high.
3) Google will only sell you one extra year of
maintenance. In effect, you're supposed to
pitch this appliance after two years.
I really, really like the Google appliance concepts, but I really, really dislike their
Wimp. (Score:3, Funny)
Give the users a shell and tell them to read the grep manpage.
Re:Wimp. (Score:1)
-9mm-
Re:Wimp. (Score:2)
Re:Wimp. (Score:2)
oh yea and Grep for windows [interlog.com].
Re:Wimp. (Score:2)
Re:Wimp. (Score:1)
Re:Wimp. (Score:1)
Re:Wimp. (Score:1)
The Mini is great (Score:2, Informative)
Nutch (Score:2, Interesting)
Nutch (Score:2)
Permision / Security issues (Score:1)
maybe namazu (Score:3, Interesting)
http://www.namazu.org/ [namazu.org]
long term (Score:1)
Of course workplace has limits to the amount of formats you can import into it, but definitely not the amount of data (well of course hd space, and whatever limit db2 has applies).
Re:long term (Score:2)
Re:long term (Score:1)
Best tool for the job and all - the Lotus products are the best groupware tools thus far.
Re:long term (Score:2)
Re:long term (Score:1)
However there is ALOT more to lotus/domino then just mail and caledaring.
Google Desktop Enterprise Edition (Score:3, Informative)
Re:Google Desktop Enterprise Edition (Score:1)
Re:Google Desktop Enterprise Edition (Score:2)
That being said, I remember a post by somebody who installed the Google Desktop tool on a single machine, and then hacked it up to index the network drives, and did some more tweaking to allow searches from other computers. Esentially creating his own Google Mini (although I wouldn't be suprised if this were against
Re:Google Desktop Enterprise Edition (Score:2)
But while reading the features page, it looks like you can run both the Mini or Search Appliance in tandem with GDS Enterprise to both index your intranet as well as let your employees index and search through their content.
Looks like it could be quite the time saver if you ask me. Being able to type in something like "Oracle" and pop up all of
EnterFind Appliance (Score:3, Interesting)
Supports indexing docs on Windows shares directly (as well as HTTP crawling), supports hundreds of document formats (including exotic ones like dwg files), allows precise control over indexing process and allows access via Web Services API as well as browser.
No limitations on number of users or documents and fully customizable search page.
Disclaimer: I participated in the development of this product. They (company) are good people, take care of their customers.