Networking

Search Engines for Your Intranet or Small Business? 29

coreboarder asks: "Google recently revamped their nifty little Google Mini. It now does 100,000 documents of 220 different formats, makes your bed, and pours your beer. Where I work we have a reasonably large amount of technical data files (~80,000) of varying formats stored on a number of Windows 2000 and 2003 servers. File access is handled by permissions on the containing folder(s). Over time duplication has crept in because people cannot find what they need where they expect it to be. The $3,000 price point on the Google Mini is very attractive, but is there a better way of making files and their content easily findable on a 1,000-node network while still retaining their security? We also use ht://dig but it cannot handle all the file formats that would be involved here."
In that same vein, Gneral Tsao asks: "As an IT worker for a small research business, I'm trying to find a good text search engine for our subscriber-facing publications. After much searching, I've found a few prospects such as Mnogosearch (which we currently use), Nutch, and Swish-e, but really no discussion about or comparison between them. This seems like a job for the Slashdot community. An ideal solution for me would be able to handle 20,000 or so pages, have a customizable PHP frontend, and allow for some amount of control over categorization." Any suggestions?
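
To make the first question's core requirement concrete (full-text search that still respects folder permissions), here is a minimal sketch in Python of an inverted index that filters results by group membership at query time. It is only an illustration under stated assumptions: the class name `SecureIndex`, the group names, and the file paths are all hypothetical, and a real deployment would read ACLs from the file servers rather than take them as arguments.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class SecureIndex:
    """Toy inverted index with per-document group ACLs (hypothetical design)."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> set of document paths
        self.acl = {}                     # document path -> groups allowed to read it

    def add(self, path, text, allowed_groups):
        """Index one document and remember which groups may see it."""
        self.acl[path] = set(allowed_groups)
        for term in tokenize(text):
            self.postings[term].add(path)

    def search(self, query, user_groups):
        """Return documents containing every query term that the user may read."""
        terms = tokenize(query)
        if not terms:
            return []
        hits = set.intersection(*(self.postings.get(t, set()) for t in terms))
        groups = set(user_groups)
        # Security trimming: drop any hit the user's groups cannot read.
        return sorted(p for p in hits if self.acl[p] & groups)


idx = SecureIndex()
idx.add("eng/pump_spec.txt", "Pump pressure spec", {"engineering"})
idx.add("hr/salaries.txt", "salary spec table", {"hr"})
print(idx.search("spec", {"engineering"}))
```

Filtering at query time keeps a single shared index while ensuring that a user never sees a hit for a file they could not open; the trade-off is that the stored ACLs must be refreshed whenever folder permissions change.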

  • Boutell (Score:3, Interesting)

    by Intron ( 870560 ) on Friday May 20, 2005 @04:57PM (#12593879)
    I run the Boutell search engine [boutell.com] on my company's internal website.
    • Re:Boutell (Score:3, Informative)

      We run ht://dig at work. I use swish and w4ais at home (I maintain them) and on some customer sites.

      I've looked into the Google Mini for work but have some concerns.

      1) The Mini doesn't handle access controls.
      2) The yearly costs for all the Google search appliances are, IMO, too high.
      3) Google will only sell you one extra year of maintenance. In effect, you're supposed to pitch this appliance after two years.

      I really, really like the Google appliance concepts, but I really, really dislike their
  • Wimp. (Score:3, Funny)

    by Anonymous Coward on Friday May 20, 2005 @04:57PM (#12593888)

    Give the users a shell and tell them to read the grep manpage.
  • The Mini is great (Score:2, Informative)

    by jacumba ( 692476 )
    We picked up a mini about 2 weeks ago. The thing is amazing. From the time I cut open the box it was delivered in, to when I had our entire intranet & internet sites indexed and serving results was only 90 minutes. It's very easy to configure. Overall, it's a steal for any organization needing search.
    • Nutch (Score:2, Interesting)

      by zmarty ( 850185 )
      I run Nutch [apache.org], a project which is now part of Apache Incubator [apache.org]. I'm indexing a few tens of gaming-related websites, on www.playfuls.com [playfuls.com]. There is a lack of documentation, but if you read and play with the config files, you'll do fine.
  • One project in this area I've been playing with is Nutch [apache.org].
  • I can't figure out from the original post how you expect the Google Mini to crawl your content. The Mini is limited to content accessible via a website interface. Also, the Google Mini doesn't have any way for you to securely restrict search access to your various content.
  • maybe namazu (Score:3, Interesting)

    by Shaleh ( 1050 ) <shaleh.speakeasy@net> on Friday May 20, 2005 @07:09PM (#12594969)
    Has filters for lots of doc types, you can write more.

    http://www.namazu.org/ [namazu.org]
  • The long term solution is to put your data into groupware - lotus workplace and domino/notes is the example of how this can and should be done.

    Of course Workplace has limits on the number of formats you can import into it, but definitely not on the amount of data (well, of course HD space, and whatever limit DB2 has, applies).
    • I can't see how any solution using domino/notes could be considered a good solution.
      • Well, that's your problem, isn't it?
        Best tool for the job and all - the Lotus products are the best groupware tools thus far.
        • I'm glad you think so, but I imagine you'd be hard pressed to find anyone that agrees with you.
          • Depends on what I ask them - lots of people only know Lotus/Domino as an e-mail and calendaring application that competes with Outlook - and until the 6.x branch the Outlook client definitely was easier to use in this regard.

            However, there is a LOT more to Lotus/Domino than just mail and calendaring.
  • by La Camiseta ( 59684 ) <me@nathanclayton.com> on Friday May 20, 2005 @08:38PM (#12595579) Homepage Journal
    Why don't you use the recently released Google Desktop Enterprise Edition [google.com]? It has access controls, the ability to be pushed out to all of the client computers seamlessly, filters for a huge number of file types, the option of plugins to read more file types, and is completely free.
    • ...as free as beer can be.
    • Because they said the files were stored on servers while the Google Desktop tool only searches the local computer (You probably wouldn't want 100 computers indexing your network drive anyway).

      That being said, I remember a post by somebody who installed the Google Desktop tool on a single machine, and then hacked it up to index the network drives, and did some more tweaking to allow searches from other computers. Essentially creating his own Google Mini (although I wouldn't be surprised if this were against
      • IIRC Google Desktop Search won't automatically index network drives on the first go-around, but if you open the file up while GDS is running it will index the file.

        But while reading the features page, it looks like you can run either the Mini or the Search Appliance in tandem with GDS Enterprise to both index your intranet and let your employees index and search through their content.

        Looks like it could be quite the time saver if you ask me. Being able to type in something like "Oracle" and pop up all of
  • EnterFind Appliance (Score:3, Interesting)

    by BigGerman ( 541312 ) on Friday May 20, 2005 @10:47PM (#12596365)
    http://www.enterfind.com/ [enterfind.com]
    Supports indexing docs on Windows shares directly (as well as HTTP crawling), supports hundreds of document formats (including exotic ones like dwg files), allows precise control over indexing process and allows access via Web Services API as well as browser.
    No limitations on number of users or documents and fully customizable search page.
    Disclaimer: I participated in the development of this product. They (company) are good people, take care of their customers.
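
Several comments above turn on per-format "filters" (namazu's document filters, the formats ht://dig cannot read, EnterFind's format support). As a rough illustration of the idea, and not a description of how any of those products is implemented, the following Python sketch maps file extensions to text-extraction functions so that new formats can be plugged in. The names `FILTERS`, `register`, and `extract` are hypothetical.

```python
import html
import re
from pathlib import Path

# Hypothetical registry: file extension -> function turning raw bytes into text.
FILTERS = {}


def register(*extensions):
    """Decorator that registers a text-extraction filter for the given extensions."""
    def decorator(func):
        for ext in extensions:
            FILTERS[ext.lower()] = func
        return func
    return decorator


@register(".txt")
def plain_text(data: bytes) -> str:
    return data.decode("utf-8", errors="replace")


@register(".html", ".htm")
def html_text(data: bytes) -> str:
    text = data.decode("utf-8", errors="replace")
    # Drop script/style bodies first, then strip the remaining tags.
    text = re.sub(r"(?is)<(script|style).*?</\1>", " ", text)
    text = re.sub(r"<[^>]+>", " ", text)
    # Decode entities and collapse runs of whitespace.
    return " ".join(html.unescape(text).split())


def extract(filename: str, data: bytes):
    """Return indexable text, or None when no filter handles this format."""
    func = FILTERS.get(Path(filename).suffix.lower())
    return func(data) if func else None


print(extract("page.html", b"<p>Hello &amp; <b>bye</b></p>"))
print(extract("drawing.dwg", b""))
```

Returning None for unknown extensions lets an indexer log which formats it skipped, which is the gap the original poster describes with ht://dig.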

