Objectively Comparing Competing Search Engines? 405
aendeuryu asks: "My default search engine of choice is, like most of you I assume, Google. That said, some complaints about Google over the years do seem to have some merit -- basically, that sometimes the indices aren't always updated, that it's too easy to manipulate via googlebombing or legislation, and that maybe too many of its featured services never get out of beta stage. Maybe the fact that Google has gone so long without significant competition is enough to make one at least begin to ask questions about it possibly becoming stagnant. Personally, I'm so used to doing things the Google way (and achieving acceptable results quickly) that I'm not really interested in switching -- case in point, all the above links referenced were quickly found via Google. However, what am I missing out on by not giving (for example) Yahoo search a shot? Or, more to the point, how would one go about trying to effectively and objectively compare competing search engines? In what areas have people found Google to have become obsolete for their purposes? Have less ignorant people than myself figured out ways to test a competing search engine's efficacy for themselves?"
Dont bother (Score:5, Insightful)
Subjective (Score:4, Insightful)
Even by comparing keyword search side by side, one can still consider a worse result better, but who's to judge except the user?
I kept using Yahoo until it's not giving me results that I think are good enough, then I switched to Google, and I'll keep using Google until it's not returning good enough result.
I hate to say it, but... (Score:4, Insightful)
Frankly, I think you're on the right track when you ask, "What am I missing out on by not giving Yahoo search a shot?"
Likewise, I think you're on the wrong track when you go on, "Or, more to the point, how would one go about trying to effectively and objectively compare competing search engines?"
Comparing the results of searches is necessarily subjective. Only that first question has a real answer.
RD
Re:Appalling (Score:2, Insightful)
Re:I use Google for almost everything (Score:2, Insightful)
Any algorithm can be gamed (Score:3, Insightful)
Wikipedia (Score:3, Insightful)
Google is not Microsoft, and therefore is good. (Score:1, Insightful)
Remember this -- never forget this -- once Microsoft takes monopo-ownership of something, it's nearly impossible for anyone to take it away from them. Google's strong lead in search (and increasingly in other Internet services as well) helps to keep things at least a little balanced. Imagine a future in which Microsoft owns search and webmail as well? Sooner or later everything would be IE-only, and eventually Windows-only, and Microsoft will have completed its goal of effectively taking ownership of the Internet.
A good policy to go with is to simply always go with the strongest non-Microsoft choice available when choosing any product or service.
Punctuation (Score:5, Insightful)
I _used_ to go to altavista everytime i had a search that involved specific punctuation, usually some kind of coding question. Now i just get frustrated with google while trying to find some related term i can add in that will give me the results i want.
Why Google works (Score:5, Insightful)
One way to test (Score:5, Insightful)
phentermine
home loans
poker
mesothelioma
viagra
miserable failure
Then look at the sites that rank at the top. It's very easy to tell which search engines are more succeptible to manipulation. A quick look at the backlinks for sites favorably ranking in those competitive keywords tells you how that SE is doing.
Here's my opinion on the race between Google, Yahoo & MSN. Google has more sites that are authorities in the top results and Google penalizes over optimization however extreme examples of over optimization continue to show up in Google. Yahoo is a moderate success and does a fair job of filtering out spammy sites as well as authorities like wikipedia - wikipedia will always rise to the top in G but not in Y - and this is good for Y because you get more variety. MSN does an average job of filtering out blog spam but new sites are too favorably ranked and this is because MSN is new and has no recorded history of URLs. My personal preference is to use G simply because it loads the fastest in my browser... Maybe it's also worth pointing out that my company has several URLs ranked favorably in the terms listed above - looking at the change in rankings over time certainly helps give insight into which SE is better. MSN & Y are by far easier to manipulate than G but G gives the most traffic.
Re:Dont bother - why? Parallel to OS Wars (Score:5, Insightful)
Needs to be more complex though (Score:3, Insightful)
1) HOW relivant is a page, and is that page more highly ranked? It doesn't do me any good to have 99 slightly relivant results and 1 highly relivant result, if that one is at the end. So you have to measure how relivant the page is, and how high it appears in teh search and weight that.
2) The ability to find the correct page. Sometimes it's not that you are looking for general inforamtion on a topic, there's a specific page you want. However you don't know the URL or how to get there. Maybe you saw it once and have a vague memory, maybe you just heard about it, whatever. In this case, it's a question of how quickly the engine gets you the correct answer, both in terms of how high it's ranked, and how many search variations you have to try.
3) Along those lines, the ability to deal with degraded input. Sometimes it's as simple as a spelling error, but sometimes it's the searcher misunderstanding their own question. They don't know precisely what they want. Maybe because they only have a vague idea, maybe because the term they remember for it isn't quite right, whatever. So how well can teh search engine figure out what they really want and find that?
So there's lots of things like that to consider as well when you are using a general purpose web search enigne. Really only personal experience can tell you if one works well for you at finding what you want.
Re:Questions (Score:5, Insightful)
The other day I needed to know, for obscure reasons, the number of heroin addicts in Dublin. This is the kind of info that you know is probably on the web, but is going to be hard to find with Google.
I used BrainBoost - "How many heroin addicts are there in Dublin?" [brainboost.com], and, bam, first line of the result - "There are 13,000 heroin addicts in Dublin."
That's damn impressive. Out of curiosity I tried to see if I could find the same info with Google - it was fairly tough. Took three or four searches, eventually resorting to
which is a fairly specialized search that average users probably wouldn't be able to construct. The BrainBoost search, on the other hand, was completely natural, my granma could have done it.So, thumbs up for BrainBoost for question answering.
Still, it's not the kind of thing you'll want every day. For day-to-day search, Google is the tool, but BB is worth a look.
Re:Precision and Recall (Score:2, Insightful)
--
Dum de dum.
Re:Dont bother (Score:4, Insightful)
I know everyone loves google, and I use it too, but I find that where it used to be an efficient way to find information, it's becoming less and less so as time goes on because of this sort of crap. As far as I'm concerned, if I need to pay to access the information, google should not be indexing that information and putting up links to the sign up page for me to waste my time with when the answer is already freely available elsewhere and that freely available source is in their index. If I wanted to use pay sites to provide my answers, I wouldn't be using google in the first place, would I?
Re:Dont bother - why? Parallel to OS Wars (Score:1, Insightful)
learn_new_interface();
} else return null;
}
In other words, depending on how much you do with it and how long you spend using it, if the performance increase is greater than the amount of lost time, then go for it. Otherwise don't bother.
Re:Dont bother (Score:3, Insightful)
Sites like this come up all to frequently, even in google. What be really sweet, would be a way to EXCLUDE certain sites. Maybe it's already possible