9238论搜索引擎的评测方法9238 on the evaluation method ofsearch engine
A long time ago, the search engine is not like today' s Allflowers bloom together. requirements of the people, low, aslong as it can be put on the Internet related website search,search to the site as a little more, the web site has a littleless able to meet. So at that time, the way people evaluate thesearch engine is to use a few keywords, test and compare theirsearch speed, search amount and the number of unrelatedwebsites. In short, it' s all inclusive. At that time, the searchengine technology is not very different, so this evaluationmethod is feasible.
Since then, the unique search engine technology has emerged oneafter another, and now it is obviously in the Warring Statesperiod. However, people' s evaluation methods have not changedmuch, and now the common evaluation is simply using severalkeywords to compare the search speed, the number of searchresults and the accuracy of their respective search.
Far not said that in the first quarter of 2001 after the upgradeof the askjeeves, you can play like as a phone in any phone inthe hands of askjeeves phone number, can also be labeled on thepage to enter the online about online speech, using yourcomputer' s microphone and speakers to communicate. Then youjust orally to it a request, it will put your voice into text,and then analyze your request to 7 million standard answers,it and other 2 million multimedia repository and Internet tofind the answer, find and then converted into voice to answeryou.
Imagine, if you ask, "the recent American election is pending,what do Americans think?"" After a while, the computer ortelephone to answer you: "according to the latest survey, thelast is if Bush is elected, 80% of Americans will accept himas the legitimate president, if Gore is finally elected, 79%of Americans will accept him as the legitimate president. " Ifyou ask, "who scored in the last World Cup finals?" "It answersyour name as well as the audio and video clips of the final goalfor you to enjoy. (of course, the audio video clips are basedon the fact that you're not using the phone, but the computer) .Although, askjeeves think their speech and search speed has tobe the degree of commercialization, but it will still have manyimmature, if you take a few keywords to test its search speedand precision and recall, and many of the common search engine,it came in where? If it' s behind you, is it a lousy searchengine?
One is evaluatingthe Internet searchengine is averydifficultthing, but a lot of evaluation results are ordinary Internetusers to see, is bound to take the Yahoo, include Sina portal,for them, is just a part of the Internet search, other kindsof search how to do? If you don't count, but the net civilianmuch; if it is, is a mess, where to?
Here, let' s analyze the capabilities flaws of several importantevaluation elements:
I. recall
Since it' s a search engine, first of all, it' s amatter of course,and if that fails, it doesn't seem to be necessary. Because thenumber of included pages each search engine announced, can thewhole letter, with a keyword search results is obviously, sothe general evaluation on this subject.
But to this date there are still many problems, most decentpoint of the search engine I can find a number of keywords toprove its search results is the most complete. Because althoughthe number of pages indexed in size, but the robot and spiderprogram, index scope and index standards are not the same, thebiggest search engine to be much smaller in the search engineto search.
Some search engines support "about", "of", "ah", and so on Whichevaluation mentioned?
In addition to the content is difficult to choose, the lengthis not good.
Some search engines do not support single Chinese charactersearch, how do you count it? Generally only a single keywordsearch, and multi keyword search it?How long is the search forlong sentences? Even search engines can support any articlesor fragments as keywords, so compare the results of the keywordsearch is not the same, not to mention the function of nocomparison. The semantic search engines like excite, as wellas the engines that support fuzzy search, and other searchengines that search for very few or even zero keywords, can finda whole bunch of results, and how do they compare?
Finally, the search engine can optimize the results forspecific keywords, and who will ensure the fairness of theevaluation? If one of the evaluated engines knows the keywordsin advance, then the champion is the only one that can be easilyoptimized.
Two: search speed
Recall ratio is faster than the search speed, if there aresearch engine index page is more, but search for a second fiveor six seconds or longer, directly ask it out, there is nomeaning than going down.
The problem of speed is at first in keyword, single keywordsearch is not fast, multi keyword search fast.
Then there is the problemof access, which is unfair for a searchengine with more than one hundred million of daily visits anda search engine with tens of thousands of visits per day.And the number of pages indexed, a search engine index 1 billion", another search engine index ten million", let them on thesame keywords in the database search results than the searchspeed, so how to convince people?
In addition to optimization problems, some search engines havethe memory search results accelerate the ability to transfera keyword, even the first word search took 10 seconds, secondsearch may be 2 seconds, third times, fourth times, when yougo to the test has always been 0.0001 seconds. So, if you choosea common word test, it' s amazing, if you come to a remote word,
maybe you can't get out of it for a long time. What keywordsshould you choose?How much do you usually use?This is reallya silly sum.
Search engines are not on the local machine in the lab, but forordinary users, so the search time should also include thesearch interface and search results of the transmissionprocess.A search engine took 0.0001 seconds, but it took 3 seconds toget the page, another search took 0.5 seconds, but it took asecond to send the page. Which search engine would you say isfaster? When you really use, you choose that 3.0001 secondslater to see the search results or 1.5 seconds later to see thesearch results?
Three: precision
This is very important, and the search is fast and fast, butthe result youwant doesn't knowhowmany pages youwant to find.What' s the result of this search? This kind of search engineis only useful when searching for rare things, but to searchfor rare things, you should go to the meta search engine. Whyuse it? The evaluation criteria of precision are difficult todetermine, and it depends on what you check. You have to lookfor a specific website and find a similar website. The key toprecision ratio is to search what and what keyword to choose,the judge can decide at random, and then affect the reliabilityof the evaluation results.
Four: dead link
General search engines have some search results that don' t go
anywhere, less than one percent, two, and eight or nine, andthis is often used as one of the evaluation criteria. But asGoogle uses web snapshot, there is almost no dead link problem,and even if the site in the search results is closed, you canstill see the web page that Google stores itself. How do youcalculate this kind of dead link?
Five: user burden
I haven' t seen anyone who has ever used this search engine inChina, but it' s an important factor in evaluating the pros andcons of search engines, including many aspects. Search enginesare for human use,
Make sure that people are comfortable, convenient, and quick,and that any user who hinders and delays the user' s access tothe final search results is charged by the user.
The first is the search interface, a pure search engineinterface with a search box, compared with a portal with adsand a large number of web pages, and their search burden forusers is high.
The second is to describe the search results, search resultspage description of the text is long or short, "the textdescription index with keyword part or the beginning of indexedpages indexed pages or a few lines of the main content, keywordsare highlighted by what color is not displayed page address,and the searchresults page layout, the the user' s searchburdenthere is a big difference.
Effect of addition is the user steps, whether can use the mouseto start the search, the search results page shows the numberis only 10, page convenient or not, the search box is two ora, above or below, a search keyword search is still displaycable box, every one of thesewill affect the search efficiency.Six: there are other
Do you want to search in this directory?,
Internet Index database update time,
Stability of search engines,
The ability to support advanced search should also beevaluated.
A person is not considerate, there may be other importantevaluation elements I did not mention, if you want to, hope toinform. See here, everyone on the limitations of the currentevaluation methods commonly used search engine must understand,of course, the most ridiculous is that I do not know is ignorantor tricky or special selection criteria, some Chinese searchengine evaluation this year to do not even include Google, aswell as a long list of celebrities can row the violin missedPaganini.
It' s really hard to evaluate a search engine.
传统农历新年将至,国人主机商DogYun(狗云)发来了虎年春节优惠活动,1月31日-2月6日活动期间使用优惠码新开动态云7折,经典云8折,新开独立服务器可立减100元/月;使用优惠码新开香港独立服务器优惠100元,并次月免费;活动期间单笔充值每满100元赠送10元,还可以参与幸运大转盘每日抽取5折码,流量,余额等奖品;商家限量推出一款年付特价套餐,共100台,每个用户限1台,香港VPS年付199元...
Sharktech又称SK或者鲨鱼机房,是一家主打高防产品的国外商家,成立于2003年,提供的产品包括独立服务器租用、VPS云服务器等,自营机房在美国洛杉矶、丹佛、芝加哥和荷兰阿姆斯特丹等。之前我们经常分享商家提供的独立服务器产品,近期主机商针对云虚拟服务器(CVS)提供优惠码,优惠后XS套餐年付最低仅33.39美元起,支持使用支付宝、PayPal、信用卡等付款方式。下面以XS套餐为例,分享产品配...
ucloud香港服务器优惠降价活动开始了!此前,ucloud官方全球云大促活动的香港云服务器一度上涨至2核4G配置752元/年,2031元/3年。让很多想购买ucloud香港云服务器的新用户望而却步!不过,目前,ucloud官方下调了香港服务器价格,此前2核4G香港云服务器752元/年,现在降至358元/年,968元/3年,价格降了快一半了!UCloud活动路子和阿里云、腾讯云不同,活动一步到位,...