Quora - What are good resources to learn about search engine architecture?
Source: Baidu Wenku (百度文库). Editor: Shenma Wenxue Wang (神马文学网). Time: 2024/04/29 09:57:56
What are good resources to learn about search engine architecture?
I think Manning and Prabhakar's book on Information Retrieval covers a good bit of the theory behind search engines, but what are the best resources out there to learn about their distributed systems, network routing and scalability aspects? Perhaps pointers to books or to conference and journal papers that talk about real-world designs and systems?
3 Answers
Krishna Gade, Twitter Search <- Bing <- Live Search <- M...
6 votes by Anon User, Anon User, Amund Tveit, Viksit Gaur, Michael Malone and Babak Hamadani
If you're interested in the architecture of search engines as they are built in practice rather than in academia, the following papers are very good. The last one especially gives you a good model for approaching the problem of how to design the architecture of a search engine.
- Evolution of Google's search architecture by Jeff Dean. http://research.goo
- Lessons from building large scale systems by Jeff Dean. http://www.cs.corne
- Operational Requirements for Scalable Search Systems. http://www.ir.ii
Also, I found that this IR lab produces good search architecture papers.
http://cis.poly.edu/westl
It focuses more on the use of IR in search engines.
Also, search engines are a large area; in general you can divide it into the systems side and the algorithms side. The algorithms part is obvious; systems refers to building large-scale distributed systems that enable the algorithms to perform effectively and efficiently.
Some conferences to follow on this topic are: SOSP, WWW, SIGMOD, VLDB.
As for informal readings, I personally subscribe to the following blogs, and many of them talk about challenges in building real systems (not necessarily search engines, but all kinds of distributed systems):
Werner Vogels (Amazon CTO)
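To make the systems/algorithms split concrete, here is a minimal Python sketch. It is not taken from any of the papers above. The class names `Shard` and `Cluster`, the term-frequency scoring, and the modulo routing are all assumptions chosen for illustration; real engines use far more elaborate ranking and partitioning.

```python
from collections import Counter, defaultdict
import heapq


class Shard:
    """One index partition holding a subset of the documents (hypothetical)."""

    def __init__(self):
        # Inverted index: term -> {doc_id: term frequency}
        self.postings = defaultdict(Counter)

    def add(self, doc_id, text):
        for term in text.lower().split():
            self.postings[term][doc_id] += 1

    def search(self, query, k):
        # Algorithms side: score each document by summed term frequency
        # (a stand-in for a real ranking function).
        scores = Counter()
        for term in query.lower().split():
            for doc_id, tf in self.postings.get(term, {}).items():
                scores[doc_id] += tf
        return heapq.nlargest(k, ((s, d) for d, s in scores.items()))


class Cluster:
    """Systems side: route documents to shards, then scatter-gather queries."""

    def __init__(self, num_shards):
        self.shards = [Shard() for _ in range(num_shards)]

    def add(self, doc_id, text):
        # Document partitioning by integer doc_id (an assumed routing rule).
        self.shards[doc_id % len(self.shards)].add(doc_id, text)

    def search(self, query, k=3):
        # Scatter the query to every shard, then merge the partial top-k lists.
        partial = [hit for shard in self.shards for hit in shard.search(query, k)]
        return [doc_id for _, doc_id in heapq.nlargest(k, partial)]


cluster = Cluster(num_shards=2)
cluster.add(0, "search engine architecture")
cluster.add(1, "distributed search search systems")
cluster.add(2, "cooking recipes")
print(cluster.search("search"))
```

In this toy version the scoring lives entirely in `Shard.search`, while `Cluster` only decides where documents go and how partial results are merged. That separation is the one described above, reduced to a single process.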
http://www.allthingsdistr
James Hamilton (Amazon DE & VP of Engineering)
http://perspectives.mvdir
http://highscalability.co
http://www.royans.net/arc
http://googleresearch.blo
From the sample chapters, it looks like the new book could be fantastic.
Xuehua Shen
2 votes by Amund Tveit and Viksit Gaur
For the ranking part of search engines, SIGIR is the most relevant conference, followed by CIKM. A relatively new book about it: http://ir.iit.edu/~ophir/
For the system implementation part, there is one more new book: http://www.search-engines
You may study the open source project Lucene (an indexing and ranking library), or Solr (an enterprise search solution based on Lucene), which is used by many companies such as Netflix. Katta, http://katta.sourceforge.
There are also the Lemur and related Indri projects in academia: http://www.lemurproject.o