21. Indexing Module · A software processor that applies a compression algorithm. · For content, the algorithm generates an inverted file index. · Also yields Structure Indexes and Special-purpose Indexes (for PDFs and video). · Example inverted index of this slide's text (term: word positions): Software: 2 · Processor: 3 · Compression: 7 · Algorithm: 8, 12 · Index: 17 · Indexes: 21, 25
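The term and position pairs listed on this slide can be reproduced with a few lines of code. The following is a minimal sketch, not the indexing module of any real search engine: the function name `build_inverted_index` and the tokenization rule (runs of letters and digits, lowercased, so "Special-purpose" counts as two words) are assumptions made for illustration, and no compression is applied.

```python
import re
from collections import defaultdict


def build_inverted_index(text):
    """Map each lowercased word to the 1-based word positions where it occurs."""
    index = defaultdict(list)
    # Assumed tokenization: runs of letters or digits; punctuation and hyphens split words.
    words = re.findall(r"[A-Za-z0-9]+", text)
    for position, word in enumerate(words, start=1):
        index[word.lower()].append(position)
    return dict(index)


slide_text = (
    "A software processor that applies a compression algorithm. "
    "For content, the algorithm generates an inverted file index. "
    "Also yields Structure Indexes, and Special-purpose Indexes"
)
index = build_inverted_index(slide_text)
print(index["algorithm"])  # [8, 12]
print(index["indexes"])    # [21, 25]
```

A production index would typically store page identifiers alongside positions and compress the position lists; this sketch keeps plain Python lists for readability.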
22. Indexes Storage area for inverted files and other processed page results. These are the valuable assets of an Internet search company.
23. The Query Module The software that handles user queries. Interacts with the ranking module, the indexes, and the page repository. Must be fast! In February 2003, Google reported serving 250,000,000 searches per day (about 2,894 queries per second) (Langville & Meyer, 2006).
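One way to picture how a query module might combine the indexes with ranking scores is sketched below. This is an illustration under stated assumptions, not a description of any real engine: `answer_query`, `toy_index`, and `toy_scores` are hypothetical names, the index maps each term to a set of page ids, the query is treated as a plain AND of its terms, and the scores are made-up numbers standing in for the ranking module's output.

```python
def answer_query(query, inverted_index, scores):
    """Return ids of pages containing every query term, highest score first."""
    terms = query.lower().split()
    if not terms:
        return []
    # Look up each term's set of page ids; unknown terms match nothing.
    postings = [inverted_index.get(term, set()) for term in terms]
    matches = set.intersection(*postings)
    # Sort by descending score, breaking ties by page id for determinism.
    return sorted(matches, key=lambda page: (-scores.get(page, 0.0), page))


# Hypothetical toy data: term -> pages containing it, page -> ranking score.
toy_index = {
    "search": {"p1", "p2", "p3"},
    "engine": {"p1", "p3"},
    "ranking": {"p2", "p3"},
}
toy_scores = {"p1": 0.2, "p2": 0.5, "p3": 0.3}

results = answer_query("search engine", toy_index, toy_scores)
print(results)  # ['p3', 'p1']
```

Because the lookups are dictionary and set operations, the work per query depends on the size of the matching posting sets rather than on the size of the whole collection, which is one reason inverted indexes suit the "must be fast" requirement.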
24. The Ranking Module The software that examines the hyperlink structure and calculates a page’s value.
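The slide does not name a specific ranking algorithm. As an illustration only, the sketch below uses a PageRank-style power iteration, the method discussed in the reference text by Langville and Meyer. The function name `pagerank`, the damping value 0.85, the fixed iteration count, and the three-page link graph are all assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links: dict mapping a page to the list of pages it links to.

    Returns a dict mapping each page to a score; the scores sum to 1.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Rank held by pages with no outlinks is spread evenly over all pages.
        dangling = sum(rank[page] for page in pages if not links.get(page))
        new_rank = {page: (1 - damping) / n + damping * dangling / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            for target in targets:
                # Each page divides its current rank equally among its outlinks.
                new_rank[target] += damping * rank[page] / len(targets)
        rank = new_rank
    return rank


# Hypothetical three-page web: A links to B and C, B links to C, C links to A.
toy_links = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
scores = pagerank(toy_links)
print(sorted(scores, key=scores.get, reverse=True))  # ['C', 'A', 'B']
```

In this toy graph, C collects links from both other pages and ends up with the highest score, while B, linked only from A, ends up lowest.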
25.
26. An Excellent History (the key reference text) Amy Langville, Carl Meyer, Google’s PageRank and Beyond: The Science of Search Engine Rankings. Princeton University Press, 2006.