SlideShare a Scribd company logo
1 of 7
The heart of search engine - Inverted Index
It is the foundation of the search engine.
When you want to build search engine such as yahoo, google at the core of the search engine it lays an
Inverted Index.
1 Building an index by crawling the web.
2 Building an inverted index.
3 Lookup the Inverted Index for relevant webpages
Steps involved in building Inverted Index
www.npntraining.com/courses/big-data-and-hadoop.php
Step 01
Build an index by crawling the web
e.g. Selenium occurs on which all sites. In order to build an Inverted Index we have to crawl the webpages
from the web and store them along with their contents.
www.abc.com Training provided on Selenium Big Data Hadoop
www.xyz.com Trainings provided on Apache Spark Scala J2EE
www.def.com Training provided on Java J2EE Python Selenium
This is an index of webpages and their contents
www.npntraining.com/courses/big-data-and-hadoop.php
Step 02
Build an inverted index
www.abc.com Training provided on Selenium BigData Hadoop
www.xyz.com Trainings provided on Apache Spark Scala J2EE
www.def.com Training provided on Java J2EE Python Selenium
This is an index of webpages and their contents
Training www.abc.com, www.xyz.com, www.def.com
BigData www.abc.com
Spark www.xyz.com
J2EE www.xyz.com, www.def.com
Build an index of words to webpages they appear in
www.npntraining.com/courses/big-data-and-hadoop.php
Step 03
Given a search term, look up the inverted index for the relevant webpages
Training www.abc.com, www.xyz.com, www.def.com
Big Data www.abc.com
Spark www.xyz.com
J2EE www.xyz.com, www.def.com
Build an index of words to webpages they appear in
www.npntraining.com/courses/big-data-and-hadoop.php
Building an Inverted Index `
www.abc.com Selenium BigData Hadoop
www.xyz.com ApacheSpark Scala J2EE Selenium
www.def.com Java J2EE Python Selenium
Map
Selenium www.abc.com
Selenium ww.xyz.com
Selenium www.def.com
BigData www.abc.com
ApacheSpark www.def.com
Spark www.xyz.com
Scala www.xyz.com
J2EE www.xyz.com
J2EE www.def.com
Python www.def.com
www.npntraining.com/courses/big-data-and-hadoop.php
Building an Inverted Index `
Reduce
Selenium [www.abc.com www.xyz.com www.def.com]
BigData [ www.abc.com ]
ApacheSpark www.def.com
Spark www.xyz.com
Scala www.xyz.com
J2EE [ www.xyz.com www.def.com ]
Python www.def.com
Selenium www.abc.com|www.xyz.com|www.def.com
BigData www.abc.com
ApacheSpark www.def.com
Spark www.xyz.com
Scala www.xyz.com
J2EE www.xyz.com|www.def.com
Python www.def.com
The heart of search engine  Inverted index

More Related Content

Viewers also liked

Keyword Searching: Advanced Techniques
Keyword Searching: Advanced TechniquesKeyword Searching: Advanced Techniques
Keyword Searching: Advanced TechniquesKris Jacobson
 
Search strategies – subject searching
Search strategies – subject searchingSearch strategies – subject searching
Search strategies – subject searchingdoverlibrary
 
5013 Indexing Presentation
5013 Indexing Presentation5013 Indexing Presentation
5013 Indexing Presentationlmartin8
 
Advanced keyword research
Advanced keyword researchAdvanced keyword research
Advanced keyword researchJono Alderson
 
Kwic
KwicKwic
KwicPU
 
Google searching techniques
Google searching techniquesGoogle searching techniques
Google searching techniquesabbas mohd
 
From KWIC to Enterprise Search - M G Lindquist
From KWIC to Enterprise Search - M G LindquistFrom KWIC to Enterprise Search - M G Lindquist
From KWIC to Enterprise Search - M G Lindquistmglindquist
 
Richard kwock jsm 2012 poster
Richard kwock jsm 2012 posterRichard kwock jsm 2012 poster
Richard kwock jsm 2012 posterAjay Ohri
 
Presentation search strategy
Presentation   search strategyPresentation   search strategy
Presentation search strategyjmunks
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional RepositoriesSarika Sawant
 
Introduction to indexing
Introduction to indexingIntroduction to indexing
Introduction to indexingDaryl Superio
 
Indexing or dividing_head
Indexing or dividing_headIndexing or dividing_head
Indexing or dividing_headJavaria Chiragh
 
The search engine index
The search engine indexThe search engine index
The search engine indexCJ Jenkins
 

Viewers also liked (20)

Keyword Searching: Advanced Techniques
Keyword Searching: Advanced TechniquesKeyword Searching: Advanced Techniques
Keyword Searching: Advanced Techniques
 
Search strategies – subject searching
Search strategies – subject searchingSearch strategies – subject searching
Search strategies – subject searching
 
3rd Thesaurus
3rd Thesaurus3rd Thesaurus
3rd Thesaurus
 
5013 Indexing Presentation
5013 Indexing Presentation5013 Indexing Presentation
5013 Indexing Presentation
 
Advanced keyword research
Advanced keyword researchAdvanced keyword research
Advanced keyword research
 
Kwic
KwicKwic
Kwic
 
Google searching techniques
Google searching techniquesGoogle searching techniques
Google searching techniques
 
Slic System
Slic SystemSlic System
Slic System
 
Identifying Keywords and Searching Techniques
Identifying Keywords and Searching TechniquesIdentifying Keywords and Searching Techniques
Identifying Keywords and Searching Techniques
 
From KWIC to Enterprise Search - M G Lindquist
From KWIC to Enterprise Search - M G LindquistFrom KWIC to Enterprise Search - M G Lindquist
From KWIC to Enterprise Search - M G Lindquist
 
Richard kwock jsm 2012 poster
Richard kwock jsm 2012 posterRichard kwock jsm 2012 poster
Richard kwock jsm 2012 poster
 
Presentation search strategy
Presentation   search strategyPresentation   search strategy
Presentation search strategy
 
Types of sentences
Types of sentencesTypes of sentences
Types of sentences
 
Institutional Repositories
Institutional RepositoriesInstitutional Repositories
Institutional Repositories
 
Introduction to indexing
Introduction to indexingIntroduction to indexing
Introduction to indexing
 
Types of indexes
Types of indexesTypes of indexes
Types of indexes
 
Indexing or dividing_head
Indexing or dividing_headIndexing or dividing_head
Indexing or dividing_head
 
Indexing
IndexingIndexing
Indexing
 
The search engine index
The search engine indexThe search engine index
The search engine index
 
Slideshare ppt
Slideshare pptSlideshare ppt
Slideshare ppt
 

Recently uploaded

Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfVishalKumarJha10
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfryanfarris8
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 

Recently uploaded (20)

Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 

The heart of search engine Inverted index

  • 1. The heart of search engine - Inverted Index It is the foundation of the search engine. When you want to build search engine such as yahoo, google at the core of the search engine it lays an Inverted Index. 1 Building an index by crawling the web. 2 Building an inverted index. 3 Lookup the Inverted Index for relevant webpages Steps involved in building Inverted Index www.npntraining.com/courses/big-data-and-hadoop.php
  • 2. Step 01 Build an index by crawling the web e.g. Selenium occurs on which all sites. In order to build an Inverted Index we have to crawl the webpages from the web and store them along with their contents. www.abc.com Training provided on Selenium Big Data Hadoop www.xyz.com Trainings provided on Apache Spark Scala J2EE www.def.com Training provided on Java J2EE Python Selenium This is an index of webpages and their contents www.npntraining.com/courses/big-data-and-hadoop.php
  • 3. Step 02 Build an inverted index www.abc.com Training provided on Selenium BigData Hadoop www.xyz.com Trainings provided on Apache Spark Scala J2EE www.def.com Training provided on Java J2EE Python Selenium This is an index of webpages and their contents Training www.abc.com, www.xyz.com, www.def.com BigData www.abc.com Spark www.xyz.com J2EE www.xyz.com, www.def.com Build an index of words to webpages they appear in www.npntraining.com/courses/big-data-and-hadoop.php
  • 4. Step 03 Given a search term, look up the inverted index for the relevant webpages Training www.abc.com, www.xyz.com, www.def.com Big Data www.abc.com Spark www.xyz.com J2EE www.xyz.com, www.def.com Build an index of words to webpages they appear in www.npntraining.com/courses/big-data-and-hadoop.php
  • 5. Building an Inverted Index ` www.abc.com Selenium BigData Hadoop www.xyz.com ApacheSpark Scala J2EE Selenium www.def.com Java J2EE Python Selenium Map Selenium www.abc.com Selenium ww.xyz.com Selenium www.def.com BigData www.abc.com ApacheSpark www.def.com Spark www.xyz.com Scala www.xyz.com J2EE www.xyz.com J2EE www.def.com Python www.def.com www.npntraining.com/courses/big-data-and-hadoop.php
  • 6. Building an Inverted Index ` Reduce Selenium [www.abc.com www.xyz.com www.def.com] BigData [ www.abc.com ] ApacheSpark www.def.com Spark www.xyz.com Scala www.xyz.com J2EE [ www.xyz.com www.def.com ] Python www.def.com Selenium www.abc.com|www.xyz.com|www.def.com BigData www.abc.com ApacheSpark www.def.com Spark www.xyz.com Scala www.xyz.com J2EE www.xyz.com|www.def.com Python www.def.com

Editor's Notes

  1. Rather than creating an object inside a function, you pass it to the function.
  2. Rather than creating an object inside a function, you pass it to the function.