SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Assignment
Topic-How Search Engine works?Topic-How Search Engine works?
Submitted to-Submitted to- Al Imtiaz (Lecturer)Al Imtiaz (Lecturer)
((CSE & ITCSE & IT))
Prepared by-Prepared by-
Sheikh Mohammad ShahnoorSheikh Mohammad Shahnoor ID-12410224ID-12410224
Faisal AhmedFaisal Ahmed ID-12410209ID-12410209
Saiful Islam ShakilSaiful Islam Shakil ID-12410213ID-12410213
Md.AshrafuzzamanMd.Ashrafuzzaman ID12410219ID12410219
 Crawling & indexingCrawling & indexing
 algorithmsalgorithms
 Fighting spamFighting spam
Factors of Search EnginesFactors of Search Engines
Reference- “http://www.google.com/insidesearch/howsearchworks/thestory/”
1.Crawling & Indexing
• Search starts with the web.
• It's made up of over 60 trillion individual pages.
• Google navigates the web by crawling.
• That means we follow links from page to page.
• The pages are sorted by their content and other factors.
Crawling & IndexingCrawling & Indexing
And by this ,engine keep track of it all in the index.
Names of some popular search EngineNames of some popular search Engine
 Google-Google-
 Bing-Bing-
 Yahoo-Yahoo-
 Ask-Ask-
 Aol.-Aol.-
 Mywebsearch-Mywebsearch-
2.Algorithms2.Algorithms
• Programs & formulas are written to deliver the best results possible.
• Algorithms get to work looking for clues to better understand what
we mean.
As we search??
Spelling -Identifies and corrects possible spelling errors and provides alternatives.
 Auto complete -Predicts what you might be searching for. This includes understanding
terms with more than one meaning.
 Synonyms -Recognizes words with similar meanings.
 Query Understanding-Gets to the deeper meaning of the words you type.
 Search Methods -Creates new ways to search, including "search by image" and
"voice search."
Google Instant-Displays immediate results as you type.
Based on this clues we get relevant documents from the index
Ranking
 Site & Page Quality -Uses a set of signals to determine how
trustworthy, reputable, or authoritative a source is.
 Freshness -Shows the latest news and information.
 Safe Search-Reduces the amount of adult web pages,
images, and videos in your results.
 User Context -Provides more relevant results based on
geographic region, Web History, etc.
 Translation and Internationalization -Tailors results
based on our language and country.
 Universal Search -Blends relevant content, such as images,
news, maps, videos, and your personal content, into a single
unified search results page.
ResultsResults
The outcomes comes within 1/8th
seconds towards us
Robot Indexing DiagramRobot Indexing Diagram
3.Fighting spam3.Fighting spam
It fights spam always 24/7.
To keep your results relevant.
The majority of spam removal is automatic.
Search Engine examine other questionable
documents by hand.
If it find spam, we take manual action.
Types of SpamTypes of Spam
1.1. Pure Spam -Pure Spam -Site appears to use aggressive spam techniques such as
automatically generated gibberish, cloaking, scraping content from other
websites, and/or repeated or egregious violations of Google's
Webmaster Guidelines.
2.2. Hidden text and/or keyword stuffing-Hidden text and/or keyword stuffing- Some of the pages maySome of the pages may
contain hidden text and/or keyword stuffing.contain hidden text and/or keyword stuffing.
3. User-generated spam -Site appears to contain spam my user-
generated content. The problematic content may appear on forum
pages, guestbook pages, or user profiles.
4.4. ParkedParked domainsdomains-Parked-Parked domains are placeholder sites with littledomains are placeholder sites with little
unique content, so Google doesn't typically include them in search results.unique content, so Google doesn't typically include them in search results.
5.5. Thin content with little or no added valueThin content with little or no added value..
6.6. UnnaturalUnnatural links tolinks to aa sitesite-manipulative links pointing to the site.-manipulative links pointing to the site.
7.7. Spammy free hosts and dynamic DNS providers.Spammy free hosts and dynamic DNS providers.
8.8. Cloaking and/or sneaky redirects-DCloaking and/or sneaky redirects-Displayingisplaying different content todifferent content to
human users than is shown to search engineshuman users than is shown to search engines
9.9. Hacked site.Hacked site.
1010..Unnatural links from a site-ThisUnnatural links from a site-This may be the result of selling linksmay be the result of selling links
that pass Page Rank or participating in link schemes.that pass Page Rank or participating in link schemes.
InitiativesInitiatives
 When necessary actions are taken , notification are sent toWhen necessary actions are taken , notification are sent to
the website owners.the website owners.
 In replyIn reply
 Site owners can fix their sites and let this know to theSite owners can fix their sites and let this know to the
desired search engine.desired search engine.
This is how search works
…………..
Questions & Answers????
Reference- “http://www.google.com/insidesearch/howsearchworks/thestory/”
Search engine

Weitere ähnliche Inhalte

Was ist angesagt?

SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012
451 Marketing
 
Web crawler with seo analysis
Web crawler with seo analysis Web crawler with seo analysis
Web crawler with seo analysis
Vikram Parmar
 

Was ist angesagt? (20)

Basic SEO mini workshop for copywriter
Basic SEO mini workshop for copywriter Basic SEO mini workshop for copywriter
Basic SEO mini workshop for copywriter
 
SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012
 
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AUKeeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
Keeping Things Lean & Mean: Crawl Optimisation - Search Marketing Summit AU
 
How search engine works ( Mr. Mirza)
How search engine works ( Mr. Mirza)How search engine works ( Mr. Mirza)
How search engine works ( Mr. Mirza)
 
Google algorithim’s
Google  algorithim’sGoogle  algorithim’s
Google algorithim’s
 
Search Engines
Search EnginesSearch Engines
Search Engines
 
Web crawler with seo analysis
Web crawler with seo analysis Web crawler with seo analysis
Web crawler with seo analysis
 
How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works report
 
Seoptimizing
SeoptimizingSeoptimizing
Seoptimizing
 
How search engine works
How search engine worksHow search engine works
How search engine works
 
Site Architecture Best Practices for Search Findability - Adam Audette
Site Architecture Best Practices for Search Findability - Adam AudetteSite Architecture Best Practices for Search Findability - Adam Audette
Site Architecture Best Practices for Search Findability - Adam Audette
 
Spooky Good Technical SEO for E-Commerce Sites - Adam Dince
Spooky Good Technical SEO for E-Commerce Sites - Adam DinceSpooky Good Technical SEO for E-Commerce Sites - Adam Dince
Spooky Good Technical SEO for E-Commerce Sites - Adam Dince
 
Search engine marketing
Search engine marketingSearch engine marketing
Search engine marketing
 
DomainTools Fingerprinting Threat Actors with Web Assets
DomainTools Fingerprinting Threat Actors with Web AssetsDomainTools Fingerprinting Threat Actors with Web Assets
DomainTools Fingerprinting Threat Actors with Web Assets
 
Comparing Search Engines
Comparing Search EnginesComparing Search Engines
Comparing Search Engines
 
WordPress SEO Basics - Melbourne WordPress Meetup
WordPress SEO Basics - Melbourne WordPress MeetupWordPress SEO Basics - Melbourne WordPress Meetup
WordPress SEO Basics - Melbourne WordPress Meetup
 
How Google Search Algorithm Works ??
How Google Search Algorithm Works ??How Google Search Algorithm Works ??
How Google Search Algorithm Works ??
 
Website audit for SEO
Website audit for SEOWebsite audit for SEO
Website audit for SEO
 
Top 10 Technical SEO Mistakes (that we see time and again)...
Top 10 Technical SEO Mistakes (that we see time and again)...Top 10 Technical SEO Mistakes (that we see time and again)...
Top 10 Technical SEO Mistakes (that we see time and again)...
 
Php Meetup Seo
Php Meetup SeoPhp Meetup Seo
Php Meetup Seo
 

Andere mochten auch (8)

How search engine works
How search engine worksHow search engine works
How search engine works
 
How goole search engine work
How goole search engine workHow goole search engine work
How goole search engine work
 
How a search engine works slide
How a search engine works slideHow a search engine works slide
How a search engine works slide
 
Google Search Engine
Google Search EngineGoogle Search Engine
Google Search Engine
 
Search Engine Optimization - What's it about?
Search Engine Optimization -  What's it about?Search Engine Optimization -  What's it about?
Search Engine Optimization - What's it about?
 
How google search engine work
How google search engine workHow google search engine work
How google search engine work
 
How Google Search Engine Works
How Google Search Engine Works How Google Search Engine Works
How Google Search Engine Works
 
Google ppt by amit
Google ppt by amitGoogle ppt by amit
Google ppt by amit
 

Ähnlich wie Search engine

Ähnlich wie Search engine (20)

Detection of Phishing Websites
Detection of Phishing Websites Detection of Phishing Websites
Detection of Phishing Websites
 
DMI Webinar Series - SEO Audits (Part 1 of 3)
DMI Webinar Series - SEO Audits (Part 1 of 3)DMI Webinar Series - SEO Audits (Part 1 of 3)
DMI Webinar Series - SEO Audits (Part 1 of 3)
 
Seo Presentation for Beginners, Complete SEO ppt,
Seo Presentation for Beginners, Complete SEO ppt,Seo Presentation for Beginners, Complete SEO ppt,
Seo Presentation for Beginners, Complete SEO ppt,
 
Seo
SeoSeo
Seo
 
Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Search Engine Optimization (SEO)
Search Engine Optimization (SEO)
 
Search Engine Optimisation (Seo) And Search Engine Marketing
Search Engine Optimisation (Seo) And Search Engine MarketingSearch Engine Optimisation (Seo) And Search Engine Marketing
Search Engine Optimisation (Seo) And Search Engine Marketing
 
Seo
SeoSeo
Seo
 
Seo
SeoSeo
Seo
 
Technical SEO Best Practices
Technical SEO Best PracticesTechnical SEO Best Practices
Technical SEO Best Practices
 
SEO - Emarketing by Surya Mishra
SEO - Emarketing by Surya MishraSEO - Emarketing by Surya Mishra
SEO - Emarketing by Surya Mishra
 
Seo
SeoSeo
Seo
 
Different Module of Digital Marketing
Different Module of Digital MarketingDifferent Module of Digital Marketing
Different Module of Digital Marketing
 
SEO
SEOSEO
SEO
 
Introduction to search_marketing
Introduction to search_marketingIntroduction to search_marketing
Introduction to search_marketing
 
Complete Course Search Engine Optimization
Complete Course Search Engine OptimizationComplete Course Search Engine Optimization
Complete Course Search Engine Optimization
 
Emarketing
EmarketingEmarketing
Emarketing
 
Seo ppt
Seo pptSeo ppt
Seo ppt
 
Seo
SeoSeo
Seo
 
SEO.ppt
SEO.pptSEO.ppt
SEO.ppt
 
Website Audit [On Page and Off Page] by Carl Benedic Pantaleon
Website Audit [On Page and Off Page] by Carl Benedic PantaleonWebsite Audit [On Page and Off Page] by Carl Benedic Pantaleon
Website Audit [On Page and Off Page] by Carl Benedic Pantaleon
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 

Search engine

  • 1. Assignment Topic-How Search Engine works?Topic-How Search Engine works? Submitted to-Submitted to- Al Imtiaz (Lecturer)Al Imtiaz (Lecturer) ((CSE & ITCSE & IT))
  • 2. Prepared by-Prepared by- Sheikh Mohammad ShahnoorSheikh Mohammad Shahnoor ID-12410224ID-12410224 Faisal AhmedFaisal Ahmed ID-12410209ID-12410209 Saiful Islam ShakilSaiful Islam Shakil ID-12410213ID-12410213 Md.AshrafuzzamanMd.Ashrafuzzaman ID12410219ID12410219
  • 3.  Crawling & indexingCrawling & indexing  algorithmsalgorithms  Fighting spamFighting spam Factors of Search EnginesFactors of Search Engines Reference- “http://www.google.com/insidesearch/howsearchworks/thestory/”
  • 4. 1.Crawling & Indexing • Search starts with the web. • It's made up of over 60 trillion individual pages. • Google navigates the web by crawling. • That means we follow links from page to page. • The pages are sorted by their content and other factors.
  • 5. Crawling & IndexingCrawling & Indexing And by this ,engine keep track of it all in the index.
  • 6. Names of some popular search EngineNames of some popular search Engine  Google-Google-  Bing-Bing-  Yahoo-Yahoo-  Ask-Ask-  Aol.-Aol.-  Mywebsearch-Mywebsearch-
  • 7.
  • 8. 2.Algorithms2.Algorithms • Programs & formulas are written to deliver the best results possible. • Algorithms get to work looking for clues to better understand what we mean.
  • 9. As we search?? Spelling -Identifies and corrects possible spelling errors and provides alternatives.  Auto complete -Predicts what you might be searching for. This includes understanding terms with more than one meaning.  Synonyms -Recognizes words with similar meanings.  Query Understanding-Gets to the deeper meaning of the words you type.  Search Methods -Creates new ways to search, including "search by image" and "voice search." Google Instant-Displays immediate results as you type.
  • 10. Based on this clues we get relevant documents from the index
  • 11. Ranking  Site & Page Quality -Uses a set of signals to determine how trustworthy, reputable, or authoritative a source is.  Freshness -Shows the latest news and information.  Safe Search-Reduces the amount of adult web pages, images, and videos in your results.  User Context -Provides more relevant results based on geographic region, Web History, etc.  Translation and Internationalization -Tailors results based on our language and country.  Universal Search -Blends relevant content, such as images, news, maps, videos, and your personal content, into a single unified search results page.
  • 12. ResultsResults The outcomes comes within 1/8th seconds towards us
  • 13. Robot Indexing DiagramRobot Indexing Diagram
  • 14. 3.Fighting spam3.Fighting spam It fights spam always 24/7. To keep your results relevant. The majority of spam removal is automatic. Search Engine examine other questionable documents by hand. If it find spam, we take manual action.
  • 15. Types of SpamTypes of Spam 1.1. Pure Spam -Pure Spam -Site appears to use aggressive spam techniques such as automatically generated gibberish, cloaking, scraping content from other websites, and/or repeated or egregious violations of Google's Webmaster Guidelines.
  • 16. 2.2. Hidden text and/or keyword stuffing-Hidden text and/or keyword stuffing- Some of the pages maySome of the pages may contain hidden text and/or keyword stuffing.contain hidden text and/or keyword stuffing. 3. User-generated spam -Site appears to contain spam my user- generated content. The problematic content may appear on forum pages, guestbook pages, or user profiles.
  • 17. 4.4. ParkedParked domainsdomains-Parked-Parked domains are placeholder sites with littledomains are placeholder sites with little unique content, so Google doesn't typically include them in search results.unique content, so Google doesn't typically include them in search results. 5.5. Thin content with little or no added valueThin content with little or no added value.. 6.6. UnnaturalUnnatural links tolinks to aa sitesite-manipulative links pointing to the site.-manipulative links pointing to the site. 7.7. Spammy free hosts and dynamic DNS providers.Spammy free hosts and dynamic DNS providers. 8.8. Cloaking and/or sneaky redirects-DCloaking and/or sneaky redirects-Displayingisplaying different content todifferent content to human users than is shown to search engineshuman users than is shown to search engines 9.9. Hacked site.Hacked site. 1010..Unnatural links from a site-ThisUnnatural links from a site-This may be the result of selling linksmay be the result of selling links that pass Page Rank or participating in link schemes.that pass Page Rank or participating in link schemes.
  • 18. InitiativesInitiatives  When necessary actions are taken , notification are sent toWhen necessary actions are taken , notification are sent to the website owners.the website owners.  In replyIn reply  Site owners can fix their sites and let this know to theSite owners can fix their sites and let this know to the desired search engine.desired search engine.
  • 19. This is how search works ………….. Questions & Answers???? Reference- “http://www.google.com/insidesearch/howsearchworks/thestory/”