SlideShare ist ein Scribd-Unternehmen logo
1 von 23
How Google search engine algorithm works 
Prepared by:- Viral Shah (120570107014) 
Guided by :- Prof. Sahista Machhar, MEFGI
It is a program that 
searches for and 
identifies items in a 
database that 
correspond to 
keywords or 
characters specified 
by the user, used 
especially for finding 
particular sites on the 
World Wide Web.
 There are 759 Million websites on the Web & 
60 Trillion webpages of this websites. 
 AND IT’S CONSTANTLY GROWING !!!!!
 GOOGLE navigates WEB by 
crawling. 
 To find information on the 
hundreds of millions of Web 
pages that exist, a search 
engine employs special 
software robots, called 
SPIDERS, to build lists of the 
words found on Web sites. 
When a spider is building its 
lists, the process is called 
Web crawling.
 The usual starting points are lists of heavily 
used servers and very popular pages. The 
spider will begin with a popular site, indexing 
the words on its pages and following every 
link found within the site. In this way, the 
spidering system quickly begins to travel, 
spreading out across the most widely used 
portions of the Web.
 When the Google spider looked at an HTML page, it took note of 
following things:- 
Words occurring in the title, subtitles, meta tags and other 
positions of relative importance were noted for special consideration 
during a subsequent user search. The Google spider was built to index 
every significant word on a page, leaving out the articles “a”, “an” and 
"the”. Other spiders take different approaches. 
 For example, some spiders will keep track of the words in the title, 
sub-headings and links, along with the 100 most frequently used 
words on the page and each word in the first 20 lines of text. Lycos is 
said to use this approach to spidering the Web. 
 GOOGLE built their initial system to use multiple spiders, usually three 
at one time. Each spider could keep about 300 connections to Web 
pages open at a time.
 Google’s spider name is Googlebot. 
 Googlebot is the search bot software used 
by Google, which collects documents from 
the web to build a searchable index for 
the Google Search engine.
 By following the web-pages, INDEX is 
prepared. The index includes text from 
millions of books from several libraries and 
other partners. 
 That means GOOGLE follow links from page 
to page. Also they sort pages by their content 
and other factors.
 These all activities Google carry out is tracked 
in the INDEX. Google continuously updates 
index and it is stored over large servers. 
 Currently, Google’s Index size is over 100 
million Gigabyte.
 Site owners choose whether their sites are 
crawled. 
 To prevent most search engine web 
crawlers from indexing a page on your site, place 
the following meta tag into the<head> section of 
your page: 
<meta name="robots" content="noindex"> 
 To prevent only Google web crawlers from 
indexing a page: 
<meta name="googlebot" content="noindex">
1) AUTOCOMPLETE 
Predicts what you might be searching for. 
This includes understanding terms with more 
than one meaning. 
2) SYNONYMS 
Recognizes words with similar meanings.
3) QUERY UNDERSTANDING 
Gets to the deeper meaning of the words 
you type. 
4) GOOGLE INSTANT 
Displays immediate results as you type. 
5) SPELLING 
Identifies and corrects possible spelling 
errors and provides alternatives.
 Based on all the above factors, Google picks 
some web-pages from the index. 
 Then, Google ranks the result on various 
factors. 
 1) Site & Page Quality:- 
It is checked by how you are writing 
key-words.
2) Freshness:- 
How much fresh the content is & at how 
much regular interval it is updated !! 
3) Safe-Search:- 
Google tries to find out how much it is safe 
and doesn’t contains spams. 
Along with these, there are 200+ factors used 
by Google to rank any particular webs-page.
 After all these operations, you will get the 
desired result and these all happens in one 
nano-second !!!
 Google fights with spam every second to give 
true & relevant result. 
 The majority of spam removal is 
automatic. Google examine other 
questionable documents by hand. If Google 
find spam, they take manual action.
1) PURE SPAM 
Site appears to use aggressive spam 
techniques such as automatically generated 
gibberish, cloaking, scraping content from 
other websites, and/or repeated or egregious 
violations of Google's Webmaster Guidelines. 
2) HIDDEN TEXT AND/OR KEYWORD STUFFING 
Some of the pages may contain hidden 
text and/or keyword stuffing.
3) USER-GENERATED SPAM 
Site appears to contain spammy user-generated 
content. The problematic content 
may appear on forum pages, guestbook pages, 
or user profiles. 
4) PARKED DOMAINS 
Parked domains are placeholder sites with little 
unique content, so Google doesn't typically 
include them in search results.
5) THIN CONTENT WITH LITTLE OR 
NO ADDED VALUE 
Site appears to consist of low-quality or shallow pages 
which do not provide users with much added value 
(such as thin affiliate pages, doorway pages, cookie-cutter 
sites, automatically generated content, or copied 
content). 
6) UNNATURAL LINKS TO A SITE 
Google has detected a pattern of unnatural artificial, 
deceptive or manipulative links pointing to the site. 
These may be the result of buying links that pass 
PageRank or participating in link schemes.
 Besides these all there are thousands other 
factors Google uses to detect Spam and 
decides the page-rank of web-page 
accordingly which is constantly updated and 
finally Google only keeps trusted documents 
in index.
 And the point of Interest is that to make 
presentation on google, I used
 Behind your simple page of results is a 
complex system, carefully crafted and 
tested, to support more than one-hundred 
billion searches each month !!!! 
How Google Search Algorithm Works ??

Weitere ähnliche Inhalte

Was ist angesagt?

ARTDM 171, Week 15: Search Engine Optimization (SEO)
ARTDM 171, Week 15: Search Engine Optimization (SEO)ARTDM 171, Week 15: Search Engine Optimization (SEO)
ARTDM 171, Week 15: Search Engine Optimization (SEO)Gilbert Guerrero
 
5 seo-fundamentals-on page optimization (part 2)-slides
5 seo-fundamentals-on page optimization (part 2)-slides5 seo-fundamentals-on page optimization (part 2)-slides
5 seo-fundamentals-on page optimization (part 2)-slidesMasterCode.vn
 
Learn the Search Engine Type and Its Functions!
Learn the Search Engine Type and Its Functions!Learn the Search Engine Type and Its Functions!
Learn the Search Engine Type and Its Functions!aashokkr
 
How to Get 5 Million Visitors to Your Website in 12 months
How to Get 5 Million Visitors to Your Website in 12 monthsHow to Get 5 Million Visitors to Your Website in 12 months
How to Get 5 Million Visitors to Your Website in 12 monthsSankar Datti
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniquesNirav Ranpara
 
SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012451 Marketing
 
Scrape box presentation
Scrape box presentationScrape box presentation
Scrape box presentationElephate1
 
Lucky Fabb SEO Presentation by Dave Cook
Lucky Fabb SEO Presentation by Dave CookLucky Fabb SEO Presentation by Dave Cook
Lucky Fabb SEO Presentation by Dave CookDave Cook
 
Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...joelmaster
 
Google search architecture services in Hyderabad
Google search architecture services in HyderabadGoogle search architecture services in Hyderabad
Google search architecture services in HyderabadMartin James
 
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
Redefining Technical SEO, #MozCon 2019 by Paul ShapiroRedefining Technical SEO, #MozCon 2019 by Paul Shapiro
Redefining Technical SEO, #MozCon 2019 by Paul ShapiroPaul Shapiro
 
Demystifying google hacks
Demystifying google hacksDemystifying google hacks
Demystifying google hacksdarwinah retno
 
40 tools for sourcing productivity #sosuasia
40 tools for sourcing productivity #sosuasia 40 tools for sourcing productivity #sosuasia
40 tools for sourcing productivity #sosuasia Balazs Paroczay
 
Search Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşSearch Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşAykut Aslantaş
 

Was ist angesagt? (20)

ARTDM 171, Week 15: Search Engine Optimization (SEO)
ARTDM 171, Week 15: Search Engine Optimization (SEO)ARTDM 171, Week 15: Search Engine Optimization (SEO)
ARTDM 171, Week 15: Search Engine Optimization (SEO)
 
Basic SEO
Basic SEO Basic SEO
Basic SEO
 
Search engine
Search engineSearch engine
Search engine
 
5 seo-fundamentals-on page optimization (part 2)-slides
5 seo-fundamentals-on page optimization (part 2)-slides5 seo-fundamentals-on page optimization (part 2)-slides
5 seo-fundamentals-on page optimization (part 2)-slides
 
Search engine
Search engineSearch engine
Search engine
 
Search engine
Search engineSearch engine
Search engine
 
Learn the Search Engine Type and Its Functions!
Learn the Search Engine Type and Its Functions!Learn the Search Engine Type and Its Functions!
Learn the Search Engine Type and Its Functions!
 
Search engine assistance
Search engine assistanceSearch engine assistance
Search engine assistance
 
Search engine optimization
Search engine optimizationSearch engine optimization
Search engine optimization
 
How to Get 5 Million Visitors to Your Website in 12 months
How to Get 5 Million Visitors to Your Website in 12 monthsHow to Get 5 Million Visitors to Your Website in 12 months
How to Get 5 Million Visitors to Your Website in 12 months
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
 
SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012
 
Scrape box presentation
Scrape box presentationScrape box presentation
Scrape box presentation
 
Lucky Fabb SEO Presentation by Dave Cook
Lucky Fabb SEO Presentation by Dave CookLucky Fabb SEO Presentation by Dave Cook
Lucky Fabb SEO Presentation by Dave Cook
 
Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...
 
Google search architecture services in Hyderabad
Google search architecture services in HyderabadGoogle search architecture services in Hyderabad
Google search architecture services in Hyderabad
 
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
Redefining Technical SEO, #MozCon 2019 by Paul ShapiroRedefining Technical SEO, #MozCon 2019 by Paul Shapiro
Redefining Technical SEO, #MozCon 2019 by Paul Shapiro
 
Demystifying google hacks
Demystifying google hacksDemystifying google hacks
Demystifying google hacks
 
40 tools for sourcing productivity #sosuasia
40 tools for sourcing productivity #sosuasia 40 tools for sourcing productivity #sosuasia
40 tools for sourcing productivity #sosuasia
 
Search Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşSearch Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut Aslantaş
 

Andere mochten auch

Andere mochten auch (9)

Working of search engine
Working of search engineWorking of search engine
Working of search engine
 
Smart crawler a two stage crawler
Smart crawler a two stage crawlerSmart crawler a two stage crawler
Smart crawler a two stage crawler
 
Working of a Web Crawler
Working of a Web CrawlerWorking of a Web Crawler
Working of a Web Crawler
 
Smart Crawler
Smart CrawlerSmart Crawler
Smart Crawler
 
Web crawler
Web crawlerWeb crawler
Web crawler
 
Working Of Search Engine
Working Of Search EngineWorking Of Search Engine
Working Of Search Engine
 
The Layman's Guide to Microsoft Azure
The Layman's Guide to Microsoft AzureThe Layman's Guide to Microsoft Azure
The Layman's Guide to Microsoft Azure
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
 
How Google Works
How Google WorksHow Google Works
How Google Works
 

Ähnlich wie How Google Search Algorithm Works ??

Search Engine Optimization (Seo)
Search Engine Optimization (Seo)Search Engine Optimization (Seo)
Search Engine Optimization (Seo)ssunnysengar
 
Google Search Engine
Google Search Engine Google Search Engine
Google Search Engine Aniket_1415
 
Search Engine Optimization
Search Engine OptimizationSearch Engine Optimization
Search Engine Optimizationshrishail uttagi
 
Search Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEOSearch Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEONeeraj Reddy
 
Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Nate Plaunt
 
The Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineThe Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineManish Chopra
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
 
Effective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerEffective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerIJMER
 
Lost in the Net: Navigating Search Engines
Lost in the Net:  Navigating Search EnginesLost in the Net:  Navigating Search Engines
Lost in the Net: Navigating Search EnginesJohan Koren
 
How google works and functions: A complete Approach
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete ApproachPrakhar Gethe
 
Demand Quest SEO training session 2
Demand Quest SEO training session 2Demand Quest SEO training session 2
Demand Quest SEO training session 2Nate Plaunt
 
Latest Updates on SEO
Latest Updates on SEOLatest Updates on SEO
Latest Updates on SEOshailaja100
 
Search Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week ThreeSearch Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week Threepaulwould
 
Basic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must KnowBasic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must Knowwaqas ahmad
 
Crawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo RamazzottiCrawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo RamazzottiGimasi Sa
 
Il processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo RamazzottiIl processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo RamazzottiPaolo Ramazzotti
 
The ultimate guide to the invisible web
The ultimate guide to the invisible webThe ultimate guide to the invisible web
The ultimate guide to the invisible webYKNIB O
 

Ähnlich wie How Google Search Algorithm Works ?? (20)

Search Engine Optimization (Seo)
Search Engine Optimization (Seo)Search Engine Optimization (Seo)
Search Engine Optimization (Seo)
 
Google Search Engine
Google Search Engine Google Search Engine
Google Search Engine
 
Search Engine Optimization
Search Engine OptimizationSearch Engine Optimization
Search Engine Optimization
 
Seo Manual
Seo ManualSeo Manual
Seo Manual
 
Search Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEOSearch Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEO
 
Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2
 
The Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineThe Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search Engine
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
 
Effective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerEffective Searching Policies for Web Crawler
Effective Searching Policies for Web Crawler
 
Lost in the Net: Navigating Search Engines
Lost in the Net:  Navigating Search EnginesLost in the Net:  Navigating Search Engines
Lost in the Net: Navigating Search Engines
 
Search engine
Search engineSearch engine
Search engine
 
How google works and functions: A complete Approach
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete Approach
 
Demand Quest SEO training session 2
Demand Quest SEO training session 2Demand Quest SEO training session 2
Demand Quest SEO training session 2
 
Latest Updates on SEO
Latest Updates on SEOLatest Updates on SEO
Latest Updates on SEO
 
Search Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week ThreeSearch Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week Three
 
Basic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must KnowBasic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must Know
 
Search engine
Search engineSearch engine
Search engine
 
Crawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo RamazzottiCrawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo Ramazzotti
 
Il processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo RamazzottiIl processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo Ramazzotti
 
The ultimate guide to the invisible web
The ultimate guide to the invisible webThe ultimate guide to the invisible web
The ultimate guide to the invisible web
 

Kürzlich hochgeladen

Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 

Kürzlich hochgeladen (20)

Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 

How Google Search Algorithm Works ??

  • 1. How Google search engine algorithm works Prepared by:- Viral Shah (120570107014) Guided by :- Prof. Sahista Machhar, MEFGI
  • 2. It is a program that searches for and identifies items in a database that correspond to keywords or characters specified by the user, used especially for finding particular sites on the World Wide Web.
  • 3.  There are 759 Million websites on the Web & 60 Trillion webpages of this websites.  AND IT’S CONSTANTLY GROWING !!!!!
  • 4.  GOOGLE navigates WEB by crawling.  To find information on the hundreds of millions of Web pages that exist, a search engine employs special software robots, called SPIDERS, to build lists of the words found on Web sites. When a spider is building its lists, the process is called Web crawling.
  • 5.  The usual starting points are lists of heavily used servers and very popular pages. The spider will begin with a popular site, indexing the words on its pages and following every link found within the site. In this way, the spidering system quickly begins to travel, spreading out across the most widely used portions of the Web.
  • 6.  When the Google spider looked at an HTML page, it took note of following things:- Words occurring in the title, subtitles, meta tags and other positions of relative importance were noted for special consideration during a subsequent user search. The Google spider was built to index every significant word on a page, leaving out the articles “a”, “an” and "the”. Other spiders take different approaches.  For example, some spiders will keep track of the words in the title, sub-headings and links, along with the 100 most frequently used words on the page and each word in the first 20 lines of text. Lycos is said to use this approach to spidering the Web.  GOOGLE built their initial system to use multiple spiders, usually three at one time. Each spider could keep about 300 connections to Web pages open at a time.
  • 7.  Google’s spider name is Googlebot.  Googlebot is the search bot software used by Google, which collects documents from the web to build a searchable index for the Google Search engine.
  • 8.  By following the web-pages, INDEX is prepared. The index includes text from millions of books from several libraries and other partners.  That means GOOGLE follow links from page to page. Also they sort pages by their content and other factors.
  • 9.  These all activities Google carry out is tracked in the INDEX. Google continuously updates index and it is stored over large servers.  Currently, Google’s Index size is over 100 million Gigabyte.
  • 10.  Site owners choose whether their sites are crawled.  To prevent most search engine web crawlers from indexing a page on your site, place the following meta tag into the<head> section of your page: <meta name="robots" content="noindex">  To prevent only Google web crawlers from indexing a page: <meta name="googlebot" content="noindex">
  • 11. 1) AUTOCOMPLETE Predicts what you might be searching for. This includes understanding terms with more than one meaning. 2) SYNONYMS Recognizes words with similar meanings.
  • 12. 3) QUERY UNDERSTANDING Gets to the deeper meaning of the words you type. 4) GOOGLE INSTANT Displays immediate results as you type. 5) SPELLING Identifies and corrects possible spelling errors and provides alternatives.
  • 13.  Based on all the above factors, Google picks some web-pages from the index.  Then, Google ranks the result on various factors.  1) Site & Page Quality:- It is checked by how you are writing key-words.
  • 14. 2) Freshness:- How much fresh the content is & at how much regular interval it is updated !! 3) Safe-Search:- Google tries to find out how much it is safe and doesn’t contains spams. Along with these, there are 200+ factors used by Google to rank any particular webs-page.
  • 15.  After all these operations, you will get the desired result and these all happens in one nano-second !!!
  • 16.  Google fights with spam every second to give true & relevant result.  The majority of spam removal is automatic. Google examine other questionable documents by hand. If Google find spam, they take manual action.
  • 17. 1) PURE SPAM Site appears to use aggressive spam techniques such as automatically generated gibberish, cloaking, scraping content from other websites, and/or repeated or egregious violations of Google's Webmaster Guidelines. 2) HIDDEN TEXT AND/OR KEYWORD STUFFING Some of the pages may contain hidden text and/or keyword stuffing.
  • 18. 3) USER-GENERATED SPAM Site appears to contain spammy user-generated content. The problematic content may appear on forum pages, guestbook pages, or user profiles. 4) PARKED DOMAINS Parked domains are placeholder sites with little unique content, so Google doesn't typically include them in search results.
  • 19. 5) THIN CONTENT WITH LITTLE OR NO ADDED VALUE Site appears to consist of low-quality or shallow pages which do not provide users with much added value (such as thin affiliate pages, doorway pages, cookie-cutter sites, automatically generated content, or copied content). 6) UNNATURAL LINKS TO A SITE Google has detected a pattern of unnatural artificial, deceptive or manipulative links pointing to the site. These may be the result of buying links that pass PageRank or participating in link schemes.
  • 20.  Besides these all there are thousands other factors Google uses to detect Spam and decides the page-rank of web-page accordingly which is constantly updated and finally Google only keeps trusted documents in index.
  • 21.  And the point of Interest is that to make presentation on google, I used
  • 22.  Behind your simple page of results is a complex system, carefully crafted and tested, to support more than one-hundred billion searches each month !!!! 