Search, APIs, Capability Management and the Sensis Journey

Craig Rees

•    Project background

•    Platform selection

•    Search capability

•    Relevance

•    Architecture

•    Quality management

•    Hurdles

•    What’s next


    Today’s menu
• Sensis helps Australians find, buy and sell

• From print directories to a cross-platform lead generator

• Sensis publishes over 1.8 million business listings

• Two of the top 10 visited online sites in Australia (WhitePages.com.au and YellowPages.com.au)


Sensis
Business objectives
•   Drive presence in the local search marketplace
•   Open up the largest database of business listings in Australia
•   Reduce the effort required from local search developers
•   Free to use; we are after the reporting

Technology objectives
•   Develop a total search platform
•   Relevancy testing as part of the development lifecycle
•   A framework to identify problem spaces
•   Manageable platform
•   Continuous deployments


Project background
Developer portal
•   Support for the search
    capability team

•   Structured vs unstructured data

•   Deterministic vs black box

•   Non-proprietary code base

•   Community backing




    Platform selection
Lvl 5   Optimized
        • A/B testing
        • Machine learning
        • External collaboration
        • Multiple contexts

Lvl 4   Managed
        • Online dashboards
        • Test environments
        • Dynamic search refinements
        • Targets and metrics

Lvl 3   Monitored
        • Defined team
        • Regular monitoring
        • Static autosuggest
        • Basic linguistics

Lvl 2   Ad hoc
        • Ad hoc processes
        • Part-time team
        • Static dictionaries
        • Individual-led innovation

Lvl 1   Unmanaged
        • No resources
        • No reporting
        • Out-of-the-box features


The Sensis Search capability maturity model
*Courtesy of Pete Crawford & Craig Lonsdale
[Diagram: dimensions of search context]
• Location
• Intent
    • Name
    • Type
    • Product
    • Spatial
• Chronology
• Social Graph
• Device
• Individual


Context is key
[Architecture diagram. Labelled components: Business Data (shown twice), Historical search Data, Ontologies, Geo Service, MongoDB, Index, Solr (Name Query Handler, Type Query Handler), API, Mashery (Search Service, Reporting Service), Publisher, Reporting Events]

Our architecture
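
The diagram names two Solr query handlers, one for name queries and one for type queries. As a rough illustration only, the sketch below shows how a caller might address a named Solr request handler by path. The handler paths (`/name`, `/type`), the core name `listings` and the host are assumptions made for this example; the slides do not give the actual configuration. `q`, `rows` and `wt` are standard Solr query parameters.

```python
from urllib.parse import urlencode

# Hypothetical handler paths. The slides name a "Name Query Handler" and a
# "Type Query Handler" but do not show how they were registered in Solr.
HANDLERS = {"name": "/name", "type": "/type"}


def build_solr_url(base_url: str, core: str, intent: str, query: str, rows: int = 10) -> str:
    """Build the request URL for the handler that matches the detected intent."""
    handler = HANDLERS[intent]  # raises KeyError for an unknown intent
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return f"{base_url}/{core}{handler}?{params}"


# Host and core name are placeholders for illustration.
url = build_solr_url("http://localhost:8983/solr", "listings", "name", "pizza melbourne")
print(url)
```

Keeping one handler per intent means relevance settings for name searches can be tuned without touching type searches, which fits the "manageable platform" objective stated earlier in the deck.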
[Architecture diagram repeated]

Data staging
[Architecture diagram repeated]

Search
[Architecture diagram repeated]

API
[Architecture diagram repeated]

API proxy
[Timeline graphic: Yesterday, Today, Tomorrow]

• Moved from a black box solution to a manageable platform
• Deliver search improvements without major code changes
• Understand how results were calculated
• Identify problems scientifically
• Continuously tune and test relevance


Evolution of search management
• Path analysis used to identify problem spaces
• Specific gold sets for each problem space:
    • Intent
    • Spelling & stemming
    • Location
    • Phrase parsing
• “Gold sets” used to define overall quality score (TREC)
• Features signed off only when they make a positive impact on the quality score

Problem spaces, quality management & tuning
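
The gold-set idea above can be sketched in a few lines. The slide does not say which TREC-style measure the quality score used, so mean average precision is assumed here, and the queries, listing ids and function names are all invented for the example.

```python
from typing import Callable, Dict, List, Set

GoldSet = Dict[str, Set[str]]  # query -> ids of listings judged relevant


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average of the precision values at each rank where a relevant id appears."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def quality_score(gold: GoldSet, search: Callable[[str], List[str]]) -> float:
    """Mean average precision over every query in the gold set."""
    if not gold:
        return 0.0
    return sum(average_precision(search(q), rel) for q, rel in gold.items()) / len(gold)


def sign_off(baseline: float, candidate: float) -> bool:
    """A feature is signed off only if it improves the quality score."""
    return candidate > baseline


# Made-up gold set and result lists, for illustration only.
gold = {"plumber": {"a", "b"}, "pizza": {"c"}}
before = {"plumber": ["x", "a", "b"], "pizza": ["c"]}
after = {"plumber": ["a", "b", "x"], "pizza": ["c"]}

baseline_score = quality_score(gold, lambda q: before[q])
candidate_score = quality_score(gold, lambda q: after[q])
print(baseline_score, candidate_score, sign_off(baseline_score, candidate_score))
```

Running one such score per problem space (intent, spelling and stemming, location, phrase parsing) would show which space a change helps or hurts, rather than only the overall figure.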
Search quality analysis and testing
Results examiner
Score analysis
Tuning
Lather, rinse, repeat
• Data redundancy and homogeneity
• Solr ranking of rare terms
• Intent differentiation
• Contextual synonyms


Hurdles along the way
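
One plausible reading of the "Solr ranking of rare terms" hurdle is the effect of inverse document frequency (IDF). The slides do not say which similarity Sensis ran, so the sketch below assumes Lucene's classic TF-IDF formula, idf = 1 + ln(numDocs / (docFreq + 1)). The index size echoes the "over 1.8 million listings" figure from the deck; the document frequencies are invented.

```python
import math


def classic_idf(doc_freq: int, num_docs: int) -> float:
    """IDF as defined by Lucene's classic TF-IDF similarity (assumed here)."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


NUM_DOCS = 1_800_000  # index size taken from the "1.8 million listings" slide

# Invented document frequencies, for illustration only.
common_term_df = 20_000  # e.g. a popular business type
rare_term_df = 3         # e.g. an unusual or misspelt word in a business name

common_idf = classic_idf(common_term_df, NUM_DOCS)
rare_idf = classic_idf(rare_term_df, NUM_DOCS)
print(f"common: {common_idf:.2f}  rare: {rare_idf:.2f}")
```

Under this formula a term that appears in only a handful of listings gets a much larger weight than a common one, so a single rare word in a multi-term query can dominate the ranking.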
•   Query engine
•   Facets / autosuggest
•   Real-time tuning
•   Machine learning
•   Multi-term queries
•   Scoring thresholds
•   Content value


Where next?
Email: craig.rees@sensis.com.au
www: developers.sensis.com.au
Twitter: @SensisAPI, @ablebagel


Questions?
