Build your Own Search Service



     Chris Heilmann
     Saurabh Sahni


HackU: IIT Bombay, 5th Feb 2009
Outline



   •  Search engines using BOSS
   •  About BOSS API
      –  What?
      –  Why?
      –  Features
   •  How to use it
      –  BOSS API
      –  BOSS Mashup framework




                                 -2-
Search engines
using BOSS



 -3-
hakia: http://hakia.com/




                           -4-
Cluuz: http://cluuz.com




                          -7-
Keyword finder - http://keywordfinder.org/




                                - 10 -
askBOSS: http://ask-boss.appspot.com/




                             - 11 -
About BOSS API



 - 16 -
What?




  •  Open Yahoo’s core search features via web services to
     let 3rd parties revolutionize Search


  •  Unrestricted



        http://developer.yahoo.com/search/boss



                             - 17 -
Opening the search technology stack




  [Diagram: the search technology stack, with stages labeled Crawl, Extract,
   Analyze, Spam <-> Gold, Web Map, Index, Usage, Assist, Retrieve and Rank]

    50B pages * 20ms page download = 31 years
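
The figure above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming pages are downloaded strictly one after another with no parallelism (the slide does not state this explicitly):

```python
# Back-of-the-envelope check of "50B pages * 20ms page download = 31 years".
# Assumption: pages are fetched sequentially, one at a time.
PAGES = 50_000_000_000                    # 50 billion pages
MS_PER_PAGE = 20                          # 20 ms download time per page
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60  # average year length in seconds

total_seconds = PAGES * MS_PER_PAGE / 1000
years = total_seconds / SECONDS_PER_YEAR
print(f"{total_seconds:.0f} seconds is about {years:.1f} years")
# prints "1000000000 seconds is about 31.7 years"
```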


                                      - 18 -
Opening the search technology stack

  [Diagram: the search technology stack (Crawl, Extract, Analyze, Spam <-> Gold,
   Web Map, Index, Usage, Assist, Retrieve, Rank) with a Web API layered on top
   and "Your App here" above the API]

    50B pages * 20ms page download = 31 years


                                      - 19 -
Why?

  •  Removes entry barriers
       –  massive capital investment
       –  access to top technical talent

  •  Asset to Innovate
       –  Develop new relevance models
           • Leverage user insights
           • Use tags, bookmarks
       –  Change presentation style
  •  Search anywhere
       –  Improve Vertical Quality w/ Web comprehensiveness
       –  Fragment the market, foster more players, choice, competition

                                      - 20 -
BOSS API features



   •  Unlimited queries per day
   •  No branding or attribution
   •  No restrictions on presentation
   •  Ability to re-order results and blend in additional content
   •  Access to multiple verticals (web search, image, news)
   •  Spell checks, keyword suggestions
   •  40+ supported language and region pairs
   •  Ability to monetize




                                 - 21 -
How to use it?



- 22 -
Get Started



  •  Register for an application id
     http://developer.yahoo.com/wsregapp/

  •  Documentation
     http://developer.yahoo.com/search/boss/boss_guide/

  •  Code samples: JavaScript, PHP and Python
     http://www.saurabhsahni.com/boss-examples.zip




                             - 23 -
BOSS API



  Searching
  Slumdog
  Millionaire




                         (Source: http://en.wikipedia.org/wiki/File:Slumdog_Millionaire_poster.jpg)


                - 24 -
BOSS API



  •  Search for slumdog millionaire:
    –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&format=xml
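
As a rough sketch of how the request URL above is put together: the query is URL-encoded into the path and the parameters follow as a query string. The helper name `web_search_url` is made up for illustration, and `xyz` stands in for a real application id, as on the slide.

```python
from urllib.parse import quote_plus, urlencode

BOSS_WEB = "http://boss.yahooapis.com/ysearch/web/v1/"

def web_search_url(query, appid, fmt="xml"):
    """Build a BOSS web search URL (hypothetical helper, not part of any SDK)."""
    # quote_plus turns spaces into '+', giving slumdog+millionaire as on the slide.
    path = quote_plus(query)
    params = urlencode({"appid": appid, "format": fmt})
    return f"{BOSS_WEB}{path}?{params}"

print(web_search_url("slumdog millionaire", "xyz"))
```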




                        - 25 -
BOSS API: XML response
http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&format=xml
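
The response shown on this slide is not reproduced in the text. The sketch below only illustrates how such an XML response could be read with Python's standard library. The sample document is a made-up stand-in: its element names (`ysearchresponse`, `resultset_web`, `result`, `title`, `url`) and values are assumptions, and a real response may also carry an XML namespace, in which case the tag names would need to be qualified.

```python
import xml.etree.ElementTree as ET

# Made-up sample; element names and values are assumptions, not real output.
SAMPLE = """<ysearchresponse responsecode="200">
  <resultset_web count="2" start="0">
    <result><title>Example title one</title><url>http://example.com/1</url></result>
    <result><title>Example title two</title><url>http://example.com/2</url></result>
  </resultset_web>
</ysearchresponse>"""

def extract_results(xml_text):
    """Return (title, url) pairs for every <result> element in the document."""
    root = ET.fromstring(xml_text)
    return [(r.findtext("title"), r.findtext("url")) for r in root.iter("result")]

for title, url in extract_results(SAMPLE):
    print(title, url)
```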




                                       - 26 -
BOSS API




  •  Exact search for “slumdog millionaire”
     –  http://boss.yahooapis.com/ysearch/web/v1/%22slumdog+millionaire%22?appid=xyz&format=xml
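
The `%22` in the URL above is simply a URL-encoded double quote. A small sketch of the encoding step, using only the standard library:

```python
from urllib.parse import quote_plus

# Wrapping the query in double quotes asks for an exact-phrase match.
phrase = '"slumdog millionaire"'
encoded = quote_plus(phrase)
print(encoded)  # prints %22slumdog+millionaire%22
```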




                               - 27 -
BOSS API



  •  Search for slumdog millionaire only on indiatimes.com:
     –  Add site:indiatimes.com to your query
     –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire+site%3Aindiatimes.com?appid=xyz&format=xml


  •  Search for slumdog millionaire on selected movie sites
     –  Add param sites=indiatimes.com,movies.yahoo.com,imdb.com
     –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&sites=indiatimes.com%2Cmovies.yahoo.com&format=xml
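
The two ways of restricting a search to particular sites can be sketched side by side. The variable names are illustrative only, and `xyz` is the placeholder application id from the slide:

```python
from urllib.parse import quote_plus, urlencode

BASE = "http://boss.yahooapis.com/ysearch/web/v1/"

# Option 1: put the site: operator inside the query itself.
single_site = (
    BASE
    + quote_plus("slumdog millionaire site:indiatimes.com")
    + "?"
    + urlencode({"appid": "xyz", "format": "xml"})
)

# Option 2: pass a comma-separated list through the sites parameter.
sites = ["indiatimes.com", "movies.yahoo.com"]
multi_site = (
    BASE
    + quote_plus("slumdog millionaire")
    + "?"
    + urlencode({"appid": "xyz", "sites": ",".join(sites), "format": "xml"})
)

print(single_site)
print(multi_site)
```

Note that the colon becomes `%3A` and the comma becomes `%2C` once encoded, which matches the URLs on the slide.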




                                   - 28 -
http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&sites=indiatimes.com%2Cmovies.yahoo.com&format=xml




                               - 29 -
BOSS API



  •  Find related keywords
     –  Add parameter view=keyterms
     –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&view=keyterms&format=xml




                                - 30 -
http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&view=keyterms&format=xml




                                                - 31 -
BOSS API




  •  Search images
     –  http://boss.yahooapis.com/ysearch/images/v1/slumdog+millionaire?dimensions=small




                              - 32 -
http://boss.yahooapis.com/ysearch/images/v1/slumdog+millionaire?dimensions=small




                             - 33 -
BOSS API




  •  Search news
     –  http://boss.yahooapis.com/ysearch/news/v1/slumdog+millionaire?age=15d




                              - 34 -
http://boss.yahooapis.com/ysearch/news/v1/slumdog+millionaire?age=15d





                              - 35 -
BOSS API

 Spell check request

 http://boss.yahooapis.com/ysearch/spelling/v1/milionare?format=xml



 Response




                           - 36 -
BOSS API REST Interface

  http://boss.yahooapis.com/ysearch/{vert}/v1/{query}

  •  {query}: term to look for (url-encoded)
  •  {vert} := {web, news, images, spelling}
  •  @ required
     –  appid
  •  @ optional
     –  start, count, lang, region, format, callback, sites
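
The REST pattern above can be sketched as one small URL builder covering all four verticals. The function name `boss_url` is made up for illustration and is not part of any official SDK. The image and news URLs on the earlier slides omit `appid`; since it is listed here as required, this sketch always includes it. Extra parameters are passed through unchanged because the earlier slides also use vertical-specific ones such as `age`.

```python
from urllib.parse import quote_plus, urlencode

VERTICALS = {"web", "news", "images", "spelling"}

def boss_url(vertical, query, appid, **params):
    """Build http://boss.yahooapis.com/ysearch/{vert}/v1/{query}?... (hypothetical helper)."""
    if vertical not in VERTICALS:
        raise ValueError(f"unknown vertical: {vertical!r}")
    # appid is required; everything else (start, count, lang, ...) is optional.
    query_string = urlencode({"appid": appid, **params})
    return f"http://boss.yahooapis.com/ysearch/{vertical}/v1/{quote_plus(query)}?{query_string}"

# Second page of ten web results, then recent news for the same query.
print(boss_url("web", "slumdog millionaire", "xyz", start=10, count=10, format="xml"))
print(boss_url("news", "slumdog millionaire", "xyz", age="15d"))
```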




                                   - 37 -
BOSS Mashup Framework



  •  Python (v2.5+) library

  •  BOSS Search SDK plus …

  •  SQL for remixing arbitrary XML/JSON sources




         http://developer.yahoo.com/search/boss/mashup.html




                               - 38 -
BMF + Google App Engine



  •  Enhanced version of BMF for the GAE platform
  •    http://zooie.wordpress.com/2008/08/04/yahoo-boss-google-app-engine-integrated/




  •  Enables quick deployment of BOSS applications online




                                               - 39 -
One more thing…




 - 40 -
BOSS in Academic Research



  •  The biggest dataset available on the web
  •  Very useful for Web-mining research experiments
     –    Natural language processing
     –    Semantic extraction
     –    Related keywords
     –    Similarity detection
     –    Clustering algorithms
     –    Spelling corrections




                                    - 41 -
Thank You




                              Questions?
        More: http://developer.yahoo.com/search/boss/




                           - 42 -
Appendix



- 43 -
Search UI Templates are Included in the
BOSS Mashup Framework




                       http://www.yahoo.com




    BOSS Mashup Framework simplifies aggregating and presenting multiple data sources

                                              - 44 -
BMF Features


  •  select, group, sort, union, joins, udfs, where
  •  Text normalization and duplicate removal
  •  Auto-transformation of resource-oriented API results
     into tables w/o parsing
  •  All-in-memory storage and retrieval operations
  •  Ability to join lists of tables via an arbitrary predicate
     function (map-like)
  •  Search UI template framework
  •  Single search function provides total access to
     BOSS REST API
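
To make the idea of joining tables via an arbitrary predicate concrete, here is a sketch in plain Python. It does not use the BMF API: the `join` function, the row layout (lists of dicts) and the sample data are all assumptions made for illustration.

```python
def join(left, right, predicate):
    """Nested-loop join: merge every pair of rows for which predicate(l, r) is true."""
    return [{**l, **r} for l in left for r in right if predicate(l, r)]

# Made-up in-memory "tables": web search results and bookmark counts.
web_results = [
    {"title": "Slumdog Millionaire review", "url": "http://example.com/review"},
    {"title": "Slumdog Millionaire cast", "url": "http://example.com/cast"},
]
bookmarks = [
    {"bookmark_url": "http://example.com/cast", "saves": 12},
]

# Any function of two rows can act as the join condition.
joined = join(web_results, bookmarks, lambda w, b: w["url"] == b["bookmark_url"])
for row in joined:
    print(row["title"], row["saves"])
```

Because the predicate is an ordinary function, the same `join` could match rows on a fuzzy title comparison instead of an exact URL match.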

                                - 45 -
