Contextual Search: personalization Machine Learning based ‘search’ algorithm selects the main story and the three alternate stories based on the users demographics (age, gender etc.) and previous behavior. Display advertizing is a similar top-1 search problem on the collection of advertisements.
Aggregation across different dimensions Hyperlocal: showing content from across Yahoo that is relevant to a particular neighbourhood.
Direct answers in search Information box with content from and links to Yahoo! Travel Points of interest in Vienna, Austria Since Aug, 2010, ‘regular’ search results are ‘Powered by Bing’ Products from Yahoo! Shopping
This is not a business model http://en.wikipedia.org/wiki/Underpants_Gnomes
Semantics at every step of the IR process bla bla bla? q=“bla” * 3 Document processing bla bla bla Indexing Ranking Query interpretation Result presentation The IR engine The Web bla bla bla bla bla bla “ bla” θ (q,d)
Linked Data: interlinked RDF documents example:roi “ Roi Blanco” name foaf:Person sameAs example:roi2 worksWith example:peter “ pmika@yahoo-inc.com” email type type Roi’s homepage Yahoo Friend-of-a-Friend ontology
RDFa: metadata embedded in HTML … <p typeof=”foaf:Person" about="http://example.org/roi"> <span property=”foaf:name” >Roi Blanco</span>. <a rel=”owl:sameAs" href="http://research.yahoo.com/roi"> Roi Blanco </a>. You can contact him at <a rel=”foaf:mbox" href="mailto:roi@yahoo-inc.com"> via email </ a>. </p> ... Roi’s homepage
In fact, some of these searches are so hard that the users don’t even try them anymore
With ads, the situation is even worse due to the sparsity problem. Note how poor the ads are…
Search is a form of content aggregation
Semantic search can be seen as a retrieval paradigm Centered on the use of semantics Incorporates the semantics entailed by the query and (or) the resources into the matching process, it essentially performs semantic search.
Bar celona
Close to the topic of keyword-search in databases, except knowledge-bases have a schema-oblivious design Different papers assume vastly different query needs even on the same type of data