This keynote presentation describes the critical role that search and Lucene have in building next-generation products that understand reputation and relevance. We also describe how data science and machine learning have been applied at LinkedIn to collect, interpret, and index data around topical reputation.
Lucene Revolution is the biggest open source conference dedicated to Apache Lucene/Solr.
29. Lead designer and engineer for the implementation of a user-centric, fully-configurable UI for data aggregation and reporting. Developed over 20 SaaS custom applications using Python, Javascript and RoR.
Tagging Skill Phrases
Tagging: Extract potential skill phrases from text
Standardize unambiguous phrase variants
Tagged phrases: JavaScript, RoR, SaaS, Python
Phrase variants standardized to "Ruby on Rails": ror, rubyonrails, ruby on rails development, ruby rails, ruby on rail
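The standardization step above can be sketched as a lookup from normalized variants to one canonical name. This is a minimal illustration, not LinkedIn's implementation: the `CANONICAL` table, `normalize`, and `standardize` are hypothetical names, and the table holds only the variants listed on the slide.

```python
import re
from typing import Optional

# Hypothetical variant table built from the slide's example only;
# a production system would curate or learn a far larger mapping.
CANONICAL = {
    "ror": "Ruby on Rails",
    "rubyonrails": "Ruby on Rails",
    "ruby on rails development": "Ruby on Rails",
    "ruby rails": "Ruby on Rails",
    "ruby on rail": "Ruby on Rails",
    "ruby on rails": "Ruby on Rails",
}


def normalize(phrase: str) -> str:
    """Lowercase the phrase and collapse punctuation/whitespace to single spaces."""
    return re.sub(r"[^a-z0-9]+", " ", phrase.lower()).strip()


def standardize(phrase: str) -> Optional[str]:
    """Return the canonical skill name, or None when the variant is unknown."""
    return CANONICAL.get(normalize(phrase))
```

Returning `None` for unknown phrases keeps ambiguous or unseen variants out of the standardized set, in line with the slide's restriction to unambiguous variants.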
Document (ex: Profile)
Tokenization
Skills Tagger
Phrases (up to 6 words)
Skills Classifier
Skills (unordered)
Skills (ranked by relevance)
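A rough sketch of the tagging stages named in the diagram: tokenize the document, enumerate candidate phrases of up to six words, and keep the ones that match a known skill. The dictionary lookup here is an assumption that stands in for the Skills Tagger and Skills Classifier, whose internals the slide does not describe; `KNOWN_SKILLS`, `tokenize`, `candidate_phrases`, and `tag_skills` are hypothetical names, and no relevance ranking is attempted.

```python
import re

MAX_PHRASE_WORDS = 6  # the slide caps candidate phrases at six words

# Hypothetical skill dictionary: normalized phrase -> display name.
KNOWN_SKILLS = {
    "python": "Python",
    "javascript": "JavaScript",
    "ror": "RoR",
    "saas": "SaaS",
    "ruby on rails": "Ruby on Rails",
}


def tokenize(text):
    """Split text into lowercase tokens of letters, digits, '+' and '#'."""
    return re.findall(r"[a-z0-9+#]+", text.lower())


def candidate_phrases(tokens, max_words=MAX_PHRASE_WORDS):
    """Yield every contiguous run of 1..max_words tokens as a phrase."""
    for start in range(len(tokens)):
        last = min(start + max_words, len(tokens))
        for end in range(start + 1, last + 1):
            yield " ".join(tokens[start:end])


def tag_skills(text):
    """Return the set of known skills whose phrase occurs in the text."""
    return {
        KNOWN_SKILLS[phrase]
        for phrase in candidate_phrases(tokenize(text))
        if phrase in KNOWN_SKILLS
    }


sample = ("Developed over 20 SaaS custom applications using Python, "
          "Javascript and RoR.")
skills = tag_skills(sample)
```

On the slide's sample sentence, `skills` contains the four phrases shown as tagged: SaaS, Python, JavaScript, and RoR.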
31. Skill Inference
How suggested/inferred skills work:
– The likelihood of each skill is modeled conditionally on profile attributes
– Probabilities are combined using a Naïve Bayes Classifier
If you are an engineer at Apple, you probably know about iPhone Development.
Profile
Extract attributes: Company ID, Title ID, Groups ID, Industry ID, …
Skills Classifier
Skills (ranked by likelihood)
Feature Vectors
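The inference idea can be illustrated with a small categorical Naive Bayes model that scores each skill from profile attributes and ranks skills by likelihood. This is a sketch under stated assumptions, not the model from the talk: the class `SkillNaiveBayes`, the Laplace smoothing constant `alpha`, the single-valued `company` and `title` attributes, and the three training profiles are all invented for illustration.

```python
import math
from collections import Counter, defaultdict


class SkillNaiveBayes:
    """Categorical Naive Bayes: P(skill | attrs) is proportional to
    P(skill) times the product of P(value | skill) over the attributes."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing (an assumption, not from the talk)
        self.skill_counts = Counter()
        self.value_counts = defaultdict(Counter)  # (skill, attribute) -> values
        self.values_seen = defaultdict(set)       # attribute -> distinct values

    def fit(self, profiles):
        """profiles: iterable of (attributes dict, list of skills) pairs."""
        for attributes, skills in profiles:
            for skill in skills:
                self.skill_counts[skill] += 1
                for name, value in attributes.items():
                    self.value_counts[(skill, name)][value] += 1
                    self.values_seen[name].add(value)
        return self

    def rank(self, attributes):
        """Return all known skills, most likely first, for the given attributes."""
        total = sum(self.skill_counts.values())
        scores = {}
        for skill, n in self.skill_counts.items():
            score = math.log(n / total)  # log prior
            for name, value in attributes.items():
                k = len(self.values_seen.get(name, ())) or 1
                seen = self.value_counts.get((skill, name), Counter())[value]
                score += math.log((seen + self.alpha) / (n + self.alpha * k))
            scores[skill] = score
        return sorted(scores, key=scores.get, reverse=True)


# Invented training data, for illustration only.
training = [
    ({"company": "Apple", "title": "Engineer"}, ["iPhone Development", "Objective-C"]),
    ({"company": "Apple", "title": "Engineer"}, ["iPhone Development"]),
    ({"company": "Acme", "title": "Designer"}, ["Photoshop"]),
]
model = SkillNaiveBayes().fit(training)
ranked = model.rank({"company": "Apple", "title": "Engineer"})
```

With this toy data, `ranked` places "iPhone Development" first for an engineer at Apple, mirroring the slide's example. Working in log space avoids underflow when many attribute probabilities are multiplied together.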