Josh Wills - Data Labeling as Religious Experience

•

4 gefällt mir•630 views

The document discusses obtaining labeled data and introduces weak supervision as an alternative to full manual labeling. It notes that weak supervision uses labeling functions to generate noisy training labels at scale, which can then be combined using a generative model to infer true labels. The document also briefly mentions Snorkel, a system for creating labeling functions, and Snuba, its successor which focuses on scaling to very large datasets.

Technologie

About Me
● Google Engineer (2007-
11)
● Cloudera’s Director of
Data Science (2011-15)
● Slack’s Director of Data
Engineering (2015-
2017)
● Slack Engineer (2017-

Search Problems: A Comparison
1. Corpus/queries are
public.
1. Lots of head queries.
1. Web pages want to be
found.
1. Corpus/queries are
private.
1. Almost no head
queries.
1. Messages don’t care
about being found.

Snorkel And The Rise of Weak Supervision

Empfohlen

Making Friends And Enemies With Pivot Tablesryanbarnett

M3 l16 translation at facebookBoPeng76

Lean Data ScienceDomino Data Lab

Moving worlds and qualityMajd Uddin

Calling Voyager: Interface Design for NASA’s Deep Space NetworkFITC

Content Jam 2016: Unthinkable: How the World’s Most Creative Content Marketer...Orbit Media Studios

The future of conversation uiAndrés Leonardo Martinez Ortiz

APIStrat & APIDays Berlin 2015Joyce Stack

Empfohlen

Making Friends And Enemies With Pivot Tablesryanbarnett

M3 l16 translation at facebookBoPeng76

Lean Data ScienceDomino Data Lab

Moving worlds and qualityMajd Uddin

Calling Voyager: Interface Design for NASA’s Deep Space NetworkFITC

Content Jam 2016: Unthinkable: How the World’s Most Creative Content Marketer...Orbit Media Studios

The future of conversation uiAndrés Leonardo Martinez Ortiz

APIStrat & APIDays Berlin 2015Joyce Stack

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf

Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf

Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf

Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf

Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf

Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf

Meghana Ravikumar - Optimized Image Classification on the CheapMLconf

Noam Finkelstein - The Importance of Modeling Data CollectionMLconf

June Andrews - The Uncanny Valley of MLMLconf

Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf

Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf

Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf

Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf

Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf

Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf

Neel Sundaresan - Teaching a machine to codeMLconf

Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf

Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf

Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf

Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...MLconf

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Weitere ähnliche Inhalte

Mehr von MLconf