SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
Active Learning


Shrey Malik
0901CS32
s.malik@iitp.ac.in
What is it ?

●   Machine Learning

    ●   Making a program Curious !

    ●   Teach it to decide on its own.

    ●   Give it some intelligence


Make a program to label documents according to contents :

Sports , Technology, History, Geography, Politics etc...
What is it ?

Step 1.
   Download a lot of documents from     the web


Step 2.
   Label Them !


Labeling is quite a painful task. Somehow our program should
be able to distinguish b/w the various categories.


Teach the program using examples (Training set) and make sure
it makes intelligent decision in real world situations.


Question ! How many and what examples ?
Example



          All kinds of “unlabeled” data
Example




One way : Select a few data points at random, label
them give the input output set to the program … and
let it “learn” from these examples.


Supervised Learning .
Example




One way : Select a few data points at random, label them give
the input output set to the program … and let it “learn” from
these examples. BUT ! Keep in mind (memory) the location of
other labeled points.
Semi Supervised & Active learning !
Example




Got a better generalization this time !
Didn't we ??
Active Learning
Somehow make the set of training examples smaller
& results, more accurate.
So how to make Training set smaller & smarter ??

Select the Training examples which are most uncertain … instead
of doing it at random .


The program asks Queries from the “Oracle” in the form of
unlabeled instances to be labeled.


In this way, the active learner aims to achieve high accuracy
using as few labeled instances as possible, thereby minimizing
the cost of obtaining labeled data.

Eg. Query the unlabeled point that is:
Closest to the boundary.
OR                                                     
Most Uncertain
OR
Most likely to decrease overall certainty.
Etc etc.
How does the learner ask queries ?


There are several different problem scenarios in which
       the learner may be able to ask queries.



                     For example:
Membership Query Synthesis
   The learner may request labels for any unlabeled instance in the
input space, including (and typically assuming) queries that the
learner generates de novo, rather than those sampled from some
underlying natural distribution.




 BUT sometimes the queries to label are quite awkward !



                                   *De novo means from the source,fresh & itself.
Stream-Based Selective Sampling


   Obtain an unlabeled instance, sampled from the actual
distribution.
   Now, the learner decide whether to request its label or
not.




                     The learner !
Pool-Based Sampling
  For many real-world learning problems, large collections of
  unlabeled data Can be gathered at once.




                     The learner !

                 Se
                    le   cts
                               t he
                                      Be
                                        st
                                             Qu
                                               er y


                           A large pool of Instances
… and how does the program select the best Query ??


Uncertainty Sampling
  Query the instance for which it is least confident.

               x∗ = max( 1 − P ( y' | x ) )

               Where y' = max( P ( y | x ) )


             X* = The best Query
             P(y|x) = conditional probability …
…and how does the program select the best Query ??


 Query-By-Committee
  Maintain a committee of models all trained on `that`
  Input space & let them label it …
  Now select the queries for which they disagree
  the most !
For measuring the level of disagreement:




   Yi      :: ranges over all possible labelings.
   V (yi ) :: number of “votes” that a label receives from among the committee
               members’ predictions.
   C        :: Committee size !



        … and then there are a lot of other algorithms also !
The Algorithm…

1.Start with a large pool of unlabeled data


Select the single most informative instance to be
labeled by the oracle


Add the labeled query to the Training set


Re-train using this newly acquired knowledge


Goto 1
Is Active learning 'The' thing ?

Assumptions


   1.Annotator, the Oracle is always right .
   2.If Annotator is wrong, see rule one !


   3.Labeling is sooo expansive … is it ???


   So can my machine learn more economically if it is
              allowed to ask questions ???


              Are you from delhi ?
                                       Used the
                                        metro ?

              Seen the qutub
                 minar ?
Suggested Improvements in it ...

Dr. Burr Settles ...

The Oracle has to wait as learner “re-trains” after
each label By him/her. learner should
Ask to label a batch of queries at once instead …
Querying in BATCHES
Suggested Improvements in it ...

Dr. Burr Settles ...
Oracles are not always right …
   They can be fatigued
   Error in instruments etc


CrowdSourcing on web
   You just played a fun game :
                 Tag as many rockstars in the pic as you can
                             in one minute

   Challenge your friends                                Like on facebook


   ...meanwhile the learner was learning from your
   labels … thanku Oracle !
Suggested Improvements in it ...

Dr. Burr Settles ...
Goal: to minimize the overall of training an accurate model.
     Simply reducing the number of labeled instances
     Wont help.


Cost Sensitive Active Learning approaches explicitly
Account for varying labeling costs while selecting
Queries.


eg. Kapoor et al. Proposed a decision-theoratic approach.
   Takes into account both labeling & misclassification cost.
   Assumption: Cost of labeling prop. To length.
Suggested Improvements in it ...

Dr. Burr Settles ...

If labeling cost is not known,
Try to predict the real, unknown annotation cost based on a
few simple “meta features” on the instances.


Research has shown that these learned cost-models are
significantly better than simpler cost heuristics
(e.g., a linear function of length).
Active Learning :: Practical Examples

                    Drug Design

Unlabeled Points :: A large (really large) pool of Chemical
Compounds.
Label :: Active (binds to a target) or Not.
Getting a label :: The Experiment.
Active Learning :: Practical Examples
            Pedestrian Detection
Conclusion


Machines should be able to do all the things we hate … &
machine learning will play a big role in achieving this goal.


And to make machine learning faster and cheaper … active
                   learning is the key !


Machine/Active learning is a very good area for research !


                              Machines will become
                       Intelligent and wage a war against
                                    Humanity !
Thank You :)


    Do Check out
http://en.akinator.com

Weitere ähnliche Inhalte

Was ist angesagt?

Machine Learning Course | Edureka
Machine Learning Course | EdurekaMachine Learning Course | Edureka
Machine Learning Course | EdurekaEdureka!
 
Machine learning
Machine learningMachine learning
Machine learningeonx_32
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learningbutest
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Simplilearn
 
Machine Learning and its Applications
Machine Learning and its ApplicationsMachine Learning and its Applications
Machine Learning and its ApplicationsDr Ganesh Iyer
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningEng Teong Cheah
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
MACHINE LEARNING PPT(ML) rohit.pptx
MACHINE LEARNING  PPT(ML) rohit.pptxMACHINE LEARNING  PPT(ML) rohit.pptx
MACHINE LEARNING PPT(ML) rohit.pptxNikhilRanaCSELEET005
 
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...Simplilearn
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.butest
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningKoundinya Desiraju
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networksSi Haem
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learningUmmeSalmaM1
 
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete DeckAI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete DeckSlideTeam
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachinePulse
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep LearningOswald Campesato
 

Was ist angesagt? (20)

Machine Learning Course | Edureka
Machine Learning Course | EdurekaMachine Learning Course | Edureka
Machine Learning Course | Edureka
 
Machine learning
Machine learning Machine learning
Machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
 
Machine Learning and its Applications
Machine Learning and its ApplicationsMachine Learning and its Applications
Machine Learning and its Applications
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
MACHINE LEARNING PPT(ML) rohit.pptx
MACHINE LEARNING  PPT(ML) rohit.pptxMACHINE LEARNING  PPT(ML) rohit.pptx
MACHINE LEARNING PPT(ML) rohit.pptx
 
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networks
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learning
 
Machine Can Think
Machine Can ThinkMachine Can Think
Machine Can Think
 
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete DeckAI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine Learning and Real-World Applications
Machine Learning and Real-World ApplicationsMachine Learning and Real-World Applications
Machine Learning and Real-World Applications
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 

Andere mochten auch

10remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp0110remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp01Gina Gu
 
Primero scotia presentation final
Primero scotia presentation finalPrimero scotia presentation final
Primero scotia presentation finalprimero_mining
 
10 главных мифов об авторском праве
10 главных мифов об авторском праве10 главных мифов об авторском праве
10 главных мифов об авторском правеhadgiewa
 
#5 inspirational scenes from films
#5 inspirational scenes from films#5 inspirational scenes from films
#5 inspirational scenes from filmsdebbie14
 
Planning for draft 3
Planning for draft 3Planning for draft 3
Planning for draft 3debbie14
 
Tugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambang
Tugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambangTugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambang
Tugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambangSylvester Saragih
 
Primero bmo presentation final
Primero bmo presentation finalPrimero bmo presentation final
Primero bmo presentation finalprimero_mining
 
Primero baml presentation sept 2014
Primero baml presentation sept 2014Primero baml presentation sept 2014
Primero baml presentation sept 2014primero_mining
 
Media studies mark_scheme
Media studies mark_schemeMedia studies mark_scheme
Media studies mark_schemedebbie14
 
Primero 2014 bmo conference final
Primero 2014 bmo conference finalPrimero 2014 bmo conference final
Primero 2014 bmo conference finalprimero_mining
 
10remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp0110remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp01Gina Gu
 
Еремурус про екоосвіту_GreenDrinks 25.12.2012
Еремурус про екоосвіту_GreenDrinks 25.12.2012Еремурус про екоосвіту_GreenDrinks 25.12.2012
Еремурус про екоосвіту_GreenDrinks 25.12.2012Oleksandra Shandra
 
Rules and weather22
Rules and weather22Rules and weather22
Rules and weather22nmcquade
 

Andere mochten auch (20)

10remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp0110remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp01
 
Primero scotia presentation final
Primero scotia presentation finalPrimero scotia presentation final
Primero scotia presentation final
 
Kandesign
KandesignKandesign
Kandesign
 
10 главных мифов об авторском праве
10 главных мифов об авторском праве10 главных мифов об авторском праве
10 главных мифов об авторском праве
 
#5 inspirational scenes from films
#5 inspirational scenes from films#5 inspirational scenes from films
#5 inspirational scenes from films
 
βιβλιογραφία του κ
βιβλιογραφία του κβιβλιογραφία του κ
βιβλιογραφία του κ
 
Planning for draft 3
Planning for draft 3Planning for draft 3
Planning for draft 3
 
Tugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambang
Tugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambangTugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambang
Tugas amdal uu no 32 tahun 2009 pplh terhadap lingkungan tambang
 
Verb to be
Verb to beVerb to be
Verb to be
 
Primero bmo presentation final
Primero bmo presentation finalPrimero bmo presentation final
Primero bmo presentation final
 
Presentation1
Presentation1Presentation1
Presentation1
 
Primero baml presentation sept 2014
Primero baml presentation sept 2014Primero baml presentation sept 2014
Primero baml presentation sept 2014
 
Media studies mark_scheme
Media studies mark_schemeMedia studies mark_scheme
Media studies mark_scheme
 
Primero 2014 bmo conference final
Primero 2014 bmo conference finalPrimero 2014 bmo conference final
Primero 2014 bmo conference final
 
10remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp0110remarkableentrepreneurshipthoughts 131008125313-phpapp01
10remarkableentrepreneurshipthoughts 131008125313-phpapp01
 
Bestech new project
Bestech new projectBestech new project
Bestech new project
 
Isi makalah uji kuat tarik
Isi makalah uji kuat tarikIsi makalah uji kuat tarik
Isi makalah uji kuat tarik
 
Еремурус про екоосвіту_GreenDrinks 25.12.2012
Еремурус про екоосвіту_GreenDrinks 25.12.2012Еремурус про екоосвіту_GreenDrinks 25.12.2012
Еремурус про екоосвіту_GreenDrinks 25.12.2012
 
Rules and weather22
Rules and weather22Rules and weather22
Rules and weather22
 
Question 4
Question 4Question 4
Question 4
 

Ähnlich wie Machine Learning

Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Animesh Sinha
 
Machine Learning an Research Overview
Machine Learning an Research OverviewMachine Learning an Research Overview
Machine Learning an Research OverviewKathirvel Ayyaswamy
 
Exploratory Testing Explained
Exploratory Testing ExplainedExploratory Testing Explained
Exploratory Testing ExplainedTechWell
 
Machine learning and types
Machine learning and typesMachine learning and types
Machine learning and typesPadma Metta
 
Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Amruta Aphale
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introductionAdwait Bhave
 
Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptxchadhar227
 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning LandscapeEng Teong Cheah
 
NYAI - Interactive Machine Learning by Daniel Hsu
NYAI - Interactive Machine Learning by Daniel HsuNYAI - Interactive Machine Learning by Daniel Hsu
NYAI - Interactive Machine Learning by Daniel HsuRizwan Habib
 
Lect 7 intro to M.L..pdf
Lect 7 intro to M.L..pdfLect 7 intro to M.L..pdf
Lect 7 intro to M.L..pdfHassanElalfy4
 
Introduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfIntroduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfGandhiMathy6
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsXavier Amatriain
 
Machine learning basics by akanksha bali
Machine learning basics by akanksha baliMachine learning basics by akanksha bali
Machine learning basics by akanksha baliAkanksha Bali
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics Akanksha Bali
 
Big data expo - machine learning in the elastic stack
Big data expo - machine learning in the elastic stack Big data expo - machine learning in the elastic stack
Big data expo - machine learning in the elastic stack BigDataExpo
 
EssentialsOfMachineLearning.pdf
EssentialsOfMachineLearning.pdfEssentialsOfMachineLearning.pdf
EssentialsOfMachineLearning.pdfAnkita Tiwari
 

Ähnlich wie Machine Learning (20)

Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha
 
Machine Learning an Research Overview
Machine Learning an Research OverviewMachine Learning an Research Overview
Machine Learning an Research Overview
 
Exploratory Testing Explained
Exploratory Testing ExplainedExploratory Testing Explained
Exploratory Testing Explained
 
Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptx
 
Lab 7.pptx
Lab 7.pptxLab 7.pptx
Lab 7.pptx
 
Machine learning and types
Machine learning and typesMachine learning and types
Machine learning and types
 
Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1Machine Learning SPPU Unit 1
Machine Learning SPPU Unit 1
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
 
Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptx
 
Machine Learning Landscape
Machine Learning LandscapeMachine Learning Landscape
Machine Learning Landscape
 
NYAI - Interactive Machine Learning by Daniel Hsu
NYAI - Interactive Machine Learning by Daniel HsuNYAI - Interactive Machine Learning by Daniel Hsu
NYAI - Interactive Machine Learning by Daniel Hsu
 
Lect 7 intro to M.L..pdf
Lect 7 intro to M.L..pdfLect 7 intro to M.L..pdf
Lect 7 intro to M.L..pdf
 
Introduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfIntroduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdf
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
 
Machine learning basics by akanksha bali
Machine learning basics by akanksha baliMachine learning basics by akanksha bali
Machine learning basics by akanksha bali
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics
 
Big data expo - machine learning in the elastic stack
Big data expo - machine learning in the elastic stack Big data expo - machine learning in the elastic stack
Big data expo - machine learning in the elastic stack
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
lec1.ppt
lec1.pptlec1.ppt
lec1.ppt
 
EssentialsOfMachineLearning.pdf
EssentialsOfMachineLearning.pdfEssentialsOfMachineLearning.pdf
EssentialsOfMachineLearning.pdf
 

Kürzlich hochgeladen

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 

Kürzlich hochgeladen (20)

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 

Machine Learning

  • 2. What is it ? ● Machine Learning ● Making a program Curious ! ● Teach it to decide on its own. ● Give it some intelligence Make a program to label documents according to contents : Sports , Technology, History, Geography, Politics etc...
  • 3. What is it ? Step 1. Download a lot of documents from the web Step 2. Label Them ! Labeling is quite a painful task. Somehow our program should be able to distinguish b/w the various categories. Teach the program using examples (Training set) and make sure it makes intelligent decision in real world situations. Question ! How many and what examples ?
  • 4. Example All kinds of “unlabeled” data
  • 5. Example One way : Select a few data points at random, label them give the input output set to the program … and let it “learn” from these examples. Supervised Learning .
  • 6. Example One way : Select a few data points at random, label them give the input output set to the program … and let it “learn” from these examples. BUT ! Keep in mind (memory) the location of other labeled points. Semi Supervised & Active learning !
  • 7. Example Got a better generalization this time ! Didn't we ??
  • 8. Active Learning Somehow make the set of training examples smaller & results, more accurate.
  • 9. So how to make Training set smaller & smarter ?? Select the Training examples which are most uncertain … instead of doing it at random . The program asks Queries from the “Oracle” in the form of unlabeled instances to be labeled. In this way, the active learner aims to achieve high accuracy using as few labeled instances as possible, thereby minimizing the cost of obtaining labeled data. Eg. Query the unlabeled point that is: Closest to the boundary. OR Most Uncertain OR Most likely to decrease overall certainty. Etc etc.
  • 10. How does the learner ask queries ? There are several different problem scenarios in which the learner may be able to ask queries. For example:
  • 11. Membership Query Synthesis The learner may request labels for any unlabeled instance in the input space, including (and typically assuming) queries that the learner generates de novo, rather than those sampled from some underlying natural distribution. BUT sometimes the queries to label are quite awkward ! *De novo means from the source,fresh & itself.
  • 12. Stream-Based Selective Sampling Obtain an unlabeled instance, sampled from the actual distribution. Now, the learner decide whether to request its label or not. The learner !
  • 13. Pool-Based Sampling For many real-world learning problems, large collections of unlabeled data Can be gathered at once. The learner ! Se le cts t he Be st Qu er y A large pool of Instances
  • 14. … and how does the program select the best Query ?? Uncertainty Sampling Query the instance for which it is least confident. x∗ = max( 1 − P ( y' | x ) ) Where y' = max( P ( y | x ) ) X* = The best Query P(y|x) = conditional probability …
  • 15. …and how does the program select the best Query ?? Query-By-Committee Maintain a committee of models all trained on `that` Input space & let them label it … Now select the queries for which they disagree the most !
  • 16. For measuring the level of disagreement: Yi :: ranges over all possible labelings. V (yi ) :: number of “votes” that a label receives from among the committee members’ predictions. C :: Committee size ! … and then there are a lot of other algorithms also !
  • 17. The Algorithm… 1.Start with a large pool of unlabeled data Select the single most informative instance to be labeled by the oracle Add the labeled query to the Training set Re-train using this newly acquired knowledge Goto 1
  • 18. Is Active learning 'The' thing ? Assumptions 1.Annotator, the Oracle is always right . 2.If Annotator is wrong, see rule one ! 3.Labeling is sooo expansive … is it ??? So can my machine learn more economically if it is allowed to ask questions ??? Are you from delhi ? Used the metro ? Seen the qutub minar ?
  • 19. Suggested Improvements in it ... Dr. Burr Settles ... The Oracle has to wait as learner “re-trains” after each label By him/her. learner should Ask to label a batch of queries at once instead … Querying in BATCHES
  • 20. Suggested Improvements in it ... Dr. Burr Settles ... Oracles are not always right … They can be fatigued Error in instruments etc CrowdSourcing on web You just played a fun game : Tag as many rockstars in the pic as you can in one minute Challenge your friends Like on facebook ...meanwhile the learner was learning from your labels … thanku Oracle !
  • 21. Suggested Improvements in it ... Dr. Burr Settles ... Goal: to minimize the overall of training an accurate model. Simply reducing the number of labeled instances Wont help. Cost Sensitive Active Learning approaches explicitly Account for varying labeling costs while selecting Queries. eg. Kapoor et al. Proposed a decision-theoratic approach. Takes into account both labeling & misclassification cost. Assumption: Cost of labeling prop. To length.
  • 22. Suggested Improvements in it ... Dr. Burr Settles ... If labeling cost is not known, Try to predict the real, unknown annotation cost based on a few simple “meta features” on the instances. Research has shown that these learned cost-models are significantly better than simpler cost heuristics (e.g., a linear function of length).
  • 23. Active Learning :: Practical Examples Drug Design Unlabeled Points :: A large (really large) pool of Chemical Compounds. Label :: Active (binds to a target) or Not. Getting a label :: The Experiment.
  • 24. Active Learning :: Practical Examples Pedestrian Detection
  • 25. Conclusion Machines should be able to do all the things we hate … & machine learning will play a big role in achieving this goal. And to make machine learning faster and cheaper … active learning is the key ! Machine/Active learning is a very good area for research ! Machines will become Intelligent and wage a war against Humanity !
  • 26. Thank You :) Do Check out http://en.akinator.com