SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Mathematical Roadmap for IR

Introduction to the mathematical concepts
commonly used in IR/ST

               - Abhay   Shete, 42
Why bother about the maths ?

●   Lots of open source libraries out there
●   General approach for external software
    –   Include it
    –   Build it
    –   Forget it !!
IR is fuzzy
●   Consider the following sentences
    –   India beat Pakistan
    –   Sachin has scored over 10,000 runs in Test cricket
    –   Sachin Tendulkar was unable to run to the non-
        striker's end
●   How do we humans interpret this information ?
●   How do machines interpret it ?
What excites me about IR ?

 ●   Its all about indulging and flirting with sexy
     models !!!!
Models of interest

●   Vector space models
●   Inductive models
    –   Probabilistic
    –   Neural networks, decision trees
●   Hybrid – vector space and inductive
●   Others...
Vector Space Model
●   Recall junior college mathematics
●   Dot-product, cross-product, projections, etc..
Vector Space Model

●   A1, A2 are articles about animals
●   P1 contains politics
●   To find similarity scores between
    –   a) A1, A2 ==> assume score is S1
    –   b) A1, P1 ==> assume score is S2
●    Intuitively S1 > S2
●    How do u create a model around this ?
Vector Space Model
●   A1 = {dog(1), cat(2), lion(1), rhino(1), tiger(2)}
●   A2 = {eagle(1), cat(1), tiger(1), fox, hyena
●
    N Dimensional Space !!!
     d1 = 1(dog) + 2(cat) + 1(lion) + 1(rhino) +
    2(tiger) + 0(eagle) + 0(fox) + 0(hyena)
●    d2 = 0(dog) + 1(cat) + 0(lion) + 0(rhino) +
    1(tiger) + 1(eagle) + 1(fox) + 1(hyena)
●    Similarity is measured by the angle between the
    two vectors !!
Similarity measured by the angle between
              two vectors !!


 If X and Y are two n-dimensional vectors
 <xi> and <yi>, the angle θ between them
 satisfies:
       X Y = |X| |Y| cos θ
       cos θ = X Y / (|X||Y|)
News alerts demo...
Linear Algebra

●   Vectors and matrices are equivalent
●   Important Linear Algebra concepts:
    –   Singular Value Decomposition
    –   EigenVectors
         ●   Latent Semantic Indexing (LSI)
         ●   Face recognition
Latent Semantic Indexing

●   Exact keyword match not required
●   No predfined semantic knowledge base
●   Demo
    –   http://lsi.research.telcordia.com/lsi-bin/lsiQuery
●   How does it work ?
    –   Fish Tank Analogy
Uncertainities, Pitfalls

●   Not suitable for fine grained search
●   Web Scale corpus !!
●   Does anybody find Google suggestions really
    helpful ?
Face recognition application

●   EigenValues, covariance and SVD
●   Intuitive Understanding
    –   Create a vector space from the pixels modelling the
        face
    –   Find “average” face vector
    –   Find the specific features of every face by computing
        the variance from the average vector
    –   Take the top N specific features of this vector space
        and create the “classifier” vector
Document Clustering

●   Clusty.com demo
●   Clustering approaches
Inductive Models

●   Require prior information (Training set) to
    extrapolate to future unseen cases.

●   Create a “function” from the input training data
    which is applied on test data to produce output.
Bag 1         Bag 2        Bag 3

Ball drawn at random from a bag
and found to be white. What are the
chances of it being drawn from Bag
3?
Bag 1         Bag 2          Bag 3


Ball drawn at random from a bag
and found to be green. What are
the chances that it is from Bag 2 ?
●   Any way to model this ?
●   Bayes Theorem !!!
Sachin, Pilot,         Rock, band,
Sachin, cricket,     political, career,   took, crowd,
  Tendulkar,         drew, crowds,        storm, elected
  run, wicket,                            , year,
                     centre,              grammy
  crowd              government,
  cheered            elections


 Cricket Bag          Politics Bag         Music Bag


  Test sentence:- Sachin's 10,000th run was
  highly appreciated by the crowd

  What chances this is from Bag “Cricket” ?
Other applications on probabilistic
models
●   Targeted Advertising – Demo
●   POS Tagging
    http://www.infogistics.com/posdemo.htm
●   Named Entity Recognition
Neural Networks

●   Demo video (5 minutes) – Machine learns how to
    steer the vehicle by observing the driver
●   After training, capable of steering the vehicle on
    its own.
Uncertainities, Pitfalls
●   Depends on the training set
●   Training set may not be in line with the
    application for which it is being used.
    –   POS Tagging done on Wall Street Corpus
    –   “Like” Applying the same for “like” analyzing teen
        text !!
    –   “dove makes my skin smooth.”
●   Can you deal with “The Black Swan”!!
●   Is inductive learning really needed ?
●   Combine with other features/approaches ?
●   More data better than a good algorithm.

Weitere ähnliche Inhalte

Ähnlich wie Mathematical Concepts for IR Models

DL Classe 0 - You can do it
DL Classe 0 - You can do itDL Classe 0 - You can do it
DL Classe 0 - You can do itGregory Renard
 
Assessing computational thinking
Assessing computational thinkingAssessing computational thinking
Assessing computational thinkingDaniel Duckworth
 
AI - To get an overview
AI   - To get an overviewAI   - To get an overview
AI - To get an overviewKranti Asapu
 
Begin with Data Scientist
Begin with Data ScientistBegin with Data Scientist
Begin with Data ScientistNarong Intiruk
 
Artificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdfArtificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdfJayanti Prasad Ph.D.
 
Unit I Introduction to AI K.Sundar,AP/CSE,VEC
Unit I Introduction to AI K.Sundar,AP/CSE,VECUnit I Introduction to AI K.Sundar,AP/CSE,VEC
Unit I Introduction to AI K.Sundar,AP/CSE,VECsundarKanagaraj1
 
2. AI - Introduction.pdf
2. AI - Introduction.pdf2. AI - Introduction.pdf
2. AI - Introduction.pdfSandi Rahmadika
 
CactusCon 2017 - OODA Loop in life & cyber threat intelligence
CactusCon 2017 - OODA Loop in life & cyber threat intelligenceCactusCon 2017 - OODA Loop in life & cyber threat intelligence
CactusCon 2017 - OODA Loop in life & cyber threat intelligenceDave Eilken
 
Mauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshopMauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshopCosmoAIMS Bassett
 
Keynote at the 2018 SIGGRAPH Conference on Motion, Interaction and Games
Keynote at the 2018 SIGGRAPH Conference on Motion, Interaction and GamesKeynote at the 2018 SIGGRAPH Conference on Motion, Interaction and Games
Keynote at the 2018 SIGGRAPH Conference on Motion, Interaction and GamesRogelio E. Cardona-Rivera
 
Machine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdfMachine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdfEd Fernandez
 
Ai career day
Ai career dayAi career day
Ai career dayUri Weiss
 
Artificial Intelligence in Gaming
Artificial Intelligence in GamingArtificial Intelligence in Gaming
Artificial Intelligence in GamingSatvik J
 
Today is all about AI
Today is all about AIToday is all about AI
Today is all about AIPetru Cioată
 
DriveBuild: Automation of Tests in the Field of Autonomous Cars
DriveBuild: Automation of Tests in the Field of Autonomous CarsDriveBuild: Automation of Tests in the Field of Autonomous Cars
DriveBuild: Automation of Tests in the Field of Autonomous CarsUniversity of Passau
 

Ähnlich wie Mathematical Concepts for IR Models (20)

DL Classe 0 - You can do it
DL Classe 0 - You can do itDL Classe 0 - You can do it
DL Classe 0 - You can do it
 
Assessing computational thinking
Assessing computational thinkingAssessing computational thinking
Assessing computational thinking
 
Lecture 1.pdf
Lecture 1.pdfLecture 1.pdf
Lecture 1.pdf
 
AI - To get an overview
AI   - To get an overviewAI   - To get an overview
AI - To get an overview
 
Begin with Data Scientist
Begin with Data ScientistBegin with Data Scientist
Begin with Data Scientist
 
Artificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdfArtificial Intelligence - Anna Uni -v1.pdf
Artificial Intelligence - Anna Uni -v1.pdf
 
Lecture 1 introduction
Lecture 1   introductionLecture 1   introduction
Lecture 1 introduction
 
Unit I Introduction to AI K.Sundar,AP/CSE,VEC
Unit I Introduction to AI K.Sundar,AP/CSE,VECUnit I Introduction to AI K.Sundar,AP/CSE,VEC
Unit I Introduction to AI K.Sundar,AP/CSE,VEC
 
2. AI - Introduction.pdf
2. AI - Introduction.pdf2. AI - Introduction.pdf
2. AI - Introduction.pdf
 
CactusCon 2017 - OODA Loop in life & cyber threat intelligence
CactusCon 2017 - OODA Loop in life & cyber threat intelligenceCactusCon 2017 - OODA Loop in life & cyber threat intelligence
CactusCon 2017 - OODA Loop in life & cyber threat intelligence
 
Mauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshopMauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshop
 
Intro AI.pdf
Intro AI.pdfIntro AI.pdf
Intro AI.pdf
 
Keynote at the 2018 SIGGRAPH Conference on Motion, Interaction and Games
Keynote at the 2018 SIGGRAPH Conference on Motion, Interaction and GamesKeynote at the 2018 SIGGRAPH Conference on Motion, Interaction and Games
Keynote at the 2018 SIGGRAPH Conference on Motion, Interaction and Games
 
Machine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdfMachine Learning & AI - 2022 intro for pre-college students.pdf
Machine Learning & AI - 2022 intro for pre-college students.pdf
 
Ai career day
Ai career dayAi career day
Ai career day
 
Artificial Intelligence in Gaming
Artificial Intelligence in GamingArtificial Intelligence in Gaming
Artificial Intelligence in Gaming
 
What is Ai ?
What is Ai ?What is Ai ?
What is Ai ?
 
Today is all about AI
Today is all about AIToday is all about AI
Today is all about AI
 
n01.ppt
n01.pptn01.ppt
n01.ppt
 
DriveBuild: Automation of Tests in the Field of Autonomous Cars
DriveBuild: Automation of Tests in the Field of Autonomous CarsDriveBuild: Automation of Tests in the Field of Autonomous Cars
DriveBuild: Automation of Tests in the Field of Autonomous Cars
 

Kürzlich hochgeladen

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 

Kürzlich hochgeladen (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Mathematical Concepts for IR Models

  • 1. Mathematical Roadmap for IR Introduction to the mathematical concepts commonly used in IR/ST - Abhay Shete, 42
  • 2. Why bother about the maths ? ● Lots of open source libraries out there ● General approach for external software – Include it – Build it – Forget it !!
  • 3. IR is fuzzy ● Consider the following sentences – India beat Pakistan – Sachin has scored over 10,000 runs in Test cricket – Sachin Tendulkar was unable to run to the non- striker's end ● How do we humans interpret this information ? ● How do machines interpret it ?
  • 4. What excites me about IR ? ● Its all about indulging and flirting with sexy models !!!!
  • 5. Models of interest ● Vector space models ● Inductive models – Probabilistic – Neural networks, decision trees ● Hybrid – vector space and inductive ● Others...
  • 6. Vector Space Model ● Recall junior college mathematics ● Dot-product, cross-product, projections, etc..
  • 7. Vector Space Model ● A1, A2 are articles about animals ● P1 contains politics ● To find similarity scores between – a) A1, A2 ==> assume score is S1 – b) A1, P1 ==> assume score is S2 ● Intuitively S1 > S2 ● How do u create a model around this ?
  • 8. Vector Space Model ● A1 = {dog(1), cat(2), lion(1), rhino(1), tiger(2)} ● A2 = {eagle(1), cat(1), tiger(1), fox, hyena ● N Dimensional Space !!! d1 = 1(dog) + 2(cat) + 1(lion) + 1(rhino) + 2(tiger) + 0(eagle) + 0(fox) + 0(hyena) ● d2 = 0(dog) + 1(cat) + 0(lion) + 0(rhino) + 1(tiger) + 1(eagle) + 1(fox) + 1(hyena) ● Similarity is measured by the angle between the two vectors !!
  • 9. Similarity measured by the angle between two vectors !! If X and Y are two n-dimensional vectors <xi> and <yi>, the angle θ between them satisfies: X Y = |X| |Y| cos θ cos θ = X Y / (|X||Y|)
  • 11. Linear Algebra ● Vectors and matrices are equivalent ● Important Linear Algebra concepts: – Singular Value Decomposition – EigenVectors ● Latent Semantic Indexing (LSI) ● Face recognition
  • 12. Latent Semantic Indexing ● Exact keyword match not required ● No predfined semantic knowledge base ● Demo – http://lsi.research.telcordia.com/lsi-bin/lsiQuery ● How does it work ? – Fish Tank Analogy
  • 13. Uncertainities, Pitfalls ● Not suitable for fine grained search ● Web Scale corpus !! ● Does anybody find Google suggestions really helpful ?
  • 14. Face recognition application ● EigenValues, covariance and SVD ● Intuitive Understanding – Create a vector space from the pixels modelling the face – Find “average” face vector – Find the specific features of every face by computing the variance from the average vector – Take the top N specific features of this vector space and create the “classifier” vector
  • 15. Document Clustering ● Clusty.com demo ● Clustering approaches
  • 16. Inductive Models ● Require prior information (Training set) to extrapolate to future unseen cases. ● Create a “function” from the input training data which is applied on test data to produce output.
  • 17. Bag 1 Bag 2 Bag 3 Ball drawn at random from a bag and found to be white. What are the chances of it being drawn from Bag 3?
  • 18. Bag 1 Bag 2 Bag 3 Ball drawn at random from a bag and found to be green. What are the chances that it is from Bag 2 ?
  • 19. Any way to model this ? ● Bayes Theorem !!!
  • 20. Sachin, Pilot, Rock, band, Sachin, cricket, political, career, took, crowd, Tendulkar, drew, crowds, storm, elected run, wicket, , year, centre, grammy crowd government, cheered elections Cricket Bag Politics Bag Music Bag Test sentence:- Sachin's 10,000th run was highly appreciated by the crowd What chances this is from Bag “Cricket” ?
  • 21. Other applications on probabilistic models ● Targeted Advertising – Demo ● POS Tagging http://www.infogistics.com/posdemo.htm ● Named Entity Recognition
  • 22. Neural Networks ● Demo video (5 minutes) – Machine learns how to steer the vehicle by observing the driver ● After training, capable of steering the vehicle on its own.
  • 23. Uncertainities, Pitfalls ● Depends on the training set ● Training set may not be in line with the application for which it is being used. – POS Tagging done on Wall Street Corpus – “Like” Applying the same for “like” analyzing teen text !! – “dove makes my skin smooth.” ● Can you deal with “The Black Swan”!! ● Is inductive learning really needed ? ● Combine with other features/approaches ? ● More data better than a good algorithm.