SlideShare ist ein Scribd-Unternehmen logo
1 von 63
SEMI-SUPERVISED CLASSIFICATION FOR
NATURAL LANGUAGE PROCESSING
PRESENTATION AT A GLANCE
•
–
–
–
–

•
–
–
–
–

•
2
SEMI-SUPERVISED LEARNING
•
–

•
•

3
SEMI-SUPERVISED LEARNING PROBLEMS

(1)
Learn from labeled data

Inductive
Learning

(2)
Apply learning on
unlabeled data to label
them
Transductive
Learning

(4)
Apply learning on
unseen unlabeled data

(3)
If confident in labeling,
then learn from
(1) and (2)

4
SEMI-SUPERVISED LEARNING PROBLEMS
•
–

•
–

5
SCOPES OF SEMI-SUPERVISED LEARNING
•
–
–
–

6
HOW DOES SEMI-SUPERVISED CLASSIFICATION WORK?

7
TYPES OF SEMI-SUPERVISED LEARNING
•
•
•
•
•

8
GENERATIVE VS DISCRIMINATIVE MODELS
(x,y)

Discriminative Models

Generative Models

9
GENERATIVE VS DISCRIMINATIVE MODELS
•
•

•

10
GENERATIVE VS DISCRIMINATIVE MODELS
•

•
•

11
GENERATIVE VS DISCRIMINATIVE MODELS
Conditional Probability,
to determine class
boundaries

Transductive SVM,
Graph-based
methods

Joint Probability P(x,y),
for any given y, we can
generate its x

EM Algorithm,
Self-learning

Cannot be used without considering P(x)
Difficult because P(x|y) are inadequate

12
GENERATIVE VS DISCRIMINATIVE MODELS

•
•

•

•
•

•

13
IS THERE A FREE LUNCH?
•
–

•

14
IS THERE A FREE LUNCH?
•
•

•

15
IS THERE A FREE LUNCH?

•
–

•

16
SELF-TRAINING

17
CO-TRAINING
•
•

18
CO-TRAINING

19
CO-TRAINING
•
•

•
•
•
20
CO-TRAINING
•
•
•

•

21
CO-TRAINING: COVEATS

22
ACTIVE LEARNING

23
WHICH METHOD SHOULD I USE?
•
–

•
–

•
–

•
–
24
WHICH METHOD SHOULD I USE?
•
–

•
–

25
SEMI-SUPERVISED CLASSIFICATION FOR NLP
•
•
•
•

26
EFFECTIVE SELF-TRAINING
FOR PARSING

27
INTRODUCTION
•
•

–

28
METHODS
•
•
•
–

29
DATASETS
•
–

•
•
•
–

30
RESULTS
•

–

•
–
–

31
LIMITATIONS
•
•
•
–
32
SEMI-SUPERVISED SPAM FILTERING:
DOES IT WORK?

33
INTRODUCTION
•

•

34
BACKGROUND
•
–
•
•
•
•
•

–
•
•
•
35
BACKGROUND
•
•
•
•

36
BACKGROUND
•
–
–

37
METHODS AND MATERIALS
•
–
•
•
•

–
•
•

38
RESULTS: DELAYED FEEDBACK VS CROSS-USER

Delayed Feedback

Cross-User

39
RESULTS: CROSS-CORPUS
•
•

40
EXTRACTIVE SUMMARIZATION USING
SUPERVISED AND SEMI-SUPERVISED
LEARNING

41
INTRODUCTION
•
•

42
METHOD
•

•
–

•
–
–
43
DATASETS
•
•
–

•
•
–

•
–
44
RESULTS: FEATURE SELECTION
•

Human Summary ROUGE I
Score was 0.422

45
RESULTS: EFFECT OF UNLABELED DATA

More labeled data
produced better Fscore

46
RESULTS: SUPERVISED VS SEMI-SUPERVISED

47
RESULTS: EFFECT OF SUMMARY LENGTH

48
LIMITATIONS
•
–

•
–

49
SEMI-SUPERVISED CLASSIFICATION FOR
EXTRACTING PROTEIN INTERACTION SENTENCES
USING DEPENDENCY PARSING

50
INTRODUCTION
•
•
•

•
•

51
INTRODUCTION
•
•

52
METHOD
•

•

53
DATASETS
•

–
–

54
RESULTS: AIMED DATASET

55
RESULTS: CB DATASET

56
RESULTS: EFFECT OF TRAINING DATA SIZE
(AIMED)
•

•

57
RESULTS: EFFECT OF TRAINING DATA SIZE
(CB)
•
•

58
LIMITATIONS
•
–

•
–

59
HOW MUCH UNLABELED DATA IS USED?

60
CONCLUSIONS
•
•
–
–
–

61
CONCLUSIONS
•
–
–
–

–

62
CONCLUSIONS

63

Weitere ähnliche Inhalte

Andere mochten auch

L1 l2 l3 introduction to machine translation
L1 l2 l3  introduction to machine translationL1 l2 l3  introduction to machine translation
L1 l2 l3 introduction to machine translation
Rushdi Shams
 
Probabilistic logic
Probabilistic logicProbabilistic logic
Probabilistic logic
Rushdi Shams
 
L13 why software fails
L13  why software failsL13  why software fails
L13 why software fails
Rushdi Shams
 
Knowledge representation
Knowledge representationKnowledge representation
Knowledge representation
Rushdi Shams
 
L1 overview of software engineering
L1  overview of software engineeringL1  overview of software engineering
L1 overview of software engineering
Rushdi Shams
 
Lecture 5, 6 and 7 cpu scheduling
Lecture 5, 6 and 7  cpu schedulingLecture 5, 6 and 7  cpu scheduling
Lecture 5, 6 and 7 cpu scheduling
Rushdi Shams
 
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating SystemsLecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
Rushdi Shams
 
Knowledge structure
Knowledge structureKnowledge structure
Knowledge structure
Rushdi Shams
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
Rushdi Shams
 
Syntax and semantics
Syntax and semanticsSyntax and semantics
Syntax and semantics
Rushdi Shams
 

Andere mochten auch (19)

Semi-Automatic Classification Algorithm: The differences between Minimum Dist...
Semi-Automatic Classification Algorithm: The differences between Minimum Dist...Semi-Automatic Classification Algorithm: The differences between Minimum Dist...
Semi-Automatic Classification Algorithm: The differences between Minimum Dist...
 
L1 l2 l3 introduction to machine translation
L1 l2 l3  introduction to machine translationL1 l2 l3  introduction to machine translation
L1 l2 l3 introduction to machine translation
 
Probabilistic logic
Probabilistic logicProbabilistic logic
Probabilistic logic
 
L13 why software fails
L13  why software failsL13  why software fails
L13 why software fails
 
Knowledge representation
Knowledge representationKnowledge representation
Knowledge representation
 
L1 overview of software engineering
L1  overview of software engineeringL1  overview of software engineering
L1 overview of software engineering
 
Lecture 5, 6 and 7 cpu scheduling
Lecture 5, 6 and 7  cpu schedulingLecture 5, 6 and 7  cpu scheduling
Lecture 5, 6 and 7 cpu scheduling
 
Image classification, remote sensing, P K MANI
Image classification, remote sensing, P K MANIImage classification, remote sensing, P K MANI
Image classification, remote sensing, P K MANI
 
L1 phishing
L1  phishingL1  phishing
L1 phishing
 
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating SystemsLecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
 
Weka
WekaWeka
Weka
 
Knowledge structure
Knowledge structureKnowledge structure
Knowledge structure
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
 
Semi supervised learning
Semi supervised learningSemi supervised learning
Semi supervised learning
 
Natural Language Processing and Machine Learning for Discovery
Natural Language Processing and Machine Learning for DiscoveryNatural Language Processing and Machine Learning for Discovery
Natural Language Processing and Machine Learning for Discovery
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Syntax and semantics
Syntax and semanticsSyntax and semantics
Syntax and semantics
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Python
 
The Natural Approach | Methods and Approaches of Language Teaching
The Natural Approach | Methods and Approaches of Language Teaching The Natural Approach | Methods and Approaches of Language Teaching
The Natural Approach | Methods and Approaches of Language Teaching
 

Mehr von Rushdi Shams (14)

Research Methodology and Tips on Better Research
Research Methodology and Tips on Better ResearchResearch Methodology and Tips on Better Research
Research Methodology and Tips on Better Research
 
Common evaluation measures in NLP and IR
Common evaluation measures in NLP and IRCommon evaluation measures in NLP and IR
Common evaluation measures in NLP and IR
 
Machine learning with nlp 101
Machine learning with nlp 101Machine learning with nlp 101
Machine learning with nlp 101
 
First order logic
First order logicFirst order logic
First order logic
 
Belief function
Belief functionBelief function
Belief function
 
L5 understanding hacking
L5  understanding hackingL5  understanding hacking
L5 understanding hacking
 
L3 defense
L3  defenseL3  defense
L3 defense
 
L2 Intrusion Detection System (IDS)
L2  Intrusion Detection System (IDS)L2  Intrusion Detection System (IDS)
L2 Intrusion Detection System (IDS)
 
L2 l3 l4 software process models
L2 l3 l4  software process modelsL2 l3 l4  software process models
L2 l3 l4 software process models
 
Lecture 14,15 and 16 file systems
Lecture 14,15 and 16  file systemsLecture 14,15 and 16  file systems
Lecture 14,15 and 16 file systems
 
Lecture 11,12 and 13 deadlocks
Lecture 11,12 and 13  deadlocksLecture 11,12 and 13  deadlocks
Lecture 11,12 and 13 deadlocks
 
Lecture 1 and 2 processes
Lecture 1 and 2  processesLecture 1 and 2  processes
Lecture 1 and 2 processes
 
Lecture 3 and 4 threads
Lecture 3 and 4  threadsLecture 3 and 4  threads
Lecture 3 and 4 threads
 
My slide relational algebra
My slide  relational algebraMy slide  relational algebra
My slide relational algebra
 

Kürzlich hochgeladen

The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
heathfieldcps1
 

Kürzlich hochgeladen (20)

When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
“O BEIJO” EM ARTE .
“O BEIJO” EM ARTE                       .“O BEIJO” EM ARTE                       .
“O BEIJO” EM ARTE .
 
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
 
II BIOSENSOR PRINCIPLE APPLICATIONS AND WORKING II
II BIOSENSOR PRINCIPLE APPLICATIONS AND WORKING IIII BIOSENSOR PRINCIPLE APPLICATIONS AND WORKING II
II BIOSENSOR PRINCIPLE APPLICATIONS AND WORKING II
 
Chapter 7 Pharmacosy Traditional System of Medicine & Ayurvedic Preparations ...
Chapter 7 Pharmacosy Traditional System of Medicine & Ayurvedic Preparations ...Chapter 7 Pharmacosy Traditional System of Medicine & Ayurvedic Preparations ...
Chapter 7 Pharmacosy Traditional System of Medicine & Ayurvedic Preparations ...
 
An Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge AppAn Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge App
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
Championnat de France de Tennis de table/
Championnat de France de Tennis de table/Championnat de France de Tennis de table/
Championnat de France de Tennis de table/
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
The Liver & Gallbladder (Anatomy & Physiology).pptx
The Liver &  Gallbladder (Anatomy & Physiology).pptxThe Liver &  Gallbladder (Anatomy & Physiology).pptx
The Liver & Gallbladder (Anatomy & Physiology).pptx
 
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptx
 
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
 
demyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptxdemyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptx
 
MOOD STABLIZERS DRUGS.pptx
MOOD     STABLIZERS           DRUGS.pptxMOOD     STABLIZERS           DRUGS.pptx
MOOD STABLIZERS DRUGS.pptx
 
How To Create Editable Tree View in Odoo 17
How To Create Editable Tree View in Odoo 17How To Create Editable Tree View in Odoo 17
How To Create Editable Tree View in Odoo 17
 

Semi-supervised classification for natural language processing