SlideShare ist ein Scribd-Unternehmen logo
1 von 63
SEMI-SUPERVISED CLASSIFICATION FOR
NATURAL LANGUAGE PROCESSING
PRESENTATION AT A GLANCE
•
–
–
–
–

•
–
–
–
–

•
2
SEMI-SUPERVISED LEARNING
•
–

•
•

3
SEMI-SUPERVISED LEARNING PROBLEMS

(1)
Learn from labeled data

Inductive
Learning

(2)
Apply learning on
unlabeled data to label
them
Transductive
Learning

(4)
Apply learning on
unseen unlabeled data

(3)
If confident in labeling,
then learn from
(1) and (2)

4
SEMI-SUPERVISED LEARNING PROBLEMS
•
–

•
–

5
SCOPES OF SEMI-SUPERVISED LEARNING
•
–
–
–

6
HOW DOES SEMI-SUPERVISED CLASSIFICATION WORK?

7
TYPES OF SEMI-SUPERVISED LEARNING
•
•
•
•
•

8
GENERATIVE VS DISCRIMINATIVE MODELS
(x,y)

Discriminative Models

Generative Models

9
GENERATIVE VS DISCRIMINATIVE MODELS
•
•

•

10
GENERATIVE VS DISCRIMINATIVE MODELS
•

•
•

11
GENERATIVE VS DISCRIMINATIVE MODELS
Conditional Probability,
to determine class
boundaries

Transductive SVM,
Graph-based
methods

Joint Probability P(x,y),
for any given y, we can
generate its x

EM Algorithm,
Self-learning

Cannot be used without considering P(x)
Difficult because P(x|y) are inadequate

12
GENERATIVE VS DISCRIMINATIVE MODELS

•
•

•

•
•

•

13
IS THERE A FREE LUNCH?
•
–

•

14
IS THERE A FREE LUNCH?
•
•

•

15
IS THERE A FREE LUNCH?

•
–

•

16
SELF-TRAINING

17
CO-TRAINING
•
•

18
CO-TRAINING

19
CO-TRAINING
•
•

•
•
•
20
CO-TRAINING
•
•
•

•

21
CO-TRAINING: COVEATS

22
ACTIVE LEARNING

23
WHICH METHOD SHOULD I USE?
•
–

•
–

•
–

•
–
24
WHICH METHOD SHOULD I USE?
•
–

•
–

25
SEMI-SUPERVISED CLASSIFICATION FOR NLP
•
•
•
•

26
EFFECTIVE SELF-TRAINING
FOR PARSING

27
INTRODUCTION
•
•

–

28
METHODS
•
•
•
–

29
DATASETS
•
–

•
•
•
–

30
RESULTS
•

–

•
–
–

31
LIMITATIONS
•
•
•
–
32
SEMI-SUPERVISED SPAM FILTERING:
DOES IT WORK?

33
INTRODUCTION
•

•

34
BACKGROUND
•
–
•
•
•
•
•

–
•
•
•
35
BACKGROUND
•
•
•
•

36
BACKGROUND
•
–
–

37
METHODS AND MATERIALS
•
–
•
•
•

–
•
•

38
RESULTS: DELAYED FEEDBACK VS CROSS-USER

Delayed Feedback

Cross-User

39
RESULTS: CROSS-CORPUS
•
•

40
EXTRACTIVE SUMMARIZATION USING
SUPERVISED AND SEMI-SUPERVISED
LEARNING

41
INTRODUCTION
•
•

42
METHOD
•

•
–

•
–
–
43
DATASETS
•
•
–

•
•
–

•
–
44
RESULTS: FEATURE SELECTION
•

Human Summary ROUGE I
Score was 0.422

45
RESULTS: EFFECT OF UNLABELED DATA

More labeled data
produced better Fscore

46
RESULTS: SUPERVISED VS SEMI-SUPERVISED

47
RESULTS: EFFECT OF SUMMARY LENGTH

48
LIMITATIONS
•
–

•
–

49
SEMI-SUPERVISED CLASSIFICATION FOR
EXTRACTING PROTEIN INTERACTION SENTENCES
USING DEPENDENCY PARSING

50
INTRODUCTION
•
•
•

•
•

51
INTRODUCTION
•
•

52
METHOD
•

•

53
DATASETS
•

–
–

54
RESULTS: AIMED DATASET

55
RESULTS: CB DATASET

56
RESULTS: EFFECT OF TRAINING DATA SIZE
(AIMED)
•

•

57
RESULTS: EFFECT OF TRAINING DATA SIZE
(CB)
•
•

58
LIMITATIONS
•
–

•
–

59
HOW MUCH UNLABELED DATA IS USED?

60
CONCLUSIONS
•
•
–
–
–

61
CONCLUSIONS
•
–
–
–

–

62
CONCLUSIONS

63

Weitere ähnliche Inhalte

Andere mochten auch

L1 l2 l3 introduction to machine translation
L1 l2 l3  introduction to machine translationL1 l2 l3  introduction to machine translation
L1 l2 l3 introduction to machine translation
Rushdi Shams
 
Probabilistic logic
Probabilistic logicProbabilistic logic
Probabilistic logic
Rushdi Shams
 
L13 why software fails
L13  why software failsL13  why software fails
L13 why software fails
Rushdi Shams
 
Knowledge representation
Knowledge representationKnowledge representation
Knowledge representation
Rushdi Shams
 
L1 overview of software engineering
L1  overview of software engineeringL1  overview of software engineering
L1 overview of software engineering
Rushdi Shams
 
Lecture 5, 6 and 7 cpu scheduling
Lecture 5, 6 and 7  cpu schedulingLecture 5, 6 and 7  cpu scheduling
Lecture 5, 6 and 7 cpu scheduling
Rushdi Shams
 
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating SystemsLecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
Rushdi Shams
 
Knowledge structure
Knowledge structureKnowledge structure
Knowledge structure
Rushdi Shams
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
Rushdi Shams
 
Syntax and semantics
Syntax and semanticsSyntax and semantics
Syntax and semantics
Rushdi Shams
 

Andere mochten auch (19)

Semi-Automatic Classification Algorithm: The differences between Minimum Dist...
Semi-Automatic Classification Algorithm: The differences between Minimum Dist...Semi-Automatic Classification Algorithm: The differences between Minimum Dist...
Semi-Automatic Classification Algorithm: The differences between Minimum Dist...
 
L1 l2 l3 introduction to machine translation
L1 l2 l3  introduction to machine translationL1 l2 l3  introduction to machine translation
L1 l2 l3 introduction to machine translation
 
Probabilistic logic
Probabilistic logicProbabilistic logic
Probabilistic logic
 
L13 why software fails
L13  why software failsL13  why software fails
L13 why software fails
 
Knowledge representation
Knowledge representationKnowledge representation
Knowledge representation
 
L1 overview of software engineering
L1  overview of software engineeringL1  overview of software engineering
L1 overview of software engineering
 
Lecture 5, 6 and 7 cpu scheduling
Lecture 5, 6 and 7  cpu schedulingLecture 5, 6 and 7  cpu scheduling
Lecture 5, 6 and 7 cpu scheduling
 
Image classification, remote sensing, P K MANI
Image classification, remote sensing, P K MANIImage classification, remote sensing, P K MANI
Image classification, remote sensing, P K MANI
 
L1 phishing
L1  phishingL1  phishing
L1 phishing
 
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating SystemsLecture 7, 8, 9 and 10  Inter Process Communication (IPC) in Operating Systems
Lecture 7, 8, 9 and 10 Inter Process Communication (IPC) in Operating Systems
 
Weka
WekaWeka
Weka
 
Knowledge structure
Knowledge structureKnowledge structure
Knowledge structure
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
 
Semi supervised learning
Semi supervised learningSemi supervised learning
Semi supervised learning
 
Natural Language Processing and Machine Learning for Discovery
Natural Language Processing and Machine Learning for DiscoveryNatural Language Processing and Machine Learning for Discovery
Natural Language Processing and Machine Learning for Discovery
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Syntax and semantics
Syntax and semanticsSyntax and semantics
Syntax and semantics
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Python
 
The Natural Approach | Methods and Approaches of Language Teaching
The Natural Approach | Methods and Approaches of Language Teaching The Natural Approach | Methods and Approaches of Language Teaching
The Natural Approach | Methods and Approaches of Language Teaching
 

Mehr von Rushdi Shams (14)

Research Methodology and Tips on Better Research
Research Methodology and Tips on Better ResearchResearch Methodology and Tips on Better Research
Research Methodology and Tips on Better Research
 
Common evaluation measures in NLP and IR
Common evaluation measures in NLP and IRCommon evaluation measures in NLP and IR
Common evaluation measures in NLP and IR
 
Machine learning with nlp 101
Machine learning with nlp 101Machine learning with nlp 101
Machine learning with nlp 101
 
First order logic
First order logicFirst order logic
First order logic
 
Belief function
Belief functionBelief function
Belief function
 
L5 understanding hacking
L5  understanding hackingL5  understanding hacking
L5 understanding hacking
 
L3 defense
L3  defenseL3  defense
L3 defense
 
L2 Intrusion Detection System (IDS)
L2  Intrusion Detection System (IDS)L2  Intrusion Detection System (IDS)
L2 Intrusion Detection System (IDS)
 
L2 l3 l4 software process models
L2 l3 l4  software process modelsL2 l3 l4  software process models
L2 l3 l4 software process models
 
Lecture 14,15 and 16 file systems
Lecture 14,15 and 16  file systemsLecture 14,15 and 16  file systems
Lecture 14,15 and 16 file systems
 
Lecture 11,12 and 13 deadlocks
Lecture 11,12 and 13  deadlocksLecture 11,12 and 13  deadlocks
Lecture 11,12 and 13 deadlocks
 
Lecture 1 and 2 processes
Lecture 1 and 2  processesLecture 1 and 2  processes
Lecture 1 and 2 processes
 
Lecture 3 and 4 threads
Lecture 3 and 4  threadsLecture 3 and 4  threads
Lecture 3 and 4 threads
 
My slide relational algebra
My slide  relational algebraMy slide  relational algebra
My slide relational algebra
 

Kürzlich hochgeladen

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Kürzlich hochgeladen (20)

Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 

Semi-supervised classification for natural language processing