SlideShare ist ein Scribd-Unternehmen logo
1 von 58
NLP - State of the Art
(and some missing parts…)
Liad Magen
Wien, am 27.03.2019
Hello, World!
Israel
computer science
biomedical engineering
NLP & AI Researcher @ enliteAI
Keep-Current Project:
- Machine Learning Seminar
- fast.ai + PyTorch course
data4good
EnliteAI Company Presentation | office@enlite.ai | 20190223 | © EnliteAI 2019
Services by enliteAI
AI Strategy &
Transformation
AI Lab
Prototyping &
Project Delivery
▪ Conduct AI readiness
check across organization
▪ Discover Use-cases to
build-up momentum
▪ Define AI Strategy
▪ Leverage AI Research for
practical applications
▪ Extend research to
production-grade
▪ Build models for commercial
use
▪ Development of POCs, MVPs
and prototypes
▪ Implementation &
Integration
▪ Roll-out & Support
3
data4good Hackathon
Applying data science to human needs and generating
social impact:
● GrünStattGrau
● Hilfswerk International
● Hilfswerk Österreich
● CivesSolutions
April 27-28th, 2019
A1 Telekom, Lassallestraße 9
What is NLP?
Enabling computers to interact with users in their language
NLP is everywhere
Applications:
- Language understanding
- Search engines
- Machine translation
- Text analysis (i.e. sentiment)
- Chatbots / Personal Assistance
NLP Building blocks
- Chunking
- Sequence Tagging
- Part of speech
- Named Entity Recognition (NER)
- Coreference Resolution
- ...
- Syntactic Parsing
- And more...
NLP in a nutshell
Human Language
Non-trivial useful output
takes as input text in human language and process it
in a way that suggests an intelligent process was involved
NLP is hard!
I’m eating pizza with olives
I’m eating pizza with friends
NLP is hard!
Google translate:
State of the art Der letzte Stand der
Technik
State of the art “Country of art”
German
Hebrew
Variability
AmbiguityHe acquired it
He purchased it
He bought it
It was bought by him
It was sold to him
She sold it to him
She sold him that
Twitter:
I loooooove it!
everytime → every time
oscar nom’d doc → Oscar-nominated documentary
is bananas → is great
- by Wei Xu
Bank
Apple
Star
Spring
Play
I’m reading a Book
I will Book the flight
Ambiguity - Real news headlines
● Scientists examine whales from space
● The pope's baby step on gays
● Girl hit by car in hospital
● Iraqi head seeks arms
● Lung cancer in women mushrooms
● Squad helps dog bite victim
● Miners refuse to work after death
● Stolen painting found by tree
● Actor sent to jail for not finishing sentence
● Stadium air conditioning fails — Fans protest
What do you mean no?
Martin is a data scientist
Martin is a great data scientist
What do you mean no?
Martin is a data scientist
Martin is not a data scientist
Martin is a great data scientist
Martin is not a great data scientist
What do you mean no?
Martin is a data scientist
Martin is not a data scientist
Martin is a great data scientist
Martin is not a great data scientist
Martin is not a data scientist
Martin is not a mad scientist
Negation + Monotonicity
John did not work for Microsoft
in 1989
as a VP
of sales
Restrictivity / Intersectivity
He is a great pianist → He is a pianist
He is a fake doctor → He is a doctor
I found my old jacket → I found my jacket
How do we do NLP?
Rule based
Systems
1950s-1990s Corpus based
statistics
1990s-2000s Machine
Learning
2000s-2014 Deep Learning
2014-2020(?)
LinguisticsExperience
Machine Learning Expertise
Transparent Debugability
Black box
How should we do NLP?
Rule based
Systems
1950s-1990s Corpus based
statistics
1990s-2000s Machine
Learning
2000s-2014 Deep Learning
2014-2020(?)
LinguisticsExperience
Machine Learning Expertise
Transparent Debugability
Black box
NLP Tomorrow
Rule based
Systems
1950s-1990s Corpus based
statistics
1990s-2000s Machine
Learning
2000s-2014 Deep Learning
2014-2020(?)
LinguisticsExperience
Machine Learning Expertise
Transparent Debugability
Black box
Humans writing rules
aided by ML/DL -
Resulting in
transparent and
debuggable models
NLP Today
- Chris Manning (Stanford) @ Simon’s institute - 27.3.2017
BiLSTM hagmoney ( > 2015)
Works on low level tasks:
● Coordination boundary prediction:
He will attend the meeting and present the results on Tuesday
He will attend the meeting on Tuesday
He will present the results on Tuesday
● Syntactic Parsing
● And more...
BiLSTM hagmoney ( > 2015)
But less on higher levels:
BiLSTM text generation with style:
“The film’s ultimate pleasure if you want to fall in love with the ending, you won’t be disappointed”
“The film’s simple, and a refreshing take on the complex family drama of the regions of human
intelligence.”
https://arxiv.org/pdf/1707.02633.pdf - Controlling Linguistic Style Aspects in Neural Language Generation
BiLSTM hagmoney ( > 2015)
But less on higher levels:
BiLSTM text generation with style:
“There are some funny parts but overall I didn’t like the first few funny parts, but overall pretty decent .”
“Ultimately, I can honestly say that this movie is full of stupid stupid and stupid stupid stupid stupid stupid.”
https://arxiv.org/pdf/1707.02633.pdf - Controlling Linguistic Style Aspects in Neural Language Generation
NLP in the academy
Multiple stacked networks
Each architecture solves a specific problem
(or a dataset...)
NGrams, TF/IDF, LDA / Topic modeling
Word2vec / GloVe / FastText
ELMo (AI2) / BERT / ERNIE (Baidu)
GPT-2 (OpenAI)
LSTM
spaCy
Flair
& Regular Expressions...
NLP in the industry
20 years of NLP research
Great progress on “low-level” text understanding
tasks:
● Morphology
● Syntax / Parsing
● Semantic Roles.
● Lexical Semantics
● Discourse. Coreference.
● Negation scope
Representation learning, text-to-text semantics,
RNNs
● Word vectors
● Concept vectors
● Sentence vectors
● Paraphrasing
20 years of NLP research
Great progress on “low-level” text understanding
tasks:
● Morphology
● Syntax / Parsing
● Semantic Roles.
● Lexical Semantics
● Discourse. Coreference.
● Negation scope
Representation learning, text-to-text semantics,
RNNs
● Word vectors
● Concept vectors
● Sentence vectors
● Paraphrasing
Remarkable Achievements
Not useful for non-experts
Automatically learn intricate
linguistic patterns
Bad with nuance.
Contains biases.
Hard to control.
Short recup of the last few years
2013
Language model
Learn joint likelihood of training sentences
Word2Vec
… GloVe … FastText …
First use of transfer learning in NLP
2018
ELMo - Contextual word vectors
ULMFiT - transfer learning + fine tuning
BERT - the above & more, on large scale
2019
Transformer-XL
GPT-2
BERT - architecture in brief
Embedding:
BERT - architecture in brief
● Embedding + Masking (Cloze)
● Transformer
● Sentence pairing
● Fine tuning
○ Sentence class Label
○ NER Sequence Labels
● Question Answering:
○ Start+End Vector &
Softmax over all positions
(as additional parameters)
○ Or Running Q + A (for each option)
and training on the class label
Further reading
● Toward data science:
○ Dissecting BERT (parts 1 & 2) - by Miguel Romero Calvo
○ Deconstructing BERT (parts 1 & 2) - by Jesse Vig
● HarvardNLP - The Annotated Transformer
● Jay Alammar - The illustrated BERT, ELMo and co.
● NLP.Stanford.edu/Seminar - Jacob Delvin presentation about BERT
Size matters
● ULMFiT
● ELMo
● GLoMo
● GPT
● BERT
● Transformer-XL
● MT-DNN
● GPT-2
● Contextual representations
● Generalized Language models
● Transfer learning in NLP
Text Generation
Dissecting GPT-2 results
HavardNLP
Giant Language model Test Room
http://gltr.io
NLP Progress - sample domains
Cross lingual Text Classification: pyTorch - LASER
Text Classification (English): ULMFiT
Automatic Speech Recognition (ASR): Google Tensorflow Lingvo
Translation: DeepL / Transformer Big (Facebook + Google)
NER: flair / BERT
Entity linking: DeepType - OpenAI
nlpprogress.com | github.com/syhw/wer_are_we
Q&A Datasets & Benchmarks
General Language Understanding Evaluation (GLUE)
Stanford Natural Language Inference (SNLI)
SQuAD / SQuAD 2.0
RACE (Carnegie Mellon)
HotpotQA, MS Marco, NarrativeQA, CoQA ...
Google AI natural questions dataset
Question Answering - State of the art
Question Answering - State of the art?
Depends on the dataset…
Each information extraction (IE) task is a different NLP application
(Also relevant for chatbots / personal assistants)
Information Extraction
Turning text into structured data
● Most data in the world is in text
● We can only easily analyze structured data
● Today - no easy way to do it on scale
● Possible for specific tasks - but requires
○ Enough data
○ Team of NLP experts
○ Time
Solving IE can be transformative to
industry & science
Question Answering - next steps
In which state was Yahoo founded?
http://ai.stanford.edu/blog/beyond-local-pattern-matching/
Question Answering - next steps
● (Cleverly) add more inductive biases
● Common sense
Possible methods:
● Interactive learning (human in the loop)
● Deep reinforcement learning
● GANs
Still missing...
1. Understanding (Not only NLP-related…)
○ What is captured by a network?
○ What can/’t the networks learn
2. Explainability
3. Fairness & accountability
○ Removing bias (current techniques only “put a lipstick on a pig”)
4. Working with little annotated data
Transferring knowledge across domains
5. Handling missing data
Thank you!
We’re “standing on the shoulders of giants”
- Isaac Newton
Appendix 1 - handling missing data
Handling missing data
- Mary is a great programmer but John isn’t
- The hotel received great reviews at Yelp but mediocre reviews at
TripAdvisor
- Fred took a picture of you and Susan of me
Handling missing data
- Mary is a great programmer but John isn’t _____ (verb phrase ellipsis)
- The hotel received great reviews at Yelp but _____ mediocre reviews at
TripAdvisor (argument clusters)
- Fred took a picture of you and Susan ______ of me (gapping)
(Briding)
Prices of oil
I turn 40 this year
He drove 40 on the highway
I’ll give you 40 for that
I turn 40 (age) this year
He drove 40 (kmh) on the highway
I’ll give you 40 (currency) for that
Handling missing data
● Current end-to-end systems fails on them
● Stanford parser can partially handle several cases
● Work in progress in BIU
Appendix 2
Transferring knowledge across domains
Example - review sentiment analysis
“The soup was delicious” → +
“The steak was tough” → -
“The dessert was decadent” → +
[food] was [good_food_adj]
[food] was [bad_food_adj]
Example - review sentiment analysis
Real world:
the soup that the server recommended was amazing.
the soup, which I expected to be good, was actually pretty bad
the soup, which was served cold and was looking terrible, was surprisingly great.
Example - review sentiment analysis
Syntax to the rescue!
Capture this:
With a single declarative pattern:
Can help to generalize and transfer knowledge across domains
Example - review sentiment analysis
Until it doesn’t:
I didn’t think the food was great:
Would combining rules with deep learning be the solution?
Thank you again!
Human: What do we want?
Computer: Natural-language processing!
Human: When do we want it?
Computer: When do we want what?
Dynamic Word Embeddings
Dynamic Word Embeddings for Evolving Semantic Discovery
https://arxiv.org/pdf/1703.00607.pdf

Weitere ähnliche Inhalte

Was ist angesagt?

Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AISaurav Shrestha
 
Sarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysisSarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysisAnuj Gupta
 
BERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersBERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersLiangqun Lu
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
An introduction to the Transformers architecture and BERT
An introduction to the Transformers architecture and BERTAn introduction to the Transformers architecture and BERT
An introduction to the Transformers architecture and BERTSuman Debnath
 
Sequence to sequence (encoder-decoder) learning
Sequence to sequence (encoder-decoder) learningSequence to sequence (encoder-decoder) learning
Sequence to sequence (encoder-decoder) learningRoberto Pereira Silveira
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelHemantha Kulathilake
 
Question Answering System using machine learning approach
Question Answering System using machine learning approachQuestion Answering System using machine learning approach
Question Answering System using machine learning approachGarima Nanda
 
Introduction to spaCy
Introduction to spaCyIntroduction to spaCy
Introduction to spaCyRyo Takahashi
 
Comparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP ModelsComparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP Modelssaurav singla
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingHemantha Kulathilake
 
A Simple Explanation of XLNet
A Simple Explanation of XLNetA Simple Explanation of XLNet
A Simple Explanation of XLNetDomyoung Lee
 

Was ist angesagt? (20)

Bert
BertBert
Bert
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AI
 
Grammar
GrammarGrammar
Grammar
 
Sarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysisSarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysis
 
5. phase of nlp
5. phase of nlp5. phase of nlp
5. phase of nlp
 
BERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersBERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from Transformers
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
An introduction to the Transformers architecture and BERT
An introduction to the Transformers architecture and BERTAn introduction to the Transformers architecture and BERT
An introduction to the Transformers architecture and BERT
 
Sequence to sequence (encoder-decoder) learning
Sequence to sequence (encoder-decoder) learningSequence to sequence (encoder-decoder) learning
Sequence to sequence (encoder-decoder) learning
 
BERT introduction
BERT introductionBERT introduction
BERT introduction
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
 
5. bleu
5. bleu5. bleu
5. bleu
 
Question Answering System using machine learning approach
Question Answering System using machine learning approachQuestion Answering System using machine learning approach
Question Answering System using machine learning approach
 
Introduction to spaCy
Introduction to spaCyIntroduction to spaCy
Introduction to spaCy
 
Transformers in 2021
Transformers in 2021Transformers in 2021
Transformers in 2021
 
NLP
NLPNLP
NLP
 
Comparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP ModelsComparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP Models
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological Parsing
 
A Simple Explanation of XLNet
A Simple Explanation of XLNetA Simple Explanation of XLNet
A Simple Explanation of XLNet
 

Ähnlich wie State of the art in Natural Language Processing (March 2019)

Continual Learning: why, how, and when
Continual Learning: why, how, and whenContinual Learning: why, how, and when
Continual Learning: why, how, and whenGabriele Graffieti
 
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdfITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdfOrtus Solutions, Corp
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...IMPACT Centre of Competence
 
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"Fwdays
 
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...AI Frontiers
 
A few questions about large scale machine learning
A few questions about large scale machine learningA few questions about large scale machine learning
A few questions about large scale machine learningTheodoros Vasiloudis
 
Nlp 2020 global ai conf -jeff_shomaker_final
Nlp 2020 global ai conf -jeff_shomaker_finalNlp 2020 global ai conf -jeff_shomaker_final
Nlp 2020 global ai conf -jeff_shomaker_finalJeffrey Shomaker
 
Distributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and PandasDistributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and PandasDatabricks
 
A Plan for Sustainable MIR Evaluation
A Plan for Sustainable MIR EvaluationA Plan for Sustainable MIR Evaluation
A Plan for Sustainable MIR EvaluationJulián Urbano
 
A step towards machine learning at accionlabs
A step towards machine learning at accionlabsA step towards machine learning at accionlabs
A step towards machine learning at accionlabsChetan Khatri
 
Babak Rasolzadeh: The importance of entities
Babak Rasolzadeh: The importance of entitiesBabak Rasolzadeh: The importance of entities
Babak Rasolzadeh: The importance of entitiesZoltan Varju
 
AI and ChatGPT in Online Education
AI and ChatGPT in Online Education AI and ChatGPT in Online Education
AI and ChatGPT in Online Education D2L Barry
 
Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Grigory Sapunov
 
Why natural language is next step in the AI evolution
Why natural language is next step in the AI evolutionWhy natural language is next step in the AI evolution
Why natural language is next step in the AI evolutioninovex GmbH
 
Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?
Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?
Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?Mustafa Ekim
 
DataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdfDataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdfJedha Bootcamp
 
The Rise Of Conversational AI with David Low
The Rise Of Conversational AI with David LowThe Rise Of Conversational AI with David Low
The Rise Of Conversational AI with David LowDatabricks
 
GDSC Introduction to Deep Learning Workshop
GDSC Introduction to Deep Learning WorkshopGDSC Introduction to Deep Learning Workshop
GDSC Introduction to Deep Learning Workshopssuser540861
 

Ähnlich wie State of the art in Natural Language Processing (March 2019) (20)

Continual Learning: why, how, and when
Continual Learning: why, how, and whenContinual Learning: why, how, and when
Continual Learning: why, how, and when
 
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdfITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
ITB_2023_Chatgpt_Box_Scott_Steinbeck.pdf
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
 
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
Thomas Wolf "An Introduction to Transfer Learning and Hugging Face"
 
Machine Translation: The Neural Frontier
Machine Translation: The Neural FrontierMachine Translation: The Neural Frontier
Machine Translation: The Neural Frontier
 
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
 
A few questions about large scale machine learning
A few questions about large scale machine learningA few questions about large scale machine learning
A few questions about large scale machine learning
 
Nlp 2020 global ai conf -jeff_shomaker_final
Nlp 2020 global ai conf -jeff_shomaker_finalNlp 2020 global ai conf -jeff_shomaker_final
Nlp 2020 global ai conf -jeff_shomaker_final
 
Distributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and PandasDistributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
 
A Plan for Sustainable MIR Evaluation
A Plan for Sustainable MIR EvaluationA Plan for Sustainable MIR Evaluation
A Plan for Sustainable MIR Evaluation
 
A step towards machine learning at accionlabs
A step towards machine learning at accionlabsA step towards machine learning at accionlabs
A step towards machine learning at accionlabs
 
Babak Rasolzadeh: The importance of entities
Babak Rasolzadeh: The importance of entitiesBabak Rasolzadeh: The importance of entities
Babak Rasolzadeh: The importance of entities
 
AI and ChatGPT in Online Education
AI and ChatGPT in Online Education AI and ChatGPT in Online Education
AI and ChatGPT in Online Education
 
Machine learning 101 Talk at Freshworks
Machine learning 101 Talk at FreshworksMachine learning 101 Talk at Freshworks
Machine learning 101 Talk at Freshworks
 
Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016Deep Learning and the state of AI / 2016
Deep Learning and the state of AI / 2016
 
Why natural language is next step in the AI evolution
Why natural language is next step in the AI evolutionWhy natural language is next step in the AI evolution
Why natural language is next step in the AI evolution
 
Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?
Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?
Yenikod Yazılım Kursu - Kodlama Öğrenebilir Miyim? Kodlama Bana Göre Mi?
 
DataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdfDataScientist Job : Between Myths and Reality.pdf
DataScientist Job : Between Myths and Reality.pdf
 
The Rise Of Conversational AI with David Low
The Rise Of Conversational AI with David LowThe Rise Of Conversational AI with David Low
The Rise Of Conversational AI with David Low
 
GDSC Introduction to Deep Learning Workshop
GDSC Introduction to Deep Learning WorkshopGDSC Introduction to Deep Learning Workshop
GDSC Introduction to Deep Learning Workshop
 

Kürzlich hochgeladen

BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Kürzlich hochgeladen (20)

BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

State of the art in Natural Language Processing (March 2019)

  • 1. NLP - State of the Art (and some missing parts…) Liad Magen Wien, am 27.03.2019
  • 2. Hello, World! Israel computer science biomedical engineering NLP & AI Researcher @ enliteAI Keep-Current Project: - Machine Learning Seminar - fast.ai + PyTorch course data4good
  • 3. EnliteAI Company Presentation | office@enlite.ai | 20190223 | © EnliteAI 2019 Services by enliteAI AI Strategy & Transformation AI Lab Prototyping & Project Delivery ▪ Conduct AI readiness check across organization ▪ Discover Use-cases to build-up momentum ▪ Define AI Strategy ▪ Leverage AI Research for practical applications ▪ Extend research to production-grade ▪ Build models for commercial use ▪ Development of POCs, MVPs and prototypes ▪ Implementation & Integration ▪ Roll-out & Support 3
  • 4. data4good Hackathon Applying data science to human needs and generating social impact: ● GrünStattGrau ● Hilfswerk International ● Hilfswerk Österreich ● CivesSolutions April 27-28th, 2019 A1 Telekom, Lassallestraße 9
  • 5. What is NLP? Enabling computers to interact with users in their language
  • 6. NLP is everywhere Applications: - Language understanding - Search engines - Machine translation - Text analysis (i.e. sentiment) - Chatbots / Personal Assistance
  • 7. NLP Building blocks - Chunking - Sequence Tagging - Part of speech - Named Entity Recognition (NER) - Coreference Resolution - ... - Syntactic Parsing - And more...
  • 8. NLP in a nutshell Human Language Non-trivial useful output takes as input text in human language and process it in a way that suggests an intelligent process was involved
  • 9. NLP is hard! I’m eating pizza with olives I’m eating pizza with friends
  • 10. NLP is hard! Google translate: State of the art Der letzte Stand der Technik State of the art “Country of art” German Hebrew
  • 11. Variability AmbiguityHe acquired it He purchased it He bought it It was bought by him It was sold to him She sold it to him She sold him that Twitter: I loooooove it! everytime → every time oscar nom’d doc → Oscar-nominated documentary is bananas → is great - by Wei Xu Bank Apple Star Spring Play I’m reading a Book I will Book the flight
  • 12. Ambiguity - Real news headlines ● Scientists examine whales from space ● The pope's baby step on gays ● Girl hit by car in hospital ● Iraqi head seeks arms ● Lung cancer in women mushrooms ● Squad helps dog bite victim ● Miners refuse to work after death ● Stolen painting found by tree ● Actor sent to jail for not finishing sentence ● Stadium air conditioning fails — Fans protest
  • 13. What do you mean no? Martin is a data scientist Martin is a great data scientist
  • 14. What do you mean no? Martin is a data scientist Martin is not a data scientist Martin is a great data scientist Martin is not a great data scientist
  • 15. What do you mean no? Martin is a data scientist Martin is not a data scientist Martin is a great data scientist Martin is not a great data scientist Martin is not a data scientist Martin is not a mad scientist
  • 16. Negation + Monotonicity John did not work for Microsoft in 1989 as a VP of sales
  • 17. Restrictivity / Intersectivity He is a great pianist → He is a pianist He is a fake doctor → He is a doctor I found my old jacket → I found my jacket
  • 18. How do we do NLP? Rule based Systems 1950s-1990s Corpus based statistics 1990s-2000s Machine Learning 2000s-2014 Deep Learning 2014-2020(?) LinguisticsExperience Machine Learning Expertise Transparent Debugability Black box
  • 19. How should we do NLP? Rule based Systems 1950s-1990s Corpus based statistics 1990s-2000s Machine Learning 2000s-2014 Deep Learning 2014-2020(?) LinguisticsExperience Machine Learning Expertise Transparent Debugability Black box
  • 20. NLP Tomorrow Rule based Systems 1950s-1990s Corpus based statistics 1990s-2000s Machine Learning 2000s-2014 Deep Learning 2014-2020(?) LinguisticsExperience Machine Learning Expertise Transparent Debugability Black box Humans writing rules aided by ML/DL - Resulting in transparent and debuggable models
  • 21. NLP Today - Chris Manning (Stanford) @ Simon’s institute - 27.3.2017
  • 22. BiLSTM hagmoney ( > 2015) Works on low level tasks: ● Coordination boundary prediction: He will attend the meeting and present the results on Tuesday He will attend the meeting on Tuesday He will present the results on Tuesday ● Syntactic Parsing ● And more...
  • 23. BiLSTM hagmoney ( > 2015) But less on higher levels: BiLSTM text generation with style: “The film’s ultimate pleasure if you want to fall in love with the ending, you won’t be disappointed” “The film’s simple, and a refreshing take on the complex family drama of the regions of human intelligence.” https://arxiv.org/pdf/1707.02633.pdf - Controlling Linguistic Style Aspects in Neural Language Generation
  • 24. BiLSTM hagmoney ( > 2015) But less on higher levels: BiLSTM text generation with style: “There are some funny parts but overall I didn’t like the first few funny parts, but overall pretty decent .” “Ultimately, I can honestly say that this movie is full of stupid stupid and stupid stupid stupid stupid stupid.” https://arxiv.org/pdf/1707.02633.pdf - Controlling Linguistic Style Aspects in Neural Language Generation
  • 25. NLP in the academy Multiple stacked networks Each architecture solves a specific problem (or a dataset...) NGrams, TF/IDF, LDA / Topic modeling Word2vec / GloVe / FastText ELMo (AI2) / BERT / ERNIE (Baidu) GPT-2 (OpenAI) LSTM spaCy Flair & Regular Expressions... NLP in the industry
  • 26. 20 years of NLP research Great progress on “low-level” text understanding tasks: ● Morphology ● Syntax / Parsing ● Semantic Roles. ● Lexical Semantics ● Discourse. Coreference. ● Negation scope Representation learning, text-to-text semantics, RNNs ● Word vectors ● Concept vectors ● Sentence vectors ● Paraphrasing
  • 27. 20 years of NLP research Great progress on “low-level” text understanding tasks: ● Morphology ● Syntax / Parsing ● Semantic Roles. ● Lexical Semantics ● Discourse. Coreference. ● Negation scope Representation learning, text-to-text semantics, RNNs ● Word vectors ● Concept vectors ● Sentence vectors ● Paraphrasing Remarkable Achievements Not useful for non-experts Automatically learn intricate linguistic patterns Bad with nuance. Contains biases. Hard to control.
  • 28. Short recup of the last few years
  • 29. 2013 Language model Learn joint likelihood of training sentences Word2Vec … GloVe … FastText … First use of transfer learning in NLP
  • 30. 2018 ELMo - Contextual word vectors ULMFiT - transfer learning + fine tuning BERT - the above & more, on large scale 2019 Transformer-XL GPT-2
  • 31. BERT - architecture in brief Embedding:
  • 32. BERT - architecture in brief ● Embedding + Masking (Cloze) ● Transformer ● Sentence pairing ● Fine tuning ○ Sentence class Label ○ NER Sequence Labels ● Question Answering: ○ Start+End Vector & Softmax over all positions (as additional parameters) ○ Or Running Q + A (for each option) and training on the class label
  • 33. Further reading ● Toward data science: ○ Dissecting BERT (parts 1 & 2) - by Miguel Romero Calvo ○ Deconstructing BERT (parts 1 & 2) - by Jesse Vig ● HarvardNLP - The Annotated Transformer ● Jay Alammar - The illustrated BERT, ELMo and co. ● NLP.Stanford.edu/Seminar - Jacob Delvin presentation about BERT
  • 34. Size matters ● ULMFiT ● ELMo ● GLoMo ● GPT ● BERT ● Transformer-XL ● MT-DNN ● GPT-2 ● Contextual representations ● Generalized Language models ● Transfer learning in NLP
  • 35. Text Generation Dissecting GPT-2 results HavardNLP Giant Language model Test Room http://gltr.io
  • 36. NLP Progress - sample domains Cross lingual Text Classification: pyTorch - LASER Text Classification (English): ULMFiT Automatic Speech Recognition (ASR): Google Tensorflow Lingvo Translation: DeepL / Transformer Big (Facebook + Google) NER: flair / BERT Entity linking: DeepType - OpenAI nlpprogress.com | github.com/syhw/wer_are_we
  • 37. Q&A Datasets & Benchmarks General Language Understanding Evaluation (GLUE) Stanford Natural Language Inference (SNLI) SQuAD / SQuAD 2.0 RACE (Carnegie Mellon) HotpotQA, MS Marco, NarrativeQA, CoQA ... Google AI natural questions dataset
  • 38. Question Answering - State of the art
  • 39. Question Answering - State of the art? Depends on the dataset… Each information extraction (IE) task is a different NLP application (Also relevant for chatbots / personal assistants)
  • 40. Information Extraction Turning text into structured data ● Most data in the world is in text ● We can only easily analyze structured data ● Today - no easy way to do it on scale ● Possible for specific tasks - but requires ○ Enough data ○ Team of NLP experts ○ Time
  • 41. Solving IE can be transformative to industry & science
  • 42. Question Answering - next steps In which state was Yahoo founded? http://ai.stanford.edu/blog/beyond-local-pattern-matching/
  • 43. Question Answering - next steps ● (Cleverly) add more inductive biases ● Common sense Possible methods: ● Interactive learning (human in the loop) ● Deep reinforcement learning ● GANs
  • 44. Still missing... 1. Understanding (Not only NLP-related…) ○ What is captured by a network? ○ What can/’t the networks learn 2. Explainability 3. Fairness & accountability ○ Removing bias (current techniques only “put a lipstick on a pig”) 4. Working with little annotated data Transferring knowledge across domains 5. Handling missing data
  • 45. Thank you! We’re “standing on the shoulders of giants” - Isaac Newton
  • 46. Appendix 1 - handling missing data
  • 47. Handling missing data - Mary is a great programmer but John isn’t - The hotel received great reviews at Yelp but mediocre reviews at TripAdvisor - Fred took a picture of you and Susan of me
  • 48. Handling missing data - Mary is a great programmer but John isn’t _____ (verb phrase ellipsis) - The hotel received great reviews at Yelp but _____ mediocre reviews at TripAdvisor (argument clusters) - Fred took a picture of you and Susan ______ of me (gapping) (Briding) Prices of oil
  • 49. I turn 40 this year He drove 40 on the highway I’ll give you 40 for that
  • 50. I turn 40 (age) this year He drove 40 (kmh) on the highway I’ll give you 40 (currency) for that
  • 51. Handling missing data ● Current end-to-end systems fails on them ● Stanford parser can partially handle several cases ● Work in progress in BIU
  • 53. Example - review sentiment analysis “The soup was delicious” → + “The steak was tough” → - “The dessert was decadent” → + [food] was [good_food_adj] [food] was [bad_food_adj]
  • 54. Example - review sentiment analysis Real world: the soup that the server recommended was amazing. the soup, which I expected to be good, was actually pretty bad the soup, which was served cold and was looking terrible, was surprisingly great.
  • 55. Example - review sentiment analysis Syntax to the rescue! Capture this: With a single declarative pattern: Can help to generalize and transfer knowledge across domains
  • 56. Example - review sentiment analysis Until it doesn’t: I didn’t think the food was great: Would combining rules with deep learning be the solution?
  • 57. Thank you again! Human: What do we want? Computer: Natural-language processing! Human: When do we want it? Computer: When do we want what?
  • 58. Dynamic Word Embeddings Dynamic Word Embeddings for Evolving Semantic Discovery https://arxiv.org/pdf/1703.00607.pdf