Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B

•

1 like•389 views

Association for Computational Linguistics

BioNLP 2017 - W17-2309

Education

Neural
Question Answering
at BioASQ 5B
Georg Wiese, Dirk Weissenborn, Mariana Neves

Motivation
● Neural question answering (QA) systems are end-to-end trainable
machine learning models which achieve top performance in domains
with large training datasets
● We apply an extractive neural QA system (FastQA [1]) to BioASQ 5B
Phase B (list & factoid questions)
● Extractive QA: Answer is given as start and end pointers in the
context (snippets)
2

Network Architecture
3
Original FastQA [1] Our Architecture

Network Architecture
4
Original FastQA [1] Our Architecture
Input Layer
● GloVe & character embeddings
(like the original FastQA)
● Biomedical embeddings [3]
● Question type features

Network Architecture
Original FastQA [1] Our Architecture
Output Layer
● Change start probability activation
from softmax to sigmoid
● -> Multiple starts can be selected for
list questions
● For each selected start, select the
corresponding end pointer via softmax
5

Network Architecture
6
Original FastQA [1] Our Architecture

Training Procedure
● Problem: Neural QA typically requires ~105
questions to train
● Datasets of such scale exist in the open domain, e.g. SQuAD [2] with
~105
factoid questions on Wikipedia articles
● We train in two steps:
7
1. Pre-training on a large (~105
questions) open-domain dataset (SQuAD)
2. Fine-tuning on BioASQ (~103
questions)

Systems
● We trained five models using 5-fold cross validation on all available
training data
● We submitted two systems:
○ Single: Best single model according to its respective development set
○ Ensemble: Ensemble of all five models (averaging scores before
sigmoid/softmax activation)
8

Results
Factoid Results:
● Our system won 3/5 batches
● Averaged over the five
batches, our system
(ensemble) was 1.5
percentage points above the
best competitor
9

Results
List Results:
● Our system won 2/5
batches
● On average, the best
competitor performed 3.4
percentage points better
than our ensemble model
10

Discussion
Strengths: Competitive performance, despite:
● Less feature engineering than traditional QA systems
● A less domain-dependent architecture, because we don’t rely on
domain-specific structured resources
Limitations:
● Extractive QA cannot generate answer which are not explicitly
mentioned in the snippets
-> No yes/no & summary questions
11

References
[1] Weissenborn et al.: “Making Neural QA as Simple as Possible but not Simpler”
[2] Rajpurkar et al.: “SQuAD: 100,000+ Questions for Machine Comprehension of Text”
[3] Pavlopoulos et al.: “Continuous Space Word Vectors Obtained by Applying Word2Vec to
Abstracts of Biomedical Articles”
12

Thank You. Questions?
Related CONLL paper:
“Neural Domain Adaptation for Biomedical
Question Answering”
Contact:
georg.wiese@student.hpi.de
13

Similar to Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B

TMPA-2017: Regression Testing with Semiautomatic Test Selection for Auditing ...Iosif Itkin

Agile Acceptance testing with FitnesseClareMcLennan

Performance Test Driven Development with Oracle Coherencearagozin

Testing FrameworksMoataz Nabil

Automated Testing Environment by Bugzilla, Testopia and Jenkinswalkerchang

Improving neural question generation using answer separationNAVER Engineering

Building largescalepredictionsystemv1arthi v

Continuous Performance TestingMark Price

Test Levels & TechniquesDhanasekaran Nagarajan

Bug zillatestopiajenkinsSamira Kumar Nanda

Selenium Automation FrameworkMindfire Solutions

Developers Testing - Girl Code at bloomonIneke Scheffers

Lecture #6. automation testing (andrey oleynik)Andrey Oleynik

QA Meetup at Signavio (Berlin, 06.06.19)Anesthezia

Triantafyllia VoulibasiISSEL

Introduction to K6Knoldus Inc.

Test and Behaviour Driven Development (TDD/BDD)Lars Thorup

VT.NET 20160411: An Intro to Test Driven Development (TDD)Rob Hale

Query optimizationZunera Bukhari

prohuddle-utPLSQL v3 - Ultimate unit testing framework for OracleJacek Gebal

Similar to Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B (20)

TMPA-2017: Regression Testing with Semiautomatic Test Selection for Auditing ...

Agile Acceptance testing with Fitnesse

Performance Test Driven Development with Oracle Coherence

Testing Frameworks

Automated Testing Environment by Bugzilla, Testopia and Jenkins

Improving neural question generation using answer separation

Building largescalepredictionsystemv1

Continuous Performance Testing

Test Levels & Techniques

Bug zillatestopiajenkins

Selenium Automation Framework

Developers Testing - Girl Code at bloomon

Lecture #6. automation testing (andrey oleynik)

QA Meetup at Signavio (Berlin, 06.06.19)

Triantafyllia Voulibasi

Introduction to K6

Test and Behaviour Driven Development (TDD/BDD)

VT.NET 20160411: An Intro to Test Driven Development (TDD)

Query optimization

prohuddle-utPLSQL v3 - Ultimate unit testing framework for Oracle

Recently uploaded

size separation d pharm 1st year pharmaceuticspragatimahajan3

Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdfQucHHunhnh

Gyanartha SciBizTech Quiz slideshare.pptxShibin Azad

Championnat de France de Tennis de table/siemaillard

slides CapTechTalks Webinar May 2024 Alexander Perry.pptxCapitolTechU

50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...Nguyen Thanh Tu Collection

Basic phrases for greeting and assisting costumersPedroFerreira53928

The Art Pastor's Guide to Sabbath | Steve ThomasonSteve Thomason

Industrial Training Report- AKTU Industrial Training ReportAvinash Rai

Benefits and Challenges of Using Open Educational Resourcesdimpy50

....................Muslim-Law notes.pdfVikramadityaRaj

Advances in production technology of Grapes.pdfDr. M. Kumaresan Hort.

Word Stress rules esl .pptxNicholas Montgomery

INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdfbu07226

Morse OER Some Benefits and Challenges.pptxjmorse8

How to the fix Attribute Error in odoo 17Celine George

B.ed spl. HI pdusu exam paper-2023-24.pdfSpecial education needs

Dementia (Alzheimer & vasular dementia).Mohamed Rizk Khodair

UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...Sayali Powar

Basic Civil Engg Notes_Chapter-6_Environment Pollution & EngineeringDenish Jangid

Recently uploaded (20)

size separation d pharm 1st year pharmaceutics

Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdf

Gyanartha SciBizTech Quiz slideshare.pptx

Championnat de France de Tennis de table/

slides CapTechTalks Webinar May 2024 Alexander Perry.pptx

50 ĐỀ LUYỆN THI IOE LỚP 9 - NĂM HỌC 2022-2023 (CÓ LINK HÌNH, FILE AUDIO VÀ ĐÁ...

Basic phrases for greeting and assisting costumers

The Art Pastor's Guide to Sabbath | Steve Thomason

Industrial Training Report- AKTU Industrial Training Report

Benefits and Challenges of Using Open Educational Resources

....................Muslim-Law notes.pdf

Advances in production technology of Grapes.pdf

Word Stress rules esl .pptx

INU_CAPSTONEDESIGN_비밀번호486_업로드용 발표자료.pdf

Morse OER Some Benefits and Challenges.pptx

How to the fix Attribute Error in odoo 17

B.ed spl. HI pdusu exam paper-2023-24.pdf

Dementia (Alzheimer & vasular dementia).

UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...

Basic Civil Engg Notes_Chapter-6_Environment Pollution & Engineering

Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B

1. Neural Question Answering at BioASQ 5B Georg Wiese, Dirk Weissenborn, Mariana Neves

2. Motivation ● Neural question answering (QA) systems are end-to-end trainable machine learning models which achieve top performance in domains with large training datasets ● We apply an extractive neural QA system (FastQA [1]) to BioASQ 5B Phase B (list & factoid questions) ● Extractive QA: Answer is given as start and end pointers in the context (snippets) 2

3. Network Architecture 3 Original FastQA [1] Our Architecture

4. Network Architecture 4 Original FastQA [1] Our Architecture Input Layer ● GloVe & character embeddings (like the original FastQA) ● Biomedical embeddings [3] ● Question type features

5. Network Architecture Original FastQA [1] Our Architecture Output Layer ● Change start probability activation from softmax to sigmoid ● -> Multiple starts can be selected for list questions ● For each selected start, select the corresponding end pointer via softmax 5

6. Network Architecture 6 Original FastQA [1] Our Architecture

7. Training Procedure ● Problem: Neural QA typically requires ~105 questions to train ● Datasets of such scale exist in the open domain, e.g. SQuAD [2] with ~105 factoid questions on Wikipedia articles ● We train in two steps: 7 1. Pre-training on a large (~105 questions) open-domain dataset (SQuAD) 2. Fine-tuning on BioASQ (~103 questions)

8. Systems ● We trained five models using 5-fold cross validation on all available training data ● We submitted two systems: ○ Single: Best single model according to its respective development set ○ Ensemble: Ensemble of all five models (averaging scores before sigmoid/softmax activation) 8

9. Results Factoid Results: ● Our system won 3/5 batches ● Averaged over the five batches, our system (ensemble) was 1.5 percentage points above the best competitor 9

10. Results List Results: ● Our system won 2/5 batches ● On average, the best competitor performed 3.4 percentage points better than our ensemble model 10

11. Discussion Strengths: Competitive performance, despite: ● Less feature engineering than traditional QA systems ● A less domain-dependent architecture, because we don’t rely on domain-specific structured resources Limitations: ● Extractive QA cannot generate answer which are not explicitly mentioned in the snippets -> No yes/no & summary questions 11

12. References [1] Weissenborn et al.: “Making Neural QA as Simple as Possible but not Simpler” [2] Rajpurkar et al.: “SQuAD: 100,000+ Questions for Machine Comprehension of Text” [3] Pavlopoulos et al.: “Continuous Space Word Vectors Obtained by Applying Word2Vec to Abstracts of Biomedical Articles” 12

13. Thank You. Questions? Related CONLL paper: “Neural Domain Adaptation for Biomedical Question Answering” Contact: georg.wiese@student.hpi.de 13

Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B

Recommended

Recommended

More Related Content

Similar to Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B

Similar to Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B (20)

More from Association for Computational Linguistics

More from Association for Computational Linguistics (20)

Recently uploaded

Recently uploaded (20)

Georg Wiese - 2017 - Neural Question Answering at BioASQ 5B