9. 実験1 Text Simplification
Data sets
● Parallel WIkipedia Simplification Corpus (PWKP) (Zhu et al., 2010)
○ train 89,042 pair
○ dev 205 pair
○ test 100 pair
● English Wikipedia and Simple English Wikipedia (EW-SEW) (Hwang et al.2015)
○ train 280,000 pair
○ dev 2000 pair
○ test 359 pair
10. 実験1 Text Simplification
Evaluation Metrics
● Automatic evaluation. BLEU(Paineni et al., 2002)
○ PWKP single reference
○ EW-SEW multi reference
● Human evaluation. (1 is very bad, 5 is very good)
○ Fluency(流暢性) 1 ~ 5
○ Adequacy(妥当性)1 ~ 5
○ Simplicity(簡潔性) 1 ~ 5
16. 実験2 Large Scale Text Summarization
Dataset
Large Scale Chinese Social Media Short Text Summarization Dataset(LCSTS)
2,400,000文ペア
● Part1 2,400,591ペア train
● Part2 8,685ぺア validation
● Part3 725ペア test
Part2とPart3は1~5で自動評価されていて、スコア3以上のものを選択