Google Play, the Apple App Store, and the Windows Phone Store are well-known distribution platforms where users can download mobile apps, rate them, and write review comments about the apps they are using. Previous research demonstrated that these reviews contain important information that helps developers improve their apps. However, analyzing reviews is challenging due to the large number of reviews posted every day, the unstructured nature of reviews, and their varying quality. In this demo we present
ARdoc, a tool that combines three techniques: (1) Natural Language Parsing, (2) Text Analysis, and (3) Sentiment Analysis to automatically classify useful feedback contained in app reviews that is important for performing software maintenance and evolution tasks. Our quantitative and qualitative analysis (involving professional mobile developers) demonstrates that ARdoc correctly classifies feedback useful from a maintenance perspective in user reviews with high precision (ranging between 84% and 89%), recall (ranging between 84% and 89%), and F-Measure (ranging between 84% and 89%). While evaluating our tool, the developers in our study confirmed the usefulness of ARdoc in extracting important maintenance tasks for their mobile applications.
4. Users Submit Many Reviews Regularly
iOS apps receive on average 23 reviews per day
Facebook for iOS receives more than 4,000 reviews per day
[ Pagano et al. - RE 2013 ]
5. Past Work
Chen et al – ICSE 2014
AR-Miner: an approach to help app developers discover the most informative user reviews
i. text analysis and machine learning to filter out non-informative reviews
ii. topic analysis to recognize topics treated in the reviews classified as informative
8. Identifying Useful Reviews
i. The awful button in the page doesn’t work → BUG DESCRIPTION
ii. A button in the page should be added → FEATURE REQUEST
9. Available Sources for Identifying Useful Reviews
i. The awful button in the page doesn’t work
ii. A button in the page should be added
Three sources: sentiment → Sentiment Analysis; structure → Natural Language Parsing; lexicon → Text Analysis
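To make the three sources concrete, here is a toy sketch of how sentiment, lexicon, and structure signals could be pulled from a review sentence. This is not ARdoc's actual implementation: the word lists and patterns below are invented for illustration, and ARdoc relies on a real sentiment lexicon and full natural language parsing instead of regular expressions.

```python
# Illustrative sketch: extracting the three signals (sentiment, lexicon,
# structure) from a review sentence. Lexicons here are toy assumptions.
import re

# Tiny hand-made polarity lists (invented for this example).
NEGATIVE = {"awful", "broken", "crash", "terrible"}
POSITIVE = {"great", "love", "nice", "awesome"}

def sentiment(sentence):
    """Crude lexicon-based polarity score clamped to [-1, 1]."""
    words = re.findall(r"[a-z']+", sentence.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    return max(-1, min(1, score))

def lexicon_features(sentence):
    """Keywords that often mark feature requests or problem reports."""
    words = set(re.findall(r"[a-z']+", sentence.lower()))
    return {"has_request_verb": bool(words & {"add", "should", "need", "want"}),
            "has_failure_term": bool(words & {"work", "crash", "fail", "bug"})}

def structure_features(sentence):
    """Shallow stand-in for a full NLP parse: negation and modality."""
    lowered = sentence.lower()
    return {"negated": bool(re.search(r"\b(doesn't|don't|not|never)\b", lowered)),
            "modal": bool(re.search(r"\b(should|could|would|must)\b", lowered))}

for s in ["The awful button in the page doesn't work",
          "A button in the page should be added"]:
    print(s)
    print("  sentiment:", sentiment(s))
    print("  lexicon:  ", lexicon_features(s))
    print("  structure:", structure_features(s))
```

On the two example sentences from the slide, the first scores negative sentiment with negation (a problem report), while the second is neutral with a modal and a request verb (a feature request) — which is exactly why combining the three sources is more informative than lexicon alone.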
21. ARdoc Classification Accuracy
3 Apps: Minesweeper, PowernAPP, Picturex
https://www.scribd.com/document/323048838/ARdoc-Appendix
ARdoc classifies useful feedback with a precision ranging between 84% and 89%, a recall ranging between 84% and 89%, and an F-Measure ranging between 84% and 89%
22. Conclusion & Future Work
1) ARdoc: a novel tool able to mine relevant feedback for real-world developers interested in accomplishing software maintenance and evolution tasks.
2) ARdoc classifies useful feedback with a precision ranging
between 84% and 89%, a recall ranging between 84% and
89%, and an F-Measure ranging between 84% and 89%
Di Sorbo et al., “What Would Users Change in My App? Summarizing App Reviews for Recommending Software Changes” – FSE 2016, 16/11/2016 (Session 11)
24. Thanks for Your Attention!
Stanford CoreNLP
Apache Lucene API
WEKA
Questions?
Editor's Notes
Hi, I’m Andrea Di Sorbo, a PhD student at the University of Sannio. In this paper we investigated possible ways of classifying user reviews according to software maintenance tasks, with the purpose of helping developers improve their apps.
Well, the context of our study is App Stores, such as Apple App Store and Google Play, where we know that users can download apps, give ratings and write reviews about the mobile apps they're using.
Indeed, previous studies demonstrated that about one third of the information contained in user reviews is helpful for developers, giving feedback containing requests for new features, bug descriptions, or requests for improvements to existing functionality.
For example, a study by Pagano (RE 2013) showed that mobile apps receive approximately 23 reviews per day, and popular apps, such as Facebook, receive on average more than 4,000 reviews per day.
To handle this problem, Chen et al. proposed AR-Miner, an approach to help app developers discover the most informative user reviews, which uses
i) text analysis and machine learning to filter out non-informative reviews and
ii) topic analysis to recognize topics treated in "informative" reviews.
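AR-Miner's first step — separating informative from non-informative reviews — can be sketched with a tiny supervised Naive Bayes text classifier. This is only a stand-in for illustration: the training examples below are invented, and AR-Miner's actual filtering uses more sophisticated machine learning over real review corpora.

```python
# Toy sketch of filtering non-informative reviews with a multinomial
# Naive Bayes classifier (Laplace smoothing). Training data is invented.
import math
import re
from collections import Counter, defaultdict

def tokens(text):
    return re.findall(r"[a-z']+", text.lower())

TRAIN = [
    ("the app crashes when I open the camera", "informative"),
    ("please add an option to export my data", "informative"),
    ("login fails after the latest update", "informative"),
    ("best app ever", "non-informative"),
    ("love it", "non-informative"),
    ("five stars", "non-informative"),
]

# Count classes and per-class word frequencies.
class_counts = Counter(label for _, label in TRAIN)
word_counts = defaultdict(Counter)
for text, label in TRAIN:
    word_counts[label].update(tokens(text))
vocab = {w for c in word_counts.values() for w in c}

def classify(text):
    """Return the label with the highest smoothed log-probability."""
    best, best_lp = None, -math.inf
    for label in class_counts:
        lp = math.log(class_counts[label] / len(TRAIN))
        total = sum(word_counts[label].values())
        for w in tokens(text):
            lp += math.log((word_counts[label][w] + 1) / (total + len(vocab)))
        if lp > best_lp:
            best, best_lp = label, lp
    return best

print(classify("the app crashes when loading"))  # -> informative
print(classify("awesome app, love it"))          # -> non-informative
```

Reviews classified as non-informative would simply be discarded before the more expensive topic-analysis step.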
We argue that the text lexicon represents just one of the possible dimensions that can be exploited to detect informative reviews.
In the first review the user exposes a problem, while in the second one the user suggests the implementation of a new feature.
Thus, understanding the intention in user reviews could add precious information
for accomplishing software maintenance and evolution tasks.
We believe that three different dimensions can be explored to determine the intention of a given user review: the sentiment, the structure, and the text lexicon.
Thus our final taxonomy comprised only four categories of sentences: feature request, problem discovery, information seeking, and information giving.
Relying on the techniques previously discussed, we can associate a label with each sentence in the review. These are some examples of useful feedback from users. These example sentences contain relevant information for improving an app: the first two sentences could suggest new functionalities for developers to implement, while the third and fourth sentences indicate bugs that need to be fixed.
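The labeling step can be illustrated with a simplified, purely rule-based sketch. ARdoc itself combines parse structure, a sentiment lexicon, and text features; the surface patterns below are invented assumptions that only approximate that pipeline on a few example sentences.

```python
# Simplified rule-based sketch of mapping review sentences onto the four
# taxonomy categories. Patterns are illustrative, not ARdoc's real rules.
import re

RULES = [
    ("PROBLEM DISCOVERY",   r"doesn't work|\bcrash|\bbug\b|\bfails?\b|\bfreez"),
    ("FEATURE REQUEST",     r"\bshould be add|\bplease add\b|\bwould be (nice|great)\b|\bwish\b"),
    ("INFORMATION SEEKING", r"\bhow (do|can) i\b|\bis there a way\b"),
    ("INFORMATION GIVING",  r"\bfyi\b|just letting you know|\bi updated\b"),
]

def classify_sentence(sentence):
    """Return the first matching category, or OTHER if none applies."""
    lowered = sentence.lower()
    for label, pattern in RULES:
        if re.search(pattern, lowered):
            return label
    return "OTHER"

examples = [
    "The awful button in the page doesn't work",   # -> PROBLEM DISCOVERY
    "A button in the page should be added",        # -> FEATURE REQUEST
    "How do I restore my saved games?",            # -> INFORMATION SEEKING
]
for s in examples:
    print(f"{classify_sentence(s):20s} {s}")
```

A purely lexical rule set like this is brittle, which is precisely the motivation for ARdoc's combination of parsing, sentiment, and text analysis rather than keyword matching alone.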