Skip to main content
Erschienen in: Empirical Software Engineering 3/2016

01.06.2016

Analyzing and automatically labelling the types of user issues that are raised in mobile app reviews

verfasst von: Stuart McIlroy, Nasir Ali, Hammad Khalid, Ahmed E. Hassan

Erschienen in: Empirical Software Engineering | Ausgabe 3/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Mobile app reviews by users contain a wealth of information on the issues that users are experiencing. For example, a review might contain a feature request, a bug report, and/or a privacy complaint. Developers, users and app store owners (e.g. Apple, Blackberry, Google, Microsoft) can benefit from a better understanding of these issues – developers can better understand users’ concerns, app store owners can spot anomalous apps, and users can compare similar apps to decide which ones to download or purchase. However, user reviews are not labelled, e.g. we do not know which types of issues are raised in a review. Hence, one must sift through potentially thousands of reviews with slang and abbreviations to understand the various types of issues. Moreover, the unstructured and informal nature of reviews complicates the automated labelling of such reviews. In this paper, we study the multi-labelled nature of reviews from 20 mobile apps in the Google Play Store and Apple App Store. We find that up to 30 % of the reviews raise various types of issues in a single review (e.g. a review might contain a feature request and a bug report). We then propose an approach that can automatically assign multiple labels to reviews based on the raised issues with a precision of 66 % and recall of 65 %. Finally, we apply our approach to address three proof-of-concept analytics use case scenarios: (i) we compare competing apps to assist developers and users, (ii) we provide an overview of 601,221 reviews from 12,000 apps in the Google Play Store to assist app store owners and developers and (iii) we detect anomalous apps in the Google Play Store to assist app store owners and users.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abubakar AM, Jawawi DN (2013) A study on code peer review process monitoring using statistical process control. In: e-Proceeding of Software Engineering Postgraduates Workshop (SEPoW), p 136 Abubakar AM, Jawawi DN (2013) A study on code peer review process monitoring using statistical process control. In: e-Proceeding of Software Engineering Postgraduates Workshop (SEPoW), p 136
Zurück zum Zitat Ahsan SN, Ferzund J, Wotawa F (2009) Automatic classification of software change request using multi-label machine learning methods. In: Software Engineering Workshop (SEW), 2009 33rd Annual IEEE:79–86 Ahsan SN, Ferzund J, Wotawa F (2009) Automatic classification of software change request using multi-label machine learning methods. In: Software Engineering Workshop (SEW), 2009 33rd Annual IEEE:79–86
Zurück zum Zitat Antoniol G, Ayari K, Di Penta M, Khomh F, Guéhéneuc YG (2008) Is it a bug or an enhancement?: a text-based approach to classify change requests. In: Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds, ser. CASCON ACM, pp 23:304–23:318 Antoniol G, Ayari K, Di Penta M, Khomh F, Guéhéneuc YG (2008) Is it a bug or an enhancement?: a text-based approach to classify change requests. In: Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds, ser. CASCON ACM, pp 23:304–23:318
Zurück zum Zitat Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Technol 3:993–1022MATH Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Technol 3:993–1022MATH
Zurück zum Zitat Brameier M, Banzhaf W (2001) A comparison of linear genetic programming and neural networks in medical data mining. Evol Comput IEEE Trans 5(1):17–26CrossRefMATH Brameier M, Banzhaf W (2001) A comparison of linear genetic programming and neural networks in medical data mining. Evol Comput IEEE Trans 5(1):17–26CrossRefMATH
Zurück zum Zitat Butler M (2011) Android: Changing the mobile landscape. Pervasive Comput IEEE 10(1):4–7CrossRef Butler M (2011) Android: Changing the mobile landscape. Pervasive Comput IEEE 10(1):4–7CrossRef
Zurück zum Zitat Esbensen KH, Guyot D, Westad F, Houmoller LP (2002) Multivariate data analysis: in practice: an introduction to multivariate data analysis and experimental design. Multivariate Data Analysis Esbensen KH, Guyot D, Westad F, Houmoller LP (2002) Multivariate data analysis: in practice: an introduction to multivariate data analysis and experimental design. Multivariate Data Analysis
Zurück zum Zitat Fan R-E, Lin C-J (2007) A study on threshold selection for multi-label classification. In: Department of Computer Science, National Taiwan University Fan R-E, Lin C-J (2007) A study on threshold selection for multi-label classification. In: Department of Computer Science, National Taiwan University
Zurück zum Zitat Fu B, Lin J, Li L, Faloutsos C, Hong J, Sadeh N (2013) Why people hate your app: Making sense of user feedback in a mobile app store. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’13. New York, NY, USA: ACM, pp 1276–1284. Available: doi:10.1145/2487575.2488202 Fu B, Lin J, Li L, Faloutsos C, Hong J, Sadeh N (2013) Why people hate your app: Making sense of user feedback in a mobile app store. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’13. New York, NY, USA: ACM, pp 1276–1284. Available: doi:10.​1145/​2487575.​2488202
Zurück zum Zitat Galvis Carreño LV, Winbladh K (2013) Analysis of user comments: an approach for software requirements evolution. In: Proceedings of the 2013 International Conference on Software Engineering, ser. ICSE ’13. Piscataway, NJ, USA: IEEE Press, pp 582–591 Galvis Carreño LV, Winbladh K (2013) Analysis of user comments: an approach for software requirements evolution. In: Proceedings of the 2013 International Conference on Software Engineering, ser. ICSE ’13. Piscataway, NJ, USA: IEEE Press, pp 582–591
Zurück zum Zitat Ganesan K, Zhai C, Viegas E (2012) Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions. In: Proceedings of the 21st international conference on World Wide Web, ser. WWW ’12. New York, NY, USA: ACM, pp 869–878 Ganesan K, Zhai C, Viegas E (2012) Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions. In: Proceedings of the 21st international conference on World Wide Web, ser. WWW ’12. New York, NY, USA: ACM, pp 869–878
Zurück zum Zitat Ghaith S, Wang M, Perry P, Murphy J (2013) Profile-based, load-independent anomaly detection and analysis in performance regression testing of software systems. In: 17th European Conference on IEEE Software Maintenance and Reengineering (CSMR), 2013, pp 379–383 Ghaith S, Wang M, Perry P, Murphy J (2013) Profile-based, load-independent anomaly detection and analysis in performance regression testing of software systems. In: 17th European Conference on IEEE Software Maintenance and Reengineering (CSMR), 2013, pp 379–383
Zurück zum Zitat Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009a) The weka data mining software: an update. SIGKDD Explor Newsl 11 (1):10–18. Available: doi:10.1145/1656274.1656278 Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009a) The weka data mining software: an update. SIGKDD Explor Newsl 11 (1):10–18. Available: doi:10.​1145/​1656274.​1656278
Zurück zum Zitat Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009b) The weka data mining software: an update. ACM SIGKDD Explor Newsl 11 (1):10–18CrossRef Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009b) The weka data mining software: an update. ACM SIGKDD Explor Newsl 11 (1):10–18CrossRef
Zurück zum Zitat Han D, Zhang C, Fan X, Hindle A, Wong K, Stroulia E (2012) Understanding android fragmentation with topic analysis of vendor-specific bugs. In: 19th Working Conference on Reverse Engineering (WCRE), 2012. IEEE, pp 83–92 Han D, Zhang C, Fan X, Hindle A, Wong K, Stroulia E (2012) Understanding android fragmentation with topic analysis of vendor-specific bugs. In: 19th Working Conference on Reverse Engineering (WCRE), 2012. IEEE, pp 83–92
Zurück zum Zitat Harman M, Jia Y, Test YZ (2012) App store mining and analysis: Msr for app stores Harman M, Jia Y, Test YZ (2012) App store mining and analysis: Msr for app stores
Zurück zum Zitat Herzig K, Just S, Zeller A (2013) It’s not a bug, it’s a feature: how misclassification impacts bug prediction. In: Proceedings of the 2013 International Conference on Software Engineering. IEEE Press, pp 392–401 Herzig K, Just S, Zeller A (2013) It’s not a bug, it’s a feature: how misclassification impacts bug prediction. In: Proceedings of the 2013 International Conference on Software Engineering. IEEE Press, pp 392–401
Zurück zum Zitat Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 168–177 Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 168–177
Zurück zum Zitat Iacob C, Harrison R (2013) Retrieving and analyzing mobile apps feature requests from online reviews. In: Proceedings of the Tenth International Workshop on Mining Software Repositories. IEEE Press, pp 41–44 Iacob C, Harrison R (2013) Retrieving and analyzing mobile apps feature requests from online reviews. In: Proceedings of the Tenth International Workshop on Mining Software Repositories. IEEE Press, pp 41–44
Zurück zum Zitat Jindal N, Liu B (2007) Review spam detection. In: Proceedings of the 16th international conference on World Wide Web. ACM, pp 1189–1190 Jindal N, Liu B (2007) Review spam detection. In: Proceedings of the 16th international conference on World Wide Web. ACM, pp 1189–1190
Zurück zum Zitat Khalid H, Nagappan M, Shihab E, Hassan AE (2014) Prioritizing the devices to test your app on: A case study of android game apps. In: 22nd ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2014) Khalid H, Nagappan M, Shihab E, Hassan AE (2014) Prioritizing the devices to test your app on: A case study of android game apps. In: 22nd ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2014)
Zurück zum Zitat Khalid H, Shihab E, Nagappan M, Hassan A (2014) What do mobile app users complain about? A study on free ios apps:1–1 Khalid H, Shihab E, Nagappan M, Hassan A (2014) What do mobile app users complain about? A study on free ios apps:1–1
Zurück zum Zitat Khalid H (2013) On identifying user complaints of ios apps. In: Proceedings of the 2013 International Conference on Software Engineering. IEEE Press Khalid H (2013) On identifying user complaints of ios apps. In: Proceedings of the 2013 International Conference on Software Engineering. IEEE Press
Zurück zum Zitat Kim H-W, Lee HL, Son JE (2011) An exploratory study on the determinants of smartphone app purchase. In: The 11th International DSI and the 16th APDSI Joint Meeting. Taipei, Taiwan Kim H-W, Lee HL, Son JE (2011) An exploratory study on the determinants of smartphone app purchase. In: The 11th International DSI and the 16th APDSI Joint Meeting. Taipei, Taiwan
Zurück zum Zitat Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C (2011) Learning word vectors for sentiment analysis. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Association for Computational Linguistics Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C (2011) Learning word vectors for sentiment analysis. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Association for Computational Linguistics
Zurück zum Zitat Melville P, Gryc W, Lawrence R (2009) Sentiment analysis of blogs by combining lexical knowledge with text classification. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 1275–1284 Melville P, Gryc W, Lawrence R (2009) Sentiment analysis of blogs by combining lexical knowledge with text classification. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 1275–1284
Zurück zum Zitat mobile V (2014) Developer economics Q1 2014: state of the developer nation. Tech Rep:05 mobile V (2014) Developer economics Q1 2014: state of the developer nation. Tech Rep:05
Zurück zum Zitat Mudambi SM, Schuff D (2010) What makes a helpful online review? A study of customer reviews on amazon.com. MIS Q 34(1):185–200 Mudambi SM, Schuff D (2010) What makes a helpful online review? A study of customer reviews on amazon.com. MIS Q 34(1):185–200
Zurück zum Zitat Niculescu MF, Wu DJ (2011) When should software firms commercialize new products via freemium business models. Under Review Niculescu MF, Wu DJ (2011) When should software firms commercialize new products via freemium business models. Under Review
Zurück zum Zitat Nguyen TH, Adams B, Jiang ZM, Hassan AE, Nasser M, Flora P (2012) Automated detection of performance regressions using statistical process control techniques. In: Proceedings of the third joint WOSP/SIPEW international conference on Performance Engineering. ACM, pp 299–310 Nguyen TH, Adams B, Jiang ZM, Hassan AE, Nasser M, Flora P (2012) Automated detection of performance regressions using statistical process control techniques. In: Proceedings of the third joint WOSP/SIPEW international conference on Performance Engineering. ACM, pp 299–310
Zurück zum Zitat Pagano D, Bruegge B (2013) User involvement in software evolution practice: a case study. In: Proceedings of the 2013 International Conference on Software Engineering, pp 953–962 Pagano D, Bruegge B (2013) User involvement in software evolution practice: a case study. In: Proceedings of the 2013 International Conference on Software Engineering, pp 953–962
Zurück zum Zitat Pak A, Paroubek P (2010) Twitter as a corpus for sentiment analysis and opinion mining. In: LREC Pak A, Paroubek P (2010) Twitter as a corpus for sentiment analysis and opinion mining. In: LREC
Zurück zum Zitat Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1-2):1–135CrossRef Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1-2):1–135CrossRef
Zurück zum Zitat Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, pp 79–86 Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10. Association for Computational Linguistics, pp 79–86
Zurück zum Zitat Pang B, Lee L (2004) A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, p 271 Pang B, Lee L (2004) A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, p 271
Zurück zum Zitat Porter MF (1980) An algorithm for suffix stripping. Program: Electronic Library and Information Systems 14(3), 130–137 Porter MF (1980) An algorithm for suffix stripping. Program: Electronic Library and Information Systems 14(3), 130–137
Zurück zum Zitat Rajaraman A, Ullman JD (2012) Mining of massive datasets. In: Cambridge University Press Rajaraman A, Ullman JD (2012) Mining of massive datasets. In: Cambridge University Press
Zurück zum Zitat Ramage D, Rosen E (2011) Stanford topic modeling toolbox Ramage D, Rosen E (2011) Stanford topic modeling toolbox
Zurück zum Zitat Ramage D, Hall D, Nallapati R, Manning CD (2009) Labeled lda: A supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1. Association for Computational Linguistics, pp 248–256 Ramage D, Hall D, Nallapati R, Manning CD (2009) Labeled lda: A supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1. Association for Computational Linguistics, pp 248–256
Zurück zum Zitat Read J (2010) Scalable multi-label classification. Ph.D. dissertation, University of Waikato Read J (2010) Scalable multi-label classification. Ph.D. dissertation, University of Waikato
Zurück zum Zitat Read J, Pfahringer B, Holmes G, Frank E (2009) Classifier chains for multi-label classification. In: Machine Learning and Knowledge Discovery in Databases. Springer, pp 254–269 Read J, Pfahringer B, Holmes G, Frank E (2009) Classifier chains for multi-label classification. In: Machine Learning and Knowledge Discovery in Databases. Springer, pp 254–269
Zurück zum Zitat Shewhart WA (1931) Economic control of quality of manufactured product. ASQ Quality Press, vol. 509 Shewhart WA (1931) Economic control of quality of manufactured product. ASQ Quality Press, vol. 509
Zurück zum Zitat Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Am Soc Inf Sci Technol 63(1):163–173. Available: doi:10.1002/asi.21662 Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Am Soc Inf Sci Technol 63(1):163–173. Available: doi:10.​1002/​asi.​21662
Zurück zum Zitat Tsoumakas G, Katakis I, Vlahavas I (2010) Mining multi-label data. In: Data mining and knowledge discovery handbook. Springer, pp 667–685 Tsoumakas G, Katakis I, Vlahavas I (2010) Mining multi-label data. In: Data mining and knowledge discovery handbook. Springer, pp 667–685
Zurück zum Zitat Tsoumakas G, Katakis I (2007) Multi-label classification: An overview. Int J Data Warehousing and Mining (IJDWM) 3(3):1–13CrossRef Tsoumakas G, Katakis I (2007) Multi-label classification: An overview. Int J Data Warehousing and Mining (IJDWM) 3(3):1–13CrossRef
Metadaten
Titel
Analyzing and automatically labelling the types of user issues that are raised in mobile app reviews
verfasst von
Stuart McIlroy
Nasir Ali
Hammad Khalid
Ahmed E. Hassan
Publikationsdatum
01.06.2016
Verlag
Springer US
Erschienen in
Empirical Software Engineering / Ausgabe 3/2016
Print ISSN: 1382-3256
Elektronische ISSN: 1573-7616
DOI
https://doi.org/10.1007/s10664-015-9375-7

Weitere Artikel der Ausgabe 3/2016

Empirical Software Engineering 3/2016 Zur Ausgabe

Premium Partner