skip to main content
10.1145/3180155.3180218acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

Online app review analysis for identifying emerging issues

Published:27 May 2018Publication History

ABSTRACT

Detecting emerging issues (e.g., new bugs) timely and precisely is crucial for developers to update their apps. App reviews provide an opportunity to proactively collect user complaints and promptly improve apps' user experience, in terms of bug fixing and feature refinement. However, the tremendous quantities of reviews and noise words (e.g., misspelled words) increase the difficulties in accurately identifying newly-appearing app issues. In this paper, we propose a novel and automated framework IDEA, which aims to IDentify Emerging App issues effectively based on online review analysis. We evaluate IDEA on six popular apps from Google Play and Apple's App Store, employing the official app changelogs as our ground truth. Experiment results demonstrate the effectiveness of IDEA in identifying emerging app issues. Feedback from engineers and product managers shows that 88.9% of them think that the identified issues can facilitate app development in practice. Moreover, we have successfully applied IDEA to several products of Tencent, which serve hundreds of millions of users.

References

  1. Loulwah AlSumait, Daniel Barbará, and Carlotta Domeniconi. "On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking". In: Proceedings of the 8th IEEE International Conference on Data Mining, ICDM 2008, December 15--19, 2008, Pisa, Italy. 2008, pp. 3--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. App Annie. https://www.appannie.com/en/.Google ScholarGoogle Scholar
  3. R. Arun et al. "On Finding the Natural Number of Topics with Latent Dirichlet Allocation: Some Observations". In: Advances in Knowledge Discovery and Data Mining, 14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21--24, 2010. Proceedings. Part I. 2010, pp. 391--402. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. David M. Blei, Andrew Y. Ng, and Michael I. Jordan. "Latent Dirichlet Allocation". In: Journal of Machine Learning Research 3 (2003), pp. 993--1022. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mario Cataldi, Luigi Di Caro, and Claudio Schifanella. "Emerging topic detection on twitter based on temporal and social terms evaluation". In: Proceedings of the Tenth International Workshop on Multimedia Data Mining (MDMKDD). ACM. 2010, p. 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Ning Chen et al. "AR-miner: mining informative reviews for developers from mobile app marketplace". In: 36th International Conference on Software Engineering, ICSE 2014, Hyderabad, India - May 31 - June 07, 2014. 2014, pp. 767--778. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Yan Chen et al. "Emerging topic detection for organizations from microblogs". In: The 36th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2013, Dublin, Ireland - July 28 - August 01, 2013. 2013, pp. 43--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Facebook Messenger is getting slammed by tons of negative reviews. http://www.businessinsider.com/facebook-messenger-app-store-reviews-are-humiliating-2014-8.Google ScholarGoogle Scholar
  9. Bin Fu et al. "Why people hate your app: making sense of user feedback in a mobile app store". In: The 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, Chicago, IL, USA, August 11--14, 2013. 2013, pp. 1276--1284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Cuiyun Gao et al. "AR-Tracker: Track the Dynamics of Mobile Apps via User Review Mining". In: 2015 IEEE Symposium on Service-Oriented System Engineering, SOSE 2015, San Francisco Bay, CA, USA, March 30 - April 3, 2015. 2015, pp. 284--290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Cuiyun Gao et al. "PAID: Prioritizing app issues for developers by tracking user reviews over versions". In: 26th IEEE International Symposium on Software Reliability Engineering, ISSRE 2015, Gaithersbury, MD, USA, November 2--5, 2015. 2015, pp. 35--45. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Judith Gebauer, Ya Tang, and Chaiwat Baimai. "User requirements of mobile technology: results from a content analysis of user reviews". In: Inf. Syst. E-Business Management 6.4 (2008), pp. 361--384.Google ScholarGoogle ScholarCross RefCross Ref
  13. Xiaodong Gu and Sunghun Kim. ""What Parts of Your Apps are Loved by Users?" (T)". In: 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015, Lincoln, NE, USA, November 9--13, 2015. 2015, pp. 760--770.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Emitza Guzman and Walid Maalej. "How Do Users Like This Feature? A Fine Grained Sentiment Analysis of App Reviews". In: IEEE 22nd International Requirements Engineering Conference, RE 2014, Karlskrona, Sweden, August 25--29, 2014. 2014, pp. 153--162.Google ScholarGoogle ScholarCross RefCross Ref
  15. Leonard Hoon et al. "An analysis of the mobile app review landscape: trends and implications". In: Faculty of Information and Communication Technologies, Swinburne University of Technology, Tech. Rep (2013).Google ScholarGoogle Scholar
  16. Jiajia Huang et al. "A probabilistic method for emerging topic tracking in Microblog stream". In: World Wide Web 20.2 (2017), pp. 325--350. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Claudia Iacob and Rachel Harrison. "Retrieving and analyzing mobile apps feature requests from online reviews". In: Proceedings of the 10th Working Conference on Mining Software Repositories, MSR 2013, San Francisco, CA, USA, May 18--19, 2013. 2013, pp. 41--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Aminul Islam and Diana Zaiu Inkpen. "Semantic text similarity using corpus-based word similarity and string similarity". In: TKDD 2.2 (2008), 10:1--10:25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Jensen Shannon divergence. https://en.wikipedia.org/wiki/Jensen-Shannon_divergence.Google ScholarGoogle Scholar
  20. H Khalid et al. "What do mobile app users complain about? A study on free iOS apps. 2014". In: IEEE Software 10 (2015).Google ScholarGoogle Scholar
  21. Jieun Kim et al. "Trends and relationships of smartphone application services: Analysis of apple app store using text mining-based network analysis". In: Proceedings of the 4th ISPIM Innovation Symposium. 2012.Google ScholarGoogle Scholar
  22. Jey Han Lau, Nigel Collier, and Timothy Baldwin. "On-line Trend Analysis with Topic Models: \#twitter Trends Detection Topic Model Online". In: COLING 2012, 24th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, 8--15 December 2012, Mumbai, India. 2012, pp. 1519--1534.Google ScholarGoogle Scholar
  23. LDA on small datasets. https://stats.stackexchange.com/questions/78926/at-what-point-does-lda-latent-dirichlet-allocation-not-make-sense-to-use.Google ScholarGoogle Scholar
  24. Qingwei Lin et al. "iDice: problem identification for emerging issues". In: Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14--22, 2016. 2016, pp. 214--224. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Walid Maalej and Hadeer Nabil. "Bug report, feature request, or simply praise? On automatically classifying app reviews". In: 23rd IEEE International Requirements Engineering Conference, RE 2015, Ottawa, ON, Canada, August 24--28, 2015. 2015, pp. 116--125.Google ScholarGoogle ScholarCross RefCross Ref
  26. Yichuan Man et al. "Experience Report: Understanding Cross-Platform App Issues from User Reviews". In: 27th IEEE International Symposium on Software Reliability Engineering, ISSRE 2016, Ottawa, ON, Canada, October 23--27, 2016. 2016, pp. 138--149.Google ScholarGoogle Scholar
  27. William Martin, Federica Sarro, and Mark Harman. "Causal impact analysis for app releases in google play". In: Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13--18, 2016. 2016, pp. 435--446. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. William Martin et al. "A Survey of App Store Analysis for Software Engineering". In: IEEE Trans. Software Eng. 43.9 (2017), pp. 817--847.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Stuart McIlroy et al. "Analyzing and automatically labelling the types of user issues that are raised in mobile app reviews". In: Empirical Software Engineering 21.3 (2016), pp. 1067--1106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai. "Automatic labeling of multinomial topic models". In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, California, USA, August 12--15, 2007. 2007, pp. 490--499. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Tomas Mikolov et al. "Distributed Representations of Words and Phrases and their Compositionality". In: Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5--8, 2013, Lake Tahoe, Nevada, United States. 2013, pp. 3111--3119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Multi-tasking in iOS. https://developer.apple.com/ios/human-interface-guidelines/features/multitasking/.Google ScholarGoogle Scholar
  33. Thanh-Son Nguyen, Hady Wirawan Lauw, and Panayiotis Tsaparas. "Review Synthesis for Micro-Review Summarization". In: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, WSDM 2015, Shanghai, China, February 2--6, 2015. 2015, pp. 169--178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. NLTK. http://www.nltk.org.Google ScholarGoogle Scholar
  35. PMI. https://en.wikipedia.org/wiki/Pointwise_mutual_information.Google ScholarGoogle Scholar
  36. Punkt tokenizer. http://www.nltk.org/modules/nltk/tokenize/punkt.html.Google ScholarGoogle Scholar
  37. Peter J. Rousseeuw and Mia Hubert. "Robust statistics for outlier detection". In: Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery 1.1 (2011), pp. 73--79.Google ScholarGoogle ScholarCross RefCross Ref
  38. Federica Sarro et al. "Feature lifecycles as they spread, migrate, remain, and die in App Stores". In: 23rd IEEE International Requirements Engineering Conference, RE 2015, Ottawa, ON, Canada, August 24--28, 2015. 2015, pp. 76--85.Google ScholarGoogle Scholar
  39. Softmax function. https://en.wikipedia.org/wiki/Softmax_function.Google ScholarGoogle Scholar
  40. Andrea Di Sorbo et al. "What would users change in my app? summarizing app reviews for recommending software changes". In: Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13--18, 2016. 2016, pp. 499--510. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. User forum of Youtube iOS. https://productforums.google.com/forum/#!forum/youtube.Google ScholarGoogle Scholar
  42. Phong Minh Vu et al. "Mining User Opinions in Mobile App Reviews: A Keyword-Based Approach (T)". In: 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015, Lincoln, NE, USA, November 9--13, 2015. 2015, pp. 749--759.Google ScholarGoogle Scholar
  43. Xiaojun Wan and Tianming Wang. "Automatic Labeling of Topic Models Using Text Summaries". In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7--12, 2016, Berlin, Germany, Volume 1: Long Papers. 2016.Google ScholarGoogle Scholar
  44. WizNote. https://www.wiz.cn/.Google ScholarGoogle Scholar
  45. Weizhong Zhao et al. "A heuristic approach to determine an appropriate number of topics in topic modeling". In: BMC Bioinformatics 16.13 (2015), S8.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Online app review analysis for identifying emerging issues

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICSE '18: Proceedings of the 40th International Conference on Software Engineering
        May 2018
        1307 pages
        ISBN:9781450356381
        DOI:10.1145/3180155
        • Conference Chair:
        • Michel Chaudron,
        • General Chair:
        • Ivica Crnkovic,
        • Program Chairs:
        • Marsha Chechik,
        • Mark Harman

        Copyright © 2018 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 May 2018

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate276of1,856submissions,15%

        Upcoming Conference

        ICSE 2025

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader