research-article

Online app review analysis for identifying emerging issues

Authors:
Cuiyun Gao

The Chinese University of Hong Kong, China

The Chinese University of Hong Kong, China
View Profile

,
Jichuan Zeng

The Chinese University of Hong Kong, China

The Chinese University of Hong Kong, China
View Profile

,
Michael R. Lyu

The Chinese University of Hong Kong, China

The Chinese University of Hong Kong, China
View Profile

,
Irwin King

The Chinese University of Hong Kong, China

The Chinese University of Hong Kong, China
View Profile

ICSE '18: Proceedings of the 40th International Conference on Software EngineeringMay 2018Pages 48–58https://doi.org/10.1145/3180155.3180218

Published:27 May 2018Publication History

ICSE '18: Proceedings of the 40th International Conference on Software Engineering

Pages 48–58

ABSTRACT

Detecting emerging issues (e.g., new bugs) timely and precisely is crucial for developers to update their apps. App reviews provide an opportunity to proactively collect user complaints and promptly improve apps' user experience, in terms of bug fixing and feature refinement. However, the tremendous quantities of reviews and noise words (e.g., misspelled words) increase the difficulties in accurately identifying newly-appearing app issues. In this paper, we propose a novel and automated framework IDEA, which aims to IDentify Emerging App issues effectively based on online review analysis. We evaluate IDEA on six popular apps from Google Play and Apple's App Store, employing the official app changelogs as our ground truth. Experiment results demonstrate the effectiveness of IDEA in identifying emerging app issues. Feedback from engineers and product managers shows that 88.9% of them think that the identified issues can facilitate app development in practice. Moreover, we have successfully applied IDEA to several products of Tencent, which serve hundreds of millions of users.

References

Loulwah AlSumait, Daniel Barbará, and Carlotta Domeniconi. "On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking". In: Proceedings of the 8th IEEE International Conference on Data Mining, ICDM 2008, December 15--19, 2008, Pisa, Italy. 2008, pp. 3--12. Google ScholarDigital Library
App Annie. https://www.appannie.com/en/.Google Scholar
R. Arun et al. "On Finding the Natural Number of Topics with Latent Dirichlet Allocation: Some Observations". In: Advances in Knowledge Discovery and Data Mining, 14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21--24, 2010. Proceedings. Part I. 2010, pp. 391--402. Google ScholarDigital Library
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. "Latent Dirichlet Allocation". In: Journal of Machine Learning Research 3 (2003), pp. 993--1022. Google ScholarDigital Library
Mario Cataldi, Luigi Di Caro, and Claudio Schifanella. "Emerging topic detection on twitter based on temporal and social terms evaluation". In: Proceedings of the Tenth International Workshop on Multimedia Data Mining (MDMKDD). ACM. 2010, p. 4. Google ScholarDigital Library
Ning Chen et al. "AR-miner: mining informative reviews for developers from mobile app marketplace". In: 36th International Conference on Software Engineering, ICSE 2014, Hyderabad, India - May 31 - June 07, 2014. 2014, pp. 767--778. Google ScholarDigital Library
Yan Chen et al. "Emerging topic detection for organizations from microblogs". In: The 36th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2013, Dublin, Ireland - July 28 - August 01, 2013. 2013, pp. 43--52. Google ScholarDigital Library
Facebook Messenger is getting slammed by tons of negative reviews. http://www.businessinsider.com/facebook-messenger-app-store-reviews-are-humiliating-2014-8.Google Scholar
Bin Fu et al. "Why people hate your app: making sense of user feedback in a mobile app store". In: The 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, Chicago, IL, USA, August 11--14, 2013. 2013, pp. 1276--1284. Google ScholarDigital Library
Cuiyun Gao et al. "AR-Tracker: Track the Dynamics of Mobile Apps via User Review Mining". In: 2015 IEEE Symposium on Service-Oriented System Engineering, SOSE 2015, San Francisco Bay, CA, USA, March 30 - April 3, 2015. 2015, pp. 284--290. Google ScholarDigital Library
Cuiyun Gao et al. "PAID: Prioritizing app issues for developers by tracking user reviews over versions". In: 26th IEEE International Symposium on Software Reliability Engineering, ISSRE 2015, Gaithersbury, MD, USA, November 2--5, 2015. 2015, pp. 35--45. Google ScholarDigital Library
Judith Gebauer, Ya Tang, and Chaiwat Baimai. "User requirements of mobile technology: results from a content analysis of user reviews". In: Inf. Syst. E-Business Management 6.4 (2008), pp. 361--384.Google ScholarCross Ref
Xiaodong Gu and Sunghun Kim. ""What Parts of Your Apps are Loved by Users?" (T)". In: 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015, Lincoln, NE, USA, November 9--13, 2015. 2015, pp. 760--770.Google ScholarDigital Library
Emitza Guzman and Walid Maalej. "How Do Users Like This Feature? A Fine Grained Sentiment Analysis of App Reviews". In: IEEE 22nd International Requirements Engineering Conference, RE 2014, Karlskrona, Sweden, August 25--29, 2014. 2014, pp. 153--162.Google ScholarCross Ref
Leonard Hoon et al. "An analysis of the mobile app review landscape: trends and implications". In: Faculty of Information and Communication Technologies, Swinburne University of Technology, Tech. Rep (2013).Google Scholar
Jiajia Huang et al. "A probabilistic method for emerging topic tracking in Microblog stream". In: World Wide Web 20.2 (2017), pp. 325--350. Google ScholarDigital Library
Claudia Iacob and Rachel Harrison. "Retrieving and analyzing mobile apps feature requests from online reviews". In: Proceedings of the 10th Working Conference on Mining Software Repositories, MSR 2013, San Francisco, CA, USA, May 18--19, 2013. 2013, pp. 41--44. Google ScholarDigital Library
Aminul Islam and Diana Zaiu Inkpen. "Semantic text similarity using corpus-based word similarity and string similarity". In: TKDD 2.2 (2008), 10:1--10:25. Google ScholarDigital Library
Jensen Shannon divergence. https://en.wikipedia.org/wiki/Jensen-Shannon_divergence.Google Scholar
H Khalid et al. "What do mobile app users complain about? A study on free iOS apps. 2014". In: IEEE Software 10 (2015).Google Scholar
Jieun Kim et al. "Trends and relationships of smartphone application services: Analysis of apple app store using text mining-based network analysis". In: Proceedings of the 4th ISPIM Innovation Symposium. 2012.Google Scholar
Jey Han Lau, Nigel Collier, and Timothy Baldwin. "On-line Trend Analysis with Topic Models: \#twitter Trends Detection Topic Model Online". In: COLING 2012, 24th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, 8--15 December 2012, Mumbai, India. 2012, pp. 1519--1534.Google Scholar
LDA on small datasets. https://stats.stackexchange.com/questions/78926/at-what-point-does-lda-latent-dirichlet-allocation-not-make-sense-to-use.Google Scholar
Qingwei Lin et al. "iDice: problem identification for emerging issues". In: Proceedings of the 38th International Conference on Software Engineering, ICSE 2016, Austin, TX, USA, May 14--22, 2016. 2016, pp. 214--224. Google ScholarDigital Library
Walid Maalej and Hadeer Nabil. "Bug report, feature request, or simply praise? On automatically classifying app reviews". In: 23rd IEEE International Requirements Engineering Conference, RE 2015, Ottawa, ON, Canada, August 24--28, 2015. 2015, pp. 116--125.Google ScholarCross Ref
Yichuan Man et al. "Experience Report: Understanding Cross-Platform App Issues from User Reviews". In: 27th IEEE International Symposium on Software Reliability Engineering, ISSRE 2016, Ottawa, ON, Canada, October 23--27, 2016. 2016, pp. 138--149.Google Scholar
William Martin, Federica Sarro, and Mark Harman. "Causal impact analysis for app releases in google play". In: Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13--18, 2016. 2016, pp. 435--446. Google ScholarDigital Library
William Martin et al. "A Survey of App Store Analysis for Software Engineering". In: IEEE Trans. Software Eng. 43.9 (2017), pp. 817--847.Google ScholarDigital Library
Stuart McIlroy et al. "Analyzing and automatically labelling the types of user issues that are raised in mobile app reviews". In: Empirical Software Engineering 21.3 (2016), pp. 1067--1106. Google ScholarDigital Library
Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai. "Automatic labeling of multinomial topic models". In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, California, USA, August 12--15, 2007. 2007, pp. 490--499. Google ScholarDigital Library
Tomas Mikolov et al. "Distributed Representations of Words and Phrases and their Compositionality". In: Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5--8, 2013, Lake Tahoe, Nevada, United States. 2013, pp. 3111--3119. Google ScholarDigital Library
Multi-tasking in iOS. https://developer.apple.com/ios/human-interface-guidelines/features/multitasking/.Google Scholar
Thanh-Son Nguyen, Hady Wirawan Lauw, and Panayiotis Tsaparas. "Review Synthesis for Micro-Review Summarization". In: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, WSDM 2015, Shanghai, China, February 2--6, 2015. 2015, pp. 169--178. Google ScholarDigital Library
NLTK. http://www.nltk.org.Google Scholar
PMI. https://en.wikipedia.org/wiki/Pointwise_mutual_information.Google Scholar
Punkt tokenizer. http://www.nltk.org/modules/nltk/tokenize/punkt.html.Google Scholar
Peter J. Rousseeuw and Mia Hubert. "Robust statistics for outlier detection". In: Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery 1.1 (2011), pp. 73--79.Google ScholarCross Ref
Federica Sarro et al. "Feature lifecycles as they spread, migrate, remain, and die in App Stores". In: 23rd IEEE International Requirements Engineering Conference, RE 2015, Ottawa, ON, Canada, August 24--28, 2015. 2015, pp. 76--85.Google Scholar
Softmax function. https://en.wikipedia.org/wiki/Softmax_function.Google Scholar
Andrea Di Sorbo et al. "What would users change in my app? summarizing app reviews for recommending software changes". In: Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE 2016, Seattle, WA, USA, November 13--18, 2016. 2016, pp. 499--510. Google ScholarDigital Library
User forum of Youtube iOS. https://productforums.google.com/forum/#!forum/youtube.Google Scholar
Phong Minh Vu et al. "Mining User Opinions in Mobile App Reviews: A Keyword-Based Approach (T)". In: 30th IEEE/ACM International Conference on Automated Software Engineering, ASE 2015, Lincoln, NE, USA, November 9--13, 2015. 2015, pp. 749--759.Google Scholar
Xiaojun Wan and Tianming Wang. "Automatic Labeling of Topic Models Using Text Summaries". In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7--12, 2016, Berlin, Germany, Volume 1: Long Papers. 2016.Google Scholar
WizNote. https://www.wiz.cn/.Google Scholar
Weizhong Zhao et al. "A heuristic approach to determine an appropriate number of topics in topic modeling". In: BMC Bioinformatics 16.13 (2015), S8.Google ScholarCross Ref

Index Terms

Online app review analysis for identifying emerging issues
1. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Environment-specific retrieval
        Web and social media search
2. Software and its engineering
  1. Software organization and properties
    1. Software functional properties
      1. Formal methods
        Dynamic analysis

Recommendations

Crowdsourced App Review Manipulation
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

With the rapid adoption of smartphones worldwide and the reliance on app marketplaces to discover new apps, these marketplaces are critical for connecting users with apps. And yet, the user reviews and ratings on these marketplaces may be strategically ...
Read More
Can app changelogs improve requirements classification from app reviews?: an exploratory study
ESEM '18: Proceedings of the 12th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement

[Background] Recent research on mining app reviews for software evolution indicated that the elicitation and analysis of user requirements can benefit from supplementing user reviews by data from other sources. However, only a few studies reported ...
Read More
Emerging app issue identification from user feedback: experience on WeChat
ICSE-SEIP '19: Proceedings of the 41st International Conference on Software Engineering: Software Engineering in Practice

It is vital for popular mobile apps with large numbers of users to release updates with rich features while keeping stable user experience. Timely and accurately locating emerging app issues can greatly help developers to maintain and update apps. User ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICSE '18: Proceedings of the 40th International Conference on Software Engineering
May 2018
1307 pages
ISBN:9781450356381
DOI:10.1145/3180155
Conference Chair:
Michel Chaudron
Chalmers University of Technology, University of Gothenburg, Sweden
,
General Chair:
Ivica Crnkovic
Chalmers University of Technology, University of Gothenburg, Sweden
,
Program Chairs:
Marsha Chechik
University of Toronto, Canada
,
Mark Harman
Facebook and University College London, United Kingdom
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 May 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
app reviews
emerging issues
online analysis
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate276of1,856submissions,15%

Upcoming Conference

ICSE 2025

2025 IEEE/ACM 46th International Conference on Software Engineering

April 26 - May 3, 2025

Ottawa , ON , Canada
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 77
  Total Citations
  View Citations
- 1,057
  Total Downloads
- Downloads (Last 12 months)124
- Downloads (Last 6 weeks)8
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Online app review analysis for identifying emerging issues

ICSE '18: Proceedings of the 40th International Conference on Software Engineering

ABSTRACT

References

Cited By

Index Terms

Recommendations

Crowdsourced App Review Manipulation

Can app changelogs improve requirements classification from app reviews?: an exploratory study

Emerging app issue identification from user feedback: experience on WeChat