Skip to main content

2017 | OriginalPaper | Buchkapitel

Mining User Requirements from Application Store Reviews Using Frame Semantics

verfasst von : Nishant Jha, Anas Mahmoud

Erschienen in: Requirements Engineering: Foundation for Software Quality

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Context and motivation: Research on mining user reviews in mobile application (app) stores has noticeably advanced in the past few years. The majority of the proposed techniques rely on classifying the textual description of user reviews into different categories of technically informative user requirements and uninformative feedback. Question/Problem: Relying on the textual attributes of reviews often produces high dimensional models. This increases the complexity of the classifier and can lead to overfitting problems. Principal ideas/results: We propose a novel semantic approach for app review classification. The proposed approach is based on the notion of semantic role labeling, or characterizing the lexical meaning of text in terms of semantic frames. Semantic frames help to generalize from text (individual words) to more abstract scenarios (contexts). This reduces the dimensionality of the data and enhances the predictive capabilities of the classifier. Three datasets of user reviews are used to conduct our experimental analysis. Results show that semantic frames can be used to generate lower dimensional and more accurate models in comparison to text classification methods. Contribution: A novel semantic approach for extracting user requirements from app reviews. The proposed approach enables a more efficient classification process and reduces the chance of overfitting.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agarwal, A., Balasubramanian, S., Kotalwar, A., Zheng, J., Rambow, O.: Frame semantic tree kernels for social network extraction from text. In: Conference of the European Chapter of the Association for Computational Linguistics, pp. 211–219 (2014) Agarwal, A., Balasubramanian, S., Kotalwar, A., Zheng, J., Rambow, O.: Frame semantic tree kernels for social network extraction from text. In: Conference of the European Chapter of the Association for Computational Linguistics, pp. 211–219 (2014)
2.
Zurück zum Zitat Baker, C., Fillmore, C., Lowe, J.: The Berkeley framenet project. In: International Conference on Computational Linguistics, pp. 86–90 (1998) Baker, C., Fillmore, C., Lowe, J.: The Berkeley framenet project. In: International Conference on Computational Linguistics, pp. 86–90 (1998)
3.
Zurück zum Zitat Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
4.
Zurück zum Zitat Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.): The Adaptive Web: Methods and Strategies of Web Personalization. Springer, Heidelberg (2007). pp. 335–336 Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.): The Adaptive Web: Methods and Strategies of Web Personalization. Springer, Heidelberg (2007). pp. 335–336
5.
Zurück zum Zitat Burges, C.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Discov. 2(2), 121–167 (1998)CrossRef Burges, C.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Discov. 2(2), 121–167 (1998)CrossRef
6.
Zurück zum Zitat Carreńo, G., Winbladh, K.: Analysis of user comments: an approach for software requirements evolution. In: International Conference on Software Engineering, pp. 582–591 (2013) Carreńo, G., Winbladh, K.: Analysis of user comments: an approach for software requirements evolution. In: International Conference on Software Engineering, pp. 582–591 (2013)
7.
Zurück zum Zitat Chen, N., Lin, J., Hoi, S., Xiao, X., Zhang, B.: AR-Miner: mining informative reviews for developers from mobile app marketplace. In: International Conference on Software Engineering, pp. 767–778 (2014) Chen, N., Lin, J., Hoi, S., Xiao, X., Zhang, B.: AR-Miner: mining informative reviews for developers from mobile app marketplace. In: International Conference on Software Engineering, pp. 767–778 (2014)
8.
Zurück zum Zitat Das, D., Schneider, N., Chen, D., Smith, N.: SEMAFOR 1.0: A probabilistic frame-semantic parser (2010) Das, D., Schneider, N., Chen, D., Smith, N.: SEMAFOR 1.0: A probabilistic frame-semantic parser (2010)
9.
10.
Zurück zum Zitat Fillmore, C.: Frame semantics and the nature of language. In: Annals of the New York Academy of Sciences: Conference on the Origin and Development of Language and Speech, pp. 20–32 (1976) Fillmore, C.: Frame semantics and the nature of language. In: Annals of the New York Academy of Sciences: Conference on the Origin and Development of Language and Speech, pp. 20–32 (1976)
11.
Zurück zum Zitat Fleischman, M., Kwon, N., Hovy, E.: Maximum entropy models for FrameNet classification. In: Empirical Methods in Natural Language Processing, pp. 49–56 (2003) Fleischman, M., Kwon, N., Hovy, E.: Maximum entropy models for FrameNet classification. In: Empirical Methods in Natural Language Processing, pp. 49–56 (2003)
12.
Zurück zum Zitat Guzman, E., Maalej, W.: How do users like this feature? A fine grained sentiment analysis of app reviews. In: Requirements Engineering Conference, pp. 153–162 (2014) Guzman, E., Maalej, W.: How do users like this feature? A fine grained sentiment analysis of app reviews. In: Requirements Engineering Conference, pp. 153–162 (2014)
13.
Zurück zum Zitat Hasa, K., Ng, V.: Frame semantics for stance classification. In: Computational Natural Language Learning, pp. 124–132 (2013) Hasa, K., Ng, V.: Frame semantics for stance classification. In: Computational Natural Language Learning, pp. 124–132 (2013)
14.
Zurück zum Zitat Iacob, C., Harrison, R.: Retrieving and analyzing mobile apps feature requests from online reviews. In: Mining Software Repositories, pp. 41–44 (2013) Iacob, C., Harrison, R.: Retrieving and analyzing mobile apps feature requests from online reviews. In: Mining Software Repositories, pp. 41–44 (2013)
15.
Zurück zum Zitat Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). doi:10.1007/BFb0026683 CrossRef Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). doi:10.​1007/​BFb0026683 CrossRef
16.
Zurück zum Zitat Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International Joint Conference on Artificial Intelligence, pp. 1137–1143 (1995) Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International Joint Conference on Artificial Intelligence, pp. 1137–1143 (1995)
17.
Zurück zum Zitat Kumar Sinha, S.: Answering Questions About Complex Events. University of California at Berkeley (2008) Kumar Sinha, S.: Answering Questions About Complex Events. University of California at Berkeley (2008)
18.
Zurück zum Zitat Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classifiers. In: National Conference on Artificial Intelligence, pp. 223–228 (1992) Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classifiers. In: National Conference on Artificial Intelligence, pp. 223–228 (1992)
19.
Zurück zum Zitat Lovins, J.: Development of a stemming algorithm. Mech. Transl. Comput. Linguist. 11, 22–31 (1968) Lovins, J.: Development of a stemming algorithm. Mech. Transl. Comput. Linguist. 11, 22–31 (1968)
20.
Zurück zum Zitat Maalej, W., Nabil, H.: Bug report, feature request, or simply praise? On automatically classifying app reviews. In: Requirements Engineering Conference, pp. 116–125 (2015) Maalej, W., Nabil, H.: Bug report, feature request, or simply praise? On automatically classifying app reviews. In: Requirements Engineering Conference, pp. 116–125 (2015)
21.
Zurück zum Zitat Martin, W., Harman, M., Jia, Y., Sarro, F., Zhang, Y.: The app sampling problem for app store mining. In: Working Conference on Mining Software Repositories, pp. 123–133 (2015) Martin, W., Harman, M., Jia, Y., Sarro, F., Zhang, Y.: The app sampling problem for app store mining. In: Working Conference on Mining Software Repositories, pp. 123–133 (2015)
22.
Zurück zum Zitat McCallum, A., Nigam, K.: A comparison of event models for naive Bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization, pp. 41–48 (1998) McCallum, A., Nigam, K.: A comparison of event models for naive Bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization, pp. 41–48 (1998)
23.
Zurück zum Zitat Mitchell, T.: Machine Learning. McGraw-Hill, New York City (1997)MATH Mitchell, T.: Machine Learning. McGraw-Hill, New York City (1997)MATH
24.
Zurück zum Zitat Moschitti, A., Morarescu, P., Harabagiu, S.: Open domain information extraction via automatic semantic labeling. In: The Florida Artificial Intelligence Research Society Conference, pp. 397–401 (2003) Moschitti, A., Morarescu, P., Harabagiu, S.: Open domain information extraction via automatic semantic labeling. In: The Florida Artificial Intelligence Research Society Conference, pp. 397–401 (2003)
25.
Zurück zum Zitat Pagano, D., Maalej, W.: User feedback in the AppStore: an empirical study. In: Requirements Engineering Conference, pp. 125–134 (2013) Pagano, D., Maalej, W.: User feedback in the AppStore: an empirical study. In: Requirements Engineering Conference, pp. 125–134 (2013)
26.
Zurück zum Zitat Panichella, S., Di Sorbo, A., Guzman, E., Visaggio, C., Canfora, G., Gall, H.: How can I improve my app? Classifying user reviews for software maintenance and evolution. In: International Conference on Software Maintenance and Evolution, pp. 281–290 (2015) Panichella, S., Di Sorbo, A., Guzman, E., Visaggio, C., Canfora, G., Gall, H.: How can I improve my app? Classifying user reviews for software maintenance and evolution. In: International Conference on Software Maintenance and Evolution, pp. 281–290 (2015)
27.
Zurück zum Zitat Platt, J.: Fast training of Support Vector Machines using sequential minimal optimization. In: Schoelkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT Press, Cambridge (1998) Platt, J.: Fast training of Support Vector Machines using sequential minimal optimization. In: Schoelkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT Press, Cambridge (1998)
28.
Zurück zum Zitat Quinlan, J.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986) Quinlan, J.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)
29.
Zurück zum Zitat Shen, D., Lapata, M.: Using semantic roles to improve question answering. In: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 12–21 (2007) Shen, D., Lapata, M.: Using semantic roles to improve question answering. In: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 12–21 (2007)
30.
Zurück zum Zitat Steinwart, I.: On the influence of the kernel on the consistency of Support Vector Machines. J. Mach. Learn. Res. 2, 67–93 (2001)MathSciNetMATH Steinwart, I.: On the influence of the kernel on the consistency of Support Vector Machines. J. Mach. Learn. Res. 2, 67–93 (2001)MathSciNetMATH
31.
Zurück zum Zitat Üstün, B., Melssen, W., Buydens, L.: Facilitating the application of support vector regression by using a universal Pearson VII function based kernel. Chemometr. Intell. Lab. Syst. 81, 29–40 (2006)CrossRef Üstün, B., Melssen, W., Buydens, L.: Facilitating the application of support vector regression by using a universal Pearson VII function based kernel. Chemometr. Intell. Lab. Syst. 81, 29–40 (2006)CrossRef
32.
Zurück zum Zitat Xie, B., Passonneau, R., Wu, L., Creamer, G.: Semantic frames to predict stock price movement. In: Annual Meeting of the Association for Computational Linguistics, pp. 873–883 (2013) Xie, B., Passonneau, R., Wu, L., Creamer, G.: Semantic frames to predict stock price movement. In: Annual Meeting of the Association for Computational Linguistics, pp. 873–883 (2013)
Metadaten
Titel
Mining User Requirements from Application Store Reviews Using Frame Semantics
verfasst von
Nishant Jha
Anas Mahmoud
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-54045-0_20

Premium Partner