skip to main content
tutorial
Public Access

A Review on Quantification Learning

Published:26 September 2017Publication History
Skip Abstract Section

Abstract

The task of quantification consists in providing an aggregate estimation (e.g., the class distribution in a classification problem) for unseen test sets, applying a model that is trained using a training set with a different data distribution. Several real-world applications demand this kind of method that does not require predictions for individual examples and just focuses on obtaining accurate estimates at an aggregate level. During the past few years, several quantification methods have been proposed from different perspectives and with different goals. This article presents a unified review of the main approaches with the aim of serving as an introductory tutorial for newcomers in the field.

References

  1. Rocío Alaiz-Rodríguez, Enrique Alegre-Gutiérrez, Víctor González-Castro, and Lidia Sánchez. 2008. Quantifying the proportion of damaged sperm cells based on image analysis and neural networks. In Proceedings of the WSEAS International Conference on Simulation, Modelling and Optimization (SMO’08). WSEAS Press, 383--388Google ScholarGoogle Scholar
  2. Rocio Alaiz-Rodríguez, Alicia Guerrero-Curieses, and Jesús Cid-Sueiro. 2011. Class and subclass probability re-estimation to adapt a classifier in the presence of concept drift. Neurocomputing 74, 16 (2011), 2614--2623.Google ScholarGoogle ScholarCross RefCross Ref
  3. Giambattista Amati, Simone Angelini, Marco Bianchi, Luca Costantini, and Giuseppe Marcone. 2014a. A scalable approach to near real-time sentiment analysis on social networks. In Proceedings of the International Workshop on Information Filtering and Retrieval. 12--23.Google ScholarGoogle Scholar
  4. Giambattista Amati, Marco Bianchi, and Giuseppe Marcone. 2014b. Sentiment estimation on twitter. In Proceedings of the 5th Italian Information Retrieval Workshop (2014). 39--50.Google ScholarGoogle Scholar
  5. Jon Scott Armstrong. 1978. Long-range Forecasting: From Crystal Ball to Computer. Wiley: New York.Google ScholarGoogle Scholar
  6. Hideki Asoh, Kazushi Ikeda, and Chihiro Ono. 2012. A fast and simple method for profiling a population of twitter users. In Proceedings of the 3rd International Workshop on Mining Ubiquitous and Social Environments. 19--26.Google ScholarGoogle Scholar
  7. Jose Barranquero, Jorge Díez, and Juan José del Coz. 2015. Quantification-oriented learning based on reliable classifiers. Pattern Recogn. 48, 2 (2015), 591--604. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Jose Barranquero, Pablo González, Jorge Díez, and Juan José del Coz. 2013. On the study of nearest neighbour algorithms for prevalence estimation in binary problems. Pattern Recogn. 46, 2 (2013), 472—482.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Oscar Beijbom, Judy Hoffman, Evan Yao, Trevor Darrell, Alberto Rodriguez-Ramirez, Manuel Gonzalez-Rivero, and Ove Hoegh Guldberg. 2015. Quantification in-the-wild: Data-sets and baselines. In Proceedings of the Workshop on Transfer and Multi-Task Learning (NIPS’15).Google ScholarGoogle Scholar
  10. Antonio Bella, Cèsar Ferri, José Hernández-Orallo, and María José Ramírez-Quintana. 2010. Quantification via probability estimators. In Proceedings of the IEEE International Conference on Data Mining (ICDM’10). IEEE, 737--742.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Antonio Bella, Cèsar Ferri, José Hernández-Orallo, and María José Ramírez-Quintana. 2014. Aggregative quantification for regression. Data Min. Knowl. Discov. 28, 2 (2014), 475--518.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Roger Bray and John T. Curtis. 1957. An ordination of the upland forest communities of southern Wisconsin. Ecol. Monogr. 27, 4 (1957), 325--349. Google ScholarGoogle ScholarCross RefCross Ref
  13. Yee Seng Chan and Hwee Tou Ng. 2006. Estimating class priors in domain adaptation for word sense disambiguation. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 89--96. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Hal Daume III and Daniel Marcu. 2006. Domain adaptation for statistical classifiers. J. Artif. Intell. Res. 26 (2006), 101--126.Google ScholarGoogle ScholarCross RefCross Ref
  15. Marthinus Christoffel du Plessis and Masashi Sugiyama. 2012. Semi-supervised learning of class balance under class-prior change by distribution matching. In Proceedings of the International Conference on Machine Learning (ICML’12).Google ScholarGoogle Scholar
  16. Marthinus Christoffel Du Plessis and Masashi Sugiyama. 2014a. Class prior estimation from positive and unlabeled data. IEICE Trans. Inf. Syst. 97, 5 (2014), 1358--1362. Google ScholarGoogle ScholarCross RefCross Ref
  17. Marthinus Christoffel Du Plessis and Masashi Sugiyama. 2014b. Semi-supervised learning of class balance under class-prior change by distribution matching. Neur. Netw. 50 (2014), 110--119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Andrea Esuli and Fabrizio Sebastiani. 2010. Sentiment quantification. IEEE Intell. Syst. 25, 4 (2010), 72--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Andrea Esuli and Fabrizio Sebastiani. 2015. Optimizing text quantifiers for multivariate loss functions. ACM Trans. Knowl. Discov. Data 9, 4 (2015), 27:1--27:27.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Tom Fawcett. 2004. ROC graphs: Notes and practical considerations for researchers. Mach. Learn. 31 (2004), 1--38.Google ScholarGoogle Scholar
  21. Tom Fawcett and Peter A. Flach. 2005. A response to webb and ting’s on the application of ROC analysis to predict classification performance under varying class distributions. Mach. Learn. 58, 1 (2005), 33--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Aykut Firat. 2016. Unified framework for quantification. arXiv preprint arXiv:1606.00868 (2016).Google ScholarGoogle Scholar
  23. George Forman. 2005. Counting positives accurately despite inaccurate classification. In Proceedings of the European Conference on Machine Learning (ECML’05). 564--575. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. George Forman. 2006. Quantifying trends accurately despite classifier error and class imbalance. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD’06). ACM, 157--166. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. George Forman. 2008. Quantifying counts and costs via classification. Data Min. Knowl. Discov. 17, 2 (2008), 164--206. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. George Forman, Evan Kirshenbaum, and Jaap Suermondt. 2006. Pragmatic text mining: Minimizing human effort to quantify many issues in call logs. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD’06). ACM, 852--861.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. James Foulds and Eibe Frank. 2010. A review of multi-instance learning assumptions. Knowl. Eng. Rev. 25, 01 (2010), 1--25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Eibe Frank and Mark Hall. 2001. A simple approach to ordinal classification. In Proceedings of the European Conference on Machine Learning. Springer, 145--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. João Gama, Indrė Žliobaitė, Albert Bifet, Mykola Pechenizkiy, and Abdelhamid Bouchachia. 2014. A survey on concept drift adaptation. ACM Comput. Surv. 46, 4 (2014), 44.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Wei Gao and Fabrizio Sebastiani. 2015. Tweet sentiment: From classification to quantification. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining (ASONAM’15).Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Wei Gao and Fabrizio Sebastiani. 2016. From classification to quantification in tweet sentiment analysis. Soc. Netw. Anal. Min. 6, 1 (2016), 1--22. Google ScholarGoogle ScholarCross RefCross Ref
  32. John J. Gart and Alfred A. Buck. 1966. Comparison of a screening test and a reference test in epidemiologic studies ii. A probabilistic model for the comparison of diagnostic tests. Am. J. Epidemiol. 83, 3 (1966), 593--602. Google ScholarGoogle ScholarCross RefCross Ref
  33. Anastasia Giachanou and Fabio Crestani. 2016. Like it or not: A survey of twitter sentiment analysis methods. Comput. Surv. 49, 2 (2016), 28:1--28:41.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Alec Go, Richa Bhayani, and Lei Huang. 2009. Twitter sentiment classification using distant supervision. CS224N Project Rep. Stanford 1 (2009), 12.Google ScholarGoogle Scholar
  35. Pablo González, Eva álvarez, Jose Barranquero, Jorge Díez, Rafael González-Quirós, Enrique Nogueira, Angel López-Urrutia, and Juan José del Coz. 2013. Multiclass support vector machines with example-dependent costs applied to plankton biomass estimation. IEEE Trans. Neur. Netw. Learn. Syst. 24, 11 (2013), 1901--1905.Google ScholarGoogle ScholarCross RefCross Ref
  36. Pablo González, Eva álvarez, Jorge Díez, ángel López-Urrutia, and Juan José del Coz. 2017. Validation methods for plankton image classification systems. Limnol. Oceanogr. Methods 15, 3 (2017), 221--237.Google ScholarGoogle ScholarCross RefCross Ref
  37. Pablo González, Jorge Díez, Nitesh Chawla, and Juan José del Coz. 2017. Why is quantification an interesting learning problem?Progr. Artif. Intell. 6, 1 (2017), 53--58.Google ScholarGoogle Scholar
  38. Víctor González-Castro, Rocío Alaiz-Rodríguez, and Enrique Alegre. 2013. Class distribution estimation based on the hellinger distance. Inf. Sci. 218 (2013), 146--164.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Vera Hofer. 2015. Adapting a classification rule to local and global shift when only unlabelled data are available. Eur. J. Operat. Res. 243, 1 (2015), 177--189. Google ScholarGoogle ScholarCross RefCross Ref
  40. Vera Hofer and Georg Krempl. 2013. Drift mining in data: A framework for addressing drift in classification. Comput. Stat. Data Anal. 57, 1 (2013), 377--391. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Daniel J. Hopkins and Gary King. 2010. A method of automated nonparametric content analysis for social science. Am. J. Polit. Sci. 54, 1 (2010), 229--247. Google ScholarGoogle ScholarCross RefCross Ref
  42. Jiayuan Huang, Alex J. Smola, Arthur Gretton, Karsten Borgwardt, and Bernhard Schölkopf. 2007. Correcting sample selection bias by unlabeled data. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS’07). The MIT Press, 601--608.Google ScholarGoogle Scholar
  43. Arun Iyer, Saketha Nath, and Sunita Sarawagi. 2014. Maximum mean discrepancy for class ratio estimation: Convergence bounds and kernel selection. In Proceedings of the International Conference on Machine Learning (ICML’14). 530--538.Google ScholarGoogle Scholar
  44. Thorsten Joachims. 2005. A support vector method for multivariate performance measures. In Proceedings of the International Conference on Machine Learning (ICML’05). ACM, 377--384. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Hideko Kawakubo, Marthinus Christoffel Du Plessis, and Masashi Sugiyama. 2016. Computationally efficient class-prior estimation under class balance change using energy distance. Trans. Inf. Syst. 99, 1 (2016), 176--186. Google ScholarGoogle ScholarCross RefCross Ref
  46. Gary King and Ying Lu. 2008. Verbal autopsy methods with multiple causes of death. Statist. Sci. 23, 1 (2008), 78--91. Google ScholarGoogle ScholarCross RefCross Ref
  47. Meelis Kull and Peter Flach. 2014. Patterns of dataset shift. In Proceedings of the 1st International Workshop on Learning over Multiple Contexts (LMCE’14) at ECML-PKDD.Google ScholarGoogle Scholar
  48. Paul S. Levy and Edward H. Kass. 1970. A three-population model for sequential screening for bacteriuria. Am. J. Epidemiol. 91, 2 (1970), 148--154. Google ScholarGoogle ScholarCross RefCross Ref
  49. Giovanni Da San Martino, Wei Gao, and Fabrizio Sebastiani. 2016a. Ordinal text quantification. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 937--940. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Giovanni Da San Martino, Wei Gao, and Fabrizio Sebastiani. 2016b. QCRI at SemEval-2016 Task 4: Probabilistic methods for binary and ordinal quantification. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval’16). Association for Computational Linguistics, A, 58--63.Google ScholarGoogle ScholarCross RefCross Ref
  51. Letizia Milli, Anna Monreale, Giulio Rossetti, Fosca Giannotti, Dino Pedreschi, and Fabrizio Sebastiani. 2013. Quantification trees. In Proceedings of the IEEE International Conference on Data Mining (ICDM’13). 528--536. Google ScholarGoogle ScholarCross RefCross Ref
  52. Letizia Milli, Anna Monreale, Giulio Rossetti, Dino Pedreschi, Fosca Giannotti, and Fabrizio Sebastiani. 2015. Quantification in social networks. In Proceedings of the IEEE International Conference on Data Science and Advanced Analytics. 1--10. Google ScholarGoogle ScholarCross RefCross Ref
  53. José G. Moreno-Torres, Troy Raeder, Rocío Alaiz-Rodríguez, Nitesh V. Chawla, and Francisco Herrera. 2012. A unifying view on dataset shift in classification. Pattern Recogn. 45, 1 (2012), 521--530.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Harikrishna Narasimhan, Shuai Li, Purushottam Kar, Sanjay Chawla, and Fabrizio Sebastiani. 2016. Stochastic optimization techniques for quantification performance measures. (unpublished).Google ScholarGoogle Scholar
  55. Sinno Jialin Pan and Qiang Yang. 2010. A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22, 10 (2010), 1345--1359. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Pablo Pérez-Gállego, José Ramón Quevedo, and Juan José del Coz. 2017. Using ensembles for problems with characterizable changes in data distribution: A case study on quantification. Inf. Fusion 34 (2017), 87--100.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Charles Peters and William A Coberly. 1976. The numerical evaluation of the maximum-likelihood estimate of mixture proportions. Commun. Stat.-Theory. Methods 5, 12 (1976), 1127--1135. Google ScholarGoogle ScholarCross RefCross Ref
  58. Foster Provost and Tom Fawcett. 2001. Robust classification for imprecise environments. Mach. Learn. 42, 3 (2001), 203--231. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Yossi Rubner, Carlo Tomasi, and Leonidas J. Guibas. 2000. The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 40, 2 (2000), 99--121. Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Marco Saerens, Patrice Latinne, and Christine Decaestecker. 2002. Adjusting the outputs of a classifier to new a priori probabilities: A simple procedure. Neur. Comput. 14, 1 (2002), 21--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Andrew Solow, Cabell Davis, and Qiao Hu. 2001. Estimating the taxonomic composition of a sample when individuals are classified with error. Mar. Ecol.: Prog. Ser. 216 (2001), 309--311. Google ScholarGoogle ScholarCross RefCross Ref
  62. Heidi M. Sosik and Robert J. Olson. 2007. Automated taxonomic classification of phytoplankton sampled with imaging-in-flow cytometry. Limnol. Oceanogr.: Methods 5, 6 (2007), 204--216. Google ScholarGoogle ScholarCross RefCross Ref
  63. Amos J. Storkey. 2009. Dataset Shift in Machine Learning. The MIT Press, 3--28.Google ScholarGoogle Scholar
  64. Masashi Sugiyama, Takafumi Kanamori, Taiji Suzuki, Marthinus Christoffel du Plessis, Song Liu, and Ichiro Takeuchi. 2013. Density-difference estimation. Neur. Comput. 25, 10 (2013), 2734--2775. Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Masashi Sugiyama, Shinichi Nakajima, Hisashi Kashima, Paul von Bünau, and Motoaki Kawanabe. 2007. Direct importance estimation with model selection and its application to covariate shift adaptation. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS’07).Google ScholarGoogle Scholar
  66. Masashi Sugiyama, Taiji Suzuki, and Takafumi Kanamori. 2012. Density Ratio Estimation in Machine Learning. Cambridge University Press. Google ScholarGoogle ScholarCross RefCross Ref
  67. Masashi Sugiyama, Makoto Yamada, and Marthinus Christoffel du Plessis. 2013. Learning under nonstationarity: Covariate shift and class-balance change. Wiley Interdisc. Rev.: Comput. Stat. 5, 6 (2013), 465--477. Google ScholarGoogle ScholarDigital LibraryDigital Library
  68. Lei Tang, Huiji Gao, and Huan Liu. 2010. Network quantification despite biased labels. In Proceedings of the 8th Workshop on Mining and Learning with Graphs (MLG’10) at ACM SIGKDD’10. ACM, 147--154. Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. Dirk Tasche. 2014. Exact fit of simple finite mixture models. J. Risk Financ. Manag. 7, 4 (2014), 150--164. Google ScholarGoogle ScholarCross RefCross Ref
  70. Dirk Tasche. 2016. Does quantification without adjustments work?arXiv preprint arXiv:1602.08780 (2016).Google ScholarGoogle Scholar
  71. Dirk Tasche. 2017. Fisher consistency for prior probability shift. arXiv preprint arXiv:1701.05512 (2017).Google ScholarGoogle Scholar
  72. Chris Tofallis. 2014. A better measure of relative prediction accuracy for model selection and model estimation. J. Operat. Res. Soc. 66, 8 (2014), 1352--1362. Google ScholarGoogle ScholarCross RefCross Ref
  73. Slobodan Vucetic and Zoran Obradovic. 2001. Classification on data with biased class distribution. In Proceedings of the European Conference on Machine Learning (ECML’01). Springer-Verlag, 527--538. Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. Geoffrey I. Webb, Roy Hyde, Hong Cao, Hai Long Nguyen, and Francois Petitjean. 2015. Characterizing concept drift. Data Min. Knowl. Discov. (2015), 1--31.Google ScholarGoogle Scholar
  75. Karl Weiss, Taghi M. Khoshgoftaar, and DingDing Wang. 2016. A survey of transfer learning. J. Big Data 3, 1 (2016), 1--40. Google ScholarGoogle ScholarCross RefCross Ref
  76. Jack Chongjie Xue and Gary M. Weiss. 2009. Quantification and semi-supervised classification methods for handling changes in class distribution. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD’09). ACM, 897--906. Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. Kun Zhang, Bernhard Schölkopf, Krikamol Muandet, and Zhikun Wang. 2013. Domain adaptation under target and conditional shift. In Proceedings of the International Conference on Machine Learning (ICML’13). 819--827.Google ScholarGoogle Scholar

Index Terms

  1. A Review on Quantification Learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Computing Surveys
      ACM Computing Surveys  Volume 50, Issue 5
      September 2018
      573 pages
      ISSN:0360-0300
      EISSN:1557-7341
      DOI:10.1145/3145473
      • Editor:
      • Sartaj Sahni
      Issue’s Table of Contents

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 September 2017
      • Revised: 1 June 2017
      • Accepted: 1 June 2017
      • Received: 1 December 2016
      Published in csur Volume 50, Issue 5

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • tutorial
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader