Skip to main content
Erschienen in: Pattern Analysis and Applications 2/2020

09.04.2019 | Theoretical advances

Smooth estimates of multiple quantiles in dynamically varying data streams

verfasst von: Hugo Lewi Hammer, Anis Yazidi

Erschienen in: Pattern Analysis and Applications | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we investigate the problem of estimating multiple quantiles when samples are received online (data stream). We assume a dynamical system, i.e., the distribution of the samples from the data stream changes with time. A major challenge of using incremental quantile estimators to track multiple quantiles is that we are not guaranteed that the monotone property of quantiles will be satisfied, i.e, an estimate of a lower quantile might erroneously overpass that of a higher quantile estimate. Surprisingly, we have only found two papers in the literature that attempt to counter these challenges, namely the works of Cao et al. (Proceedings of the first ACM workshop on mobile internet through cellular networks, ACM, 2009) and Hammer and Yazidi (Proceedings of the 30th international conference on industrial engineering and other applications of applied intelligent systems (IEA/AIE), France, Springer, 2017) where the latter is a preliminary version of the work in this paper. Furthermore, the state-of-the-art incremental quantile estimator called deterministic update-based multiplicative incremental quantile estimator (DUMIQE), due to Yazidi and Hammer (IEEE Trans Cybernet, 2017), fails to guarantee the monotone property when estimating multiple quantiles. A challenge with the solutions, in Cao et al. (2009) and Hammer and Yazidi (2017), is that even though the estimates satisfy the monotone property of quantiles, the estimates can be highly irregular relative to each other which usually is unrealistic from a practical point of view. In this paper, we suggest to generate the quantile estimates by inserting the quantile probabilities (e.g., \(0.1, 0.2, \ldots , 0.9\)) into a monotonically increasing and infinitely smooth function (can be differentiated infinitely many times). The function is incrementally updated from the data stream. The monotonicity and smoothness of the function ensure that both the monotone property and regularity requirement of the quantile estimates are satisfied. The experimental results show that the method performs very well and estimates multiple quantiles more precisely than the original DUMIQE (Yazidi and Hammer 2017), and the approaches reported in Hammer and Yazidi (2017) and Cao et al. (2009).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Arandjelovic O, Pham D-S, Venkatesh S (2015) Two maximum entropy-based algorithms for running quantile estimation in nonstationary data streams. IEEE Trans Circuits Syst Video Technol 25(9):1469–1479CrossRef Arandjelovic O, Pham D-S, Venkatesh S (2015) Two maximum entropy-based algorithms for running quantile estimation in nonstationary data streams. IEEE Trans Circuits Syst Video Technol 25(9):1469–1479CrossRef
2.
Zurück zum Zitat Cao J, Li L, Chen A, Bu T (2010) Tracking quantiles of network data streams with dynamic operations. In: 2010 Proceedings of IEEE INFOCOM. IEEE, pp 1–5 Cao J, Li L, Chen A, Bu T (2010) Tracking quantiles of network data streams with dynamic operations. In: 2010 Proceedings of IEEE INFOCOM. IEEE, pp 1–5
3.
Zurück zum Zitat Cao J, Li LE, Chen A, Bu T (2009) Incremental tracking of multiple quantiles for network monitoring in cellular networks. In: Proceedings of the 1st ACM workshop on Mobile internet through cellular networks. ACM, pp 7–12 Cao J, Li LE, Chen A, Bu T (2009) Incremental tracking of multiple quantiles for network monitoring in cellular networks. In: Proceedings of the 1st ACM workshop on Mobile internet through cellular networks. ACM, pp 7–12
4.
Zurück zum Zitat Chambers JM, James DA, Lambert D, Wiel SV (2006) Monitoring networked applications with incremental quantile estimation. Stat Sci 21:463–475MathSciNetCrossRef Chambers JM, James DA, Lambert D, Wiel SV (2006) Monitoring networked applications with incremental quantile estimation. Stat Sci 21:463–475MathSciNetCrossRef
5.
Zurück zum Zitat Chen F, Lambert D, Pinheiro JC (2000) Incremental quantile estimation for massive tracking. In: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 516–522 Chen F, Lambert D, Pinheiro JC (2000) Incremental quantile estimation for massive tracking. In: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 516–522
6.
Zurück zum Zitat Hammer HL, Yazidi A (2017) Incremental quantiles estimators for tracking multiple quantiles. In: Proceedings of the 30th international conference on industrial engineering and other applications of applied intelligent systems (IEA/AIE), France. Springer, pp 202–210 Hammer HL, Yazidi A (2017) Incremental quantiles estimators for tracking multiple quantiles. In: Proceedings of the 30th international conference on industrial engineering and other applications of applied intelligent systems (IEA/AIE), France. Springer, pp 202–210
7.
Zurück zum Zitat Jain R, Chlamtac I (1985) The P2 algorithm for dynamic calculation of quantiles and histograms without storing observations. Commun ACM 28(10):1076–1085CrossRef Jain R, Chlamtac I (1985) The P2 algorithm for dynamic calculation of quantiles and histograms without storing observations. Commun ACM 28(10):1076–1085CrossRef
8.
Zurück zum Zitat Luo G, Wang L, Yi K, Graham C (2016) Quantiles over data streams: experimental comparisons, new analyses, and further improvements. VLDB J 25(4):449–472CrossRef Luo G, Wang L, Yi K, Graham C (2016) Quantiles over data streams: experimental comparisons, new analyses, and further improvements. VLDB J 25(4):449–472CrossRef
9.
Zurück zum Zitat Ma Q, Muthukrishnan S, Sandler M (2014) Frugal streaming for estimating quantiles: One (or two) memory suffices. arXiv preprint arXiv:1407.1121 Ma Q, Muthukrishnan S, Sandler M (2014) Frugal streaming for estimating quantiles: One (or two) memory suffices. arXiv preprint arXiv:​1407.​1121
10.
Zurück zum Zitat McDermott JP, Babu GJ, Liechty JC, Lin DKJ (2007) Data skeletons: simultaneous estimation of multiple quantiles for massive streaming datasets with applications to density estimation. Stat Comput 17(4):311–321MathSciNetCrossRef McDermott JP, Babu GJ, Liechty JC, Lin DKJ (2007) Data skeletons: simultaneous estimation of multiple quantiles for massive streaming datasets with applications to density estimation. Stat Comput 17(4):311–321MathSciNetCrossRef
11.
Zurück zum Zitat Schmeiser BW, Deutsch SJ (1977) Quantile estimation from grouped data: the cell midpoint. Commun Stat Simul Comput 6(3):221–234CrossRef Schmeiser BW, Deutsch SJ (1977) Quantile estimation from grouped data: the cell midpoint. Commun Stat Simul Comput 6(3):221–234CrossRef
12.
Zurück zum Zitat Shrivastava N, Buragohain C, Agrawal D, Suri S (2004) Medians and beyond: new aggregation techniques for sensor networks. In: Proceedings of the 2nd international conference on Embedded networked sensor systems. ACM, pp 239–249 Shrivastava N, Buragohain C, Agrawal D, Suri S (2004) Medians and beyond: new aggregation techniques for sensor networks. In: Proceedings of the 2nd international conference on Embedded networked sensor systems. ACM, pp 239–249
13.
Zurück zum Zitat Tierney L (1983) A space-efficient recursive procedure for estimating a quantile of an unknown distribution. SIAM J Sci Stat Comput 4(4):706–711MathSciNetCrossRef Tierney L (1983) A space-efficient recursive procedure for estimating a quantile of an unknown distribution. SIAM J Sci Stat Comput 4(4):706–711MathSciNetCrossRef
14.
Zurück zum Zitat Tschumitschew K, Klawonn F (2010) Incremental quantile estimation. Evol Syst 1(4):253–264CrossRef Tschumitschew K, Klawonn F (2010) Incremental quantile estimation. Evol Syst 1(4):253–264CrossRef
15.
Zurück zum Zitat Yazidi A, Hammer H (2017) Multiplicative update methods for incremental quantile estimation. IEEE Trans Cybernet 49(3):746–756CrossRef Yazidi A, Hammer H (2017) Multiplicative update methods for incremental quantile estimation. IEEE Trans Cybernet 49(3):746–756CrossRef
Metadaten
Titel
Smooth estimates of multiple quantiles in dynamically varying data streams
verfasst von
Hugo Lewi Hammer
Anis Yazidi
Publikationsdatum
09.04.2019
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 2/2020
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-019-00794-3

Weitere Artikel der Ausgabe 2/2020

Pattern Analysis and Applications 2/2020 Zur Ausgabe

Premium Partner