Published in: Neural Computing and Applications 6/2016

01.08.2016 | Original Article

Linear combination of densities and its direct estimation framework with applications

Authors: Min Xu, Guanjin Wang, Fu-lai Chung, Shitong Wang


Abstract

In this paper, typical learning tasks, including data condensation, binary classification, identification of the independence between random variables, and conditional density estimation, are described from the unified perspective of a linear combination of densities. Accordingly, a direct estimation framework based on a linear combination of Gaussian components (i.e., Gaussian basis functions) under the integrated square error criterion is proposed to solve these learning tasks. The proposed framework has three advantages. Firstly, unlike most existing state-of-the-art methods, which estimate each component density in the linear combination separately and then combine the estimates linearly, it estimates the linear combination of densities directly as a whole, and its approximation accuracy is at least comparable to, and sometimes better than, that of existing density estimation methods. Secondly, its time complexity is O(l^3), where l is the number of Gaussian components in the framework. These components are generally viewed as the Gaussian distributions of the clusters in a dataset, so l is generally much smaller than the size of the dataset, which makes the framework well suited to large datasets. Thirdly, the framework can be used to develop alternative approaches to classification, data condensation, identification of the independence between random variables, conditional density estimation, and similarity identification between multiple source domains and a target domain. Preliminary experimental results on several typical applications indicate the power of the proposed direct estimation framework.
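The abstract gives no formulas, so the following is only a minimal sketch of what directly estimating a linear combination of densities under an integrated square error criterion could look like; it is not the authors' algorithm. It assumes isotropic Gaussian basis functions with fixed centers and one shared bandwidth, a small ridge term for numerical stability, and an unconstrained closed-form solution. All function and parameter names (`gauss`, `fit_combination`, `evaluate`, `ridge`) are hypothetical. For a target f(x) = sum_k c_k p_k(x) and a model g(x) = sum_j alpha_j N(x; mu_j, sigma^2 I), the criterion reduces to alpha^T H alpha - 2 alpha^T h plus a constant, where H_ij = N(mu_i; mu_j, 2 sigma^2 I) and h_j is estimated by sample means, so solving for alpha costs O(l^3) in the number of components l.

```python
import numpy as np

def gauss(x, centers, var):
    """Isotropic Gaussian density N(x; center, var*I) for every (x, center) pair."""
    d = x.shape[1]
    sq = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq / (2.0 * var)) / (2.0 * np.pi * var) ** (d / 2.0)

def fit_combination(samples, coeffs, centers, sigma, ridge=1e-6):
    """Fit weights alpha of l Gaussian components to sum_k coeffs[k] * p_k.

    samples: list of (n_k, d) arrays, one per density p_k (assumed i.i.d. draws).
    The ridge term is an assumption added for numerical stability.
    """
    var = sigma ** 2
    # H_ij: integral of the product of two basis functions, in closed form.
    H = gauss(centers, centers, 2.0 * var)
    # h_j: sample-mean estimate of the integral of basis j against the target.
    h = sum(c * gauss(x, centers, var).mean(axis=0)
            for c, x in zip(coeffs, samples))
    # Solving an l x l linear system is the O(l^3) step.
    return np.linalg.solve(H + ridge * np.eye(len(centers)), h)

def evaluate(x, alpha, centers, sigma):
    """Evaluate the fitted linear combination at the rows of x."""
    return gauss(x, centers, sigma ** 2) @ alpha

# Usage: estimate the density difference p1 - p2 from two 1-D samples.
rng = np.random.default_rng(0)
x1 = rng.normal(-2.0, 1.0, size=(2000, 1))   # draws from p1
x2 = rng.normal(2.0, 1.0, size=(2000, 1))    # draws from p2
centers = np.array([[-4.0], [-2.0], [0.0], [2.0], [4.0]])  # l = 5 components
alpha = fit_combination([x1, x2], [1.0, -1.0], centers, sigma=1.0)
g = evaluate(np.array([[-2.0], [2.0]]), alpha, centers, sigma=1.0)
print(alpha, g)
```

In this toy setting the sign of the fitted difference acts as a simple two-class decision rule, which is one way to read the abstract's link between a linear combination of densities and binary classification. Choosing the centers by clustering, as the abstract suggests, would replace the fixed grid used here.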

Metadata
Title
Linear combination of densities and its direct estimation framework with applications
Authors
Min Xu
Guanjin Wang
Fu-lai Chung
Shitong Wang
Publication date
01.08.2016
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 6/2016
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-015-1947-3