Skip to main content
Erschienen in: Cognitive Computation 3/2014

01.09.2014

A Learner-Independent Knowledge Transfer Approach to Multi-task Learning

verfasst von: Shaoning Pang, Fan Liu, Youki Kadobayashi, Tao Ban, Daisuke Inoue

Erschienen in: Cognitive Computation | Ausgabe 3/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper proposes a learner-independent multi-task learning (MTL) scheme in which knowledge transfer (KT) is running beyond the learner. In the proposed KT approach, we use minimum enclosing balls (MEBs) as knowledge carriers to extract and transfer knowledge from one task to another. Since the knowledge presented in MEB can be decomposed as raw data, it can be incorporated into any learner as additional training data for a new learning task to improve the learning rate. The effectiveness and robustness of the proposed KT is evaluated, respectively, on multi-task pattern recognition problems derived from synthetic datasets, UCI datasets, and real face image datasets, using classifiers from different disciplines for MTL. The experimental results show that multi-task learners using KT via MEB carriers perform better than learners without-KT, and this has been successfully applied to different classifiers such as k nearest neighbor and support vector machines.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat McClelland J. Is a machine realization of truly human-like intelligence achievable? Cogn Comput. 2009;1(1):17–21. McClelland J. Is a machine realization of truly human-like intelligence achievable? Cogn Comput. 2009;1(1):17–21.
2.
Zurück zum Zitat Taylor JG. Cognitive computation. Cogn Comput. 2009;1(1):4–16. Taylor JG. Cognitive computation. Cogn Comput. 2009;1(1):4–16.
3.
Zurück zum Zitat Pang S, Ozawa S, Kasabov N. Incremental linear discriminant analysis for classification of data streams. IEEE Trans Neural Netw. 2005;35(5):905–914. Pang S, Ozawa S, Kasabov N. Incremental linear discriminant analysis for classification of data streams. IEEE Trans Neural Netw. 2005;35(5):905–914.
4.
Zurück zum Zitat Pang S, Zhu L, Chen G, Sarrafzadeh A, Ban T, Inoue D. Dynamic class imbalance learning for incremental LPSVM. Neural Netw. 2013;44(2013):87–100. Pang S, Zhu L, Chen G, Sarrafzadeh A, Ban T, Inoue D. Dynamic class imbalance learning for incremental LPSVM. Neural Netw. 2013;44(2013):87–100.
5.
Zurück zum Zitat Kumar P, Mitchell JSB, Yildirim EA. Approximate minimum enclosing balls in high dimensions using core-sets. J Exp Algorithm (JEA). 2003;8:11. Kumar P, Mitchell JSB, Yildirim EA. Approximate minimum enclosing balls in high dimensions using core-sets. J Exp Algorithm (JEA). 2003;8:11.
6.
Zurück zum Zitat Chan TM. Approximating the diameter, width, smallest enclosing cylinder, and minimum-width annulus. In: Proceedings of the sixteenth annual symposium on computational geometry. Clear Water Bay, Kowloon, Hong Kong: ACM New York, NY, USA, 2000;300–309. Chan TM. Approximating the diameter, width, smallest enclosing cylinder, and minimum-width annulus. In: Proceedings of the sixteenth annual symposium on computational geometry. Clear Water Bay, Kowloon, Hong Kong: ACM New York, NY, USA, 2000;300–309.
7.
Zurück zum Zitat Pan SJ, Yang Q. A survey on transfer learning. Knowl Data Eng IEEE Trans. 2009;(99):1–1. Pan SJ, Yang Q. A survey on transfer learning. Knowl Data Eng IEEE Trans. 2009;(99):1–1.
8.
Zurück zum Zitat Argyriou A, Maurer A, Pontil M. An algorithm for transfer learning in a heterogeneous environment. In: proceedings of the 2008 European conference on machine learning and knowledge discovery in databases—Part I. Antwerp, Belgium: Springer; 2008. p. 71–85. http://dl.acm.org/citation.cfm?id=1431959. Argyriou A, Maurer A, Pontil M. An algorithm for transfer learning in a heterogeneous environment. In: proceedings of the 2008 European conference on machine learning and knowledge discovery in databases—Part I. Antwerp, Belgium: Springer; 2008. p. 71–85. http://​dl.​acm.​org/​citation.​cfm?​id=​1431959.
9.
Zurück zum Zitat Lawrence N. Platt J. Learning to learn with the informative vector machine. In: The 21st international conference on machine learning. 2004. Lawrence N. Platt J. Learning to learn with the informative vector machine. In: The 21st international conference on machine learning. 2004.
10.
Zurück zum Zitat Knowledge Transfer via Multiple Model Local Structure Mapping, ser. Research papers, no. 978-1-60558-193-4. ACM, November 2008. Knowledge Transfer via Multiple Model Local Structure Mapping, ser. Research papers, no. 978-1-60558-193-4. ACM, November 2008.
11.
Zurück zum Zitat Mitchell TM. The need for biases in learning generalizations. Tech Rep.; 1980. Mitchell TM. The need for biases in learning generalizations. Tech Rep.; 1980.
12.
Zurück zum Zitat Ozawa S, Roy A, Roussinov D. A multitask learning model for online pattern recognition. IEEE Trans Neural Netw. 2009;20(3):430–445.PubMedCrossRef Ozawa S, Roy A, Roussinov D. A multitask learning model for online pattern recognition. IEEE Trans Neural Netw. 2009;20(3):430–445.PubMedCrossRef
13.
Zurück zum Zitat Yu K, Schwaighofer A, Tresp V, Ma W-Y, Zhang H. Collaborative ensemble learning: combining collaborative and content-based information filtering via hierarchical bayes. In: Proceedings of the 19th conference on uncertainty in artificial intelligence. 2003. Yu K, Schwaighofer A, Tresp V, Ma W-Y, Zhang H. Collaborative ensemble learning: combining collaborative and content-based information filtering via hierarchical bayes. In: Proceedings of the 19th conference on uncertainty in artificial intelligence. 2003.
14.
Zurück zum Zitat Silver D, Mercer R. Selective functional transfer: inductive bias from related tasks. In: Proceedings of the IASTED international conference on artificial intelligence and soft computing (ASC2001), p. 182–189. Silver D, Mercer R. Selective functional transfer: inductive bias from related tasks. In: Proceedings of the IASTED international conference on artificial intelligence and soft computing (ASC2001), p. 182–189.
15.
Zurück zum Zitat Mitchell TM. Machine learning. New York; McGraw-Hill. 1997. Mitchell TM. Machine learning. New York; McGraw-Hill. 1997.
16.
Zurück zum Zitat Silver DL, McCracken P. Selective transfer of task knowledge using stochastic noise. In: Xiang IY, Chaib-draa B, editors. Advances in artificial intelligence, proceedings of the 16th conference of the Canadian society for computational studies of intelligence (AI’2003). Berlin: Springer. 2003;190–205. Silver DL, McCracken P. Selective transfer of task knowledge using stochastic noise. In: Xiang IY, Chaib-draa B, editors. Advances in artificial intelligence, proceedings of the 16th conference of the Canadian society for computational studies of intelligence (AI’2003). Berlin: Springer. 2003;190–205.
17.
Zurück zum Zitat Ghosn J, Bengio Y. Bias learning, knowledge sharing. IEEE Trans Neural Netw. 2003;14(4):748–765.PubMedCrossRef Ghosn J, Bengio Y. Bias learning, knowledge sharing. IEEE Trans Neural Netw. 2003;14(4):748–765.PubMedCrossRef
18.
Zurück zum Zitat Silver DL, Poirier R. Sequential consolidation of learned task knowledge. 2004. Silver DL, Poirier R. Sequential consolidation of learned task knowledge. 2004.
19.
Zurück zum Zitat Silver DL. Selective transfer of neural network task knowledge. PhD Thesis, University of Western Ontario. 2000. Silver DL. Selective transfer of neural network task knowledge. PhD Thesis, University of Western Ontario. 2000.
20.
21.
Zurück zum Zitat Jonathan B. Learning internal representations. In: Proceedings of the eighth international conference on computational learning theory. 1995. Jonathan B. Learning internal representations. In: Proceedings of the eighth international conference on computational learning theory. 1995.
22.
Zurück zum Zitat Intrator N, Edelman S. Making a low-dimensional representation suitable for diverse tasks. Norwell, MA, USA: Kluwer Academic. 1998. Intrator N, Edelman S. Making a low-dimensional representation suitable for diverse tasks. Norwell, MA, USA: Kluwer Academic. 1998.
23.
Zurück zum Zitat Thrun S. Is learning the n-th thing any easier than learning the first?. In: Advances in neural information processing systems. Cambridge: The MIT Press. 1996;640–646. Thrun S. Is learning the n-th thing any easier than learning the first?. In: Advances in neural information processing systems. Cambridge: The MIT Press. 1996;640–646.
24.
Zurück zum Zitat Roy A, Kim LS, Mukhopadhyay S. A polynomial time algorithm for the construction and training of a class of multilayer perceptrons. Neural Netw. 1993;6(4):535–545.CrossRef Roy A, Kim LS, Mukhopadhyay S. A polynomial time algorithm for the construction and training of a class of multilayer perceptrons. Neural Netw. 1993;6(4):535–545.CrossRef
25.
Zurück zum Zitat Roy A, Govil S, Miranda R. An algorithm to generate radial basis function (rbf)-like nets for classification problems. Neural Netw. 1995;8(2):179–202.CrossRef Roy A, Govil S, Miranda R. An algorithm to generate radial basis function (rbf)-like nets for classification problems. Neural Netw. 1995;8(2):179–202.CrossRef
26.
Zurück zum Zitat Roy A, Mukhopadhyay S. Iterative generation of higher-order nets in polynomial time using linear programming. IEEE Trans Neural Netw. 1997;8(2):402–412.PubMedCrossRef Roy A, Mukhopadhyay S. Iterative generation of higher-order nets in polynomial time using linear programming. IEEE Trans Neural Netw. 1997;8(2):402–412.PubMedCrossRef
27.
Zurück zum Zitat Roy A, Govil S, Miranda R. A neural network learning theory and a polynomial time rbf algorithm. IEEE Trans Neural Netw. 1997;8(6):1301–1313.PubMedCrossRef Roy A, Govil S, Miranda R. A neural network learning theory and a polynomial time rbf algorithm. IEEE Trans Neural Netw. 1997;8(6):1301–1313.PubMedCrossRef
28.
Zurück zum Zitat Xue Y, Liao X, Carin L. Multi-task learning for classification with dirichlet process priors. J Mach Learn Res Research. 2007;8:35–63. Xue Y, Liao X, Carin L. Multi-task learning for classification with dirichlet process priors. J Mach Learn Res Research. 2007;8:35–63.
29.
Zurück zum Zitat Ferguson T. A bayesian analysis of some nonparametric problems. Ann Stat. 1973;1:209–230.CrossRef Ferguson T. A bayesian analysis of some nonparametric problems. Ann Stat. 1973;1:209–230.CrossRef
30.
Zurück zum Zitat Good I. Some history of the hierarchical Bayesian methodology, ser. Bayesian statistics. Valencia, Valencia University Press. 1980 Good I. Some history of the hierarchical Bayesian methodology, ser. Bayesian statistics. Valencia, Valencia University Press. 1980
31.
Zurück zum Zitat Evgeniou T, Pontil M. Regularized multi-task learning. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM. 2004;109–117. Evgeniou T, Pontil M. Regularized multi-task learning. In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM. 2004;109–117.
32.
Zurück zum Zitat Ben-David S, Schuller R. Exploiting task relatedness for multiple task learning. In: Proceedings of computational learning theory (COLT). 2003. Ben-David S, Schuller R. Exploiting task relatedness for multiple task learning. In: Proceedings of computational learning theory (COLT). 2003.
33.
Zurück zum Zitat Eaton E, desJardins M. Knowledge transfer with a multiresolution ensemble of classifiers. In: ICML-06 workshop on structural knowledge transfer for machine learning, June 29, Pittsburgh, PA. 2006. Eaton E, desJardins M. Knowledge transfer with a multiresolution ensemble of classifiers. In: ICML-06 workshop on structural knowledge transfer for machine learning, June 29, Pittsburgh, PA. 2006.
34.
Zurück zum Zitat Eaton E. Multi-resolution learning for knowledge transfer. Boston, MA; AAAI Press. July 16–20 2006, [Doctoral Consortium]. Eaton E. Multi-resolution learning for knowledge transfer. Boston, MA; AAAI Press. July 16–20 2006, [Doctoral Consortium].
35.
Zurück zum Zitat Eaton E, desJardins M, Stevenson J. Using multiresolution learning for transfer in image classification. In: aaai07. AAAI Press. 2007. Eaton E, desJardins M, Stevenson J. Using multiresolution learning for transfer in image classification. In: aaai07. AAAI Press. 2007.
36.
Zurück zum Zitat Megiddo N. Linear-time algorithms for linear programming in r 3 and related problems. SIAM J Comput. 1983;12(4):759–776.CrossRef Megiddo N. Linear-time algorithms for linear programming in r 3 and related problems. SIAM J Comput. 1983;12(4):759–776.CrossRef
37.
Zurück zum Zitat Welzl E. Smallest enclosing disks (balls and ellipsoids). Results New Trends Comput Sci. 1991;359–370. Welzl E. Smallest enclosing disks (balls and ellipsoids). Results New Trends Comput Sci. 1991;359–370.
38.
Zurück zum Zitat Badoiu M. Optimal core sets for balls. In: In DIMACS workshop on computational geometry. 2002. Badoiu M. Optimal core sets for balls. In: In DIMACS workshop on computational geometry. 2002.
39.
Zurück zum Zitat Chapelle O, Scholkopf B, Zien A. Semi-supervised learning. Cambridge, MA: MIT Press. 2006.CrossRef Chapelle O, Scholkopf B, Zien A. Semi-supervised learning. Cambridge, MA: MIT Press. 2006.CrossRef
41.
Zurück zum Zitat Kim M-S, Kim D, Lee S-Y. Face recognition descriptor using the embedded hmm with the 2nd-order block-specific eigenvectors. ISO/IEC JTC1/SC21/WG11/M7997, Jeju, 2002. Kim M-S, Kim D, Lee S-Y. Face recognition descriptor using the embedded hmm with the 2nd-order block-specific eigenvectors. ISO/IEC JTC1/SC21/WG11/M7997, Jeju, 2002.
42.
Zurück zum Zitat Ozawa S, Pang S, Kasabov N. Online feature selection for adaptive evolving connectionist systems. Int J Innov Comput Inf Control. 2006;2(1):181–192. Ozawa S, Pang S, Kasabov N. Online feature selection for adaptive evolving connectionist systems. Int J Innov Comput Inf Control. 2006;2(1):181–192.
43.
Zurück zum Zitat Ozawa S, Pang S, Kasabov N. Incremental learning of chunk data for on-line pattern classification systems. IEEE Trans Neural Netw. 2008;19(6):1061–1074. Ozawa S, Pang S, Kasabov N. Incremental learning of chunk data for on-line pattern classification systems. IEEE Trans Neural Netw. 2008;19(6):1061–1074.
44.
Zurück zum Zitat Ozawa S, Pang S, Kasabov N. Incremental learning of feature space and classifier for on-line pattern recognition. Int J Knowl Based Intell Eng Syst. 2006;10:57–65. Ozawa S, Pang S, Kasabov N. Incremental learning of feature space and classifier for on-line pattern recognition. Int J Knowl Based Intell Eng Syst. 2006;10:57–65.
Metadaten
Titel
A Learner-Independent Knowledge Transfer Approach to Multi-task Learning
verfasst von
Shaoning Pang
Fan Liu
Youki Kadobayashi
Tao Ban
Daisuke Inoue
Publikationsdatum
01.09.2014
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 3/2014
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-013-9238-8

Weitere Artikel der Ausgabe 3/2014

Cognitive Computation 3/2014 Zur Ausgabe