Skip to main content
Erschienen in: Neural Computing and Applications 1/2014

01.07.2014 | Original Article

Knowledge transfer across different domain data with multiple views

verfasst von: Qi Tan, Huifang Deng, Pei Yang

Erschienen in: Neural Computing and Applications | Ausgabe 1/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In many real-world applications in the areas of data mining, the distributions of testing data are different from that of training data. And on the other hand, many data are often represented by multiple views which are of importance to learning. However, little work has been done for it. In this paper, we explored to leverage the multi-view information across different domains for knowledge transfer. We proposed a novel transfer learning model which integrates the domain distance and view consistency into a 2-view support vector machine framework, namely DV2S. The objective of DV2S is to find the optimal feature mapping such that under the projections the classification margin is maximized, while both the domain distance and the disagreement between multiple views are minimized simultaneously. Experiments showed that DV2S outperforms a variety of state-of-the-art algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359CrossRef Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22(10):1345–1359CrossRef
2.
Zurück zum Zitat Dai WY, Xue GR, Yang Q, Yu Y (2007) Co-clustering based classification for out-of-domain documents. In: Berkhin P, Caruana R, Wu X (eds) Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, 12–15 August 2007, ACM 2007, San Jose, pp 210–219 Dai WY, Xue GR, Yang Q, Yu Y (2007) Co-clustering based classification for out-of-domain documents. In: Berkhin P, Caruana R, Wu X (eds) Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, 12–15 August 2007, ACM 2007, San Jose, pp 210–219
3.
Zurück zum Zitat Sarinnapakorn K, Kubat M (2007) Combining sub-classifiers in text categorization: a DST-based solution and a case study. IEEE Trans Knowl Data Eng 19(12):1638–1651CrossRef Sarinnapakorn K, Kubat M (2007) Combining sub-classifiers in text categorization: a DST-based solution and a case study. IEEE Trans Knowl Data Eng 19(12):1638–1651CrossRef
4.
Zurück zum Zitat Blitzer J, Dredze M, Pereira F (2007) Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Carroll JA, van den Bosch A, Zaenen A (eds) Proceedings of the 45th annual meeting of the association for computational linguistics, 23–30 June 2007. The Association for Computational Linguistics (ACL 2007), Prague, pp 440–447 Blitzer J, Dredze M, Pereira F (2007) Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Carroll JA, van den Bosch A, Zaenen A (eds) Proceedings of the 45th annual meeting of the association for computational linguistics, 23–30 June 2007. The Association for Computational Linguistics (ACL 2007), Prague, pp 440–447
5.
Zurück zum Zitat Blitzer J, Kakade S, Foster DP (2011) Domain adaptation with coupled subspaces. In: Gordon G, Dunson D, Dudík M (eds) Proceedings of the fourteenth international conference on artificial intelligence and statistics (AISTATS 2011), 11–13 April 2011. Microtome Publishing, Fort Lauderdale, pp 173–181 Blitzer J, Kakade S, Foster DP (2011) Domain adaptation with coupled subspaces. In: Gordon G, Dunson D, Dudík M (eds) Proceedings of the fourteenth international conference on artificial intelligence and statistics (AISTATS 2011), 11–13 April 2011. Microtome Publishing, Fort Lauderdale, pp 173–181
6.
Zurück zum Zitat Pan WK, Xiang EW, Liu NN, Yang Q (2010) Transfer learning in collaborative filtering for sparsity reduction. In: Fox M, Poole D (eds) Proceedings of the twenty-fourth AAAI conference on artificial intelligence (AAAI 2010), 11–15 July 2010. AAAI Press 2010, Atlanta, pp 230–235 Pan WK, Xiang EW, Liu NN, Yang Q (2010) Transfer learning in collaborative filtering for sparsity reduction. In: Fox M, Poole D (eds) Proceedings of the twenty-fourth AAAI conference on artificial intelligence (AAAI 2010), 11–15 July 2010. AAAI Press 2010, Atlanta, pp 230–235
7.
Zurück zum Zitat Ma H, Zhou DY, Liu C, Lyu MR, King I (2011) Recommender systems with social regularization. In: King I, Nejdl W, Li H (eds) Proceedings of the fourth international conference on web search and web data mining (WSDM 2011), 9–12 February 2011. ACM 2011, Hong Kong, pp 287–296 Ma H, Zhou DY, Liu C, Lyu MR, King I (2011) Recommender systems with social regularization. In: King I, Nejdl W, Li H (eds) Proceedings of the fourth international conference on web search and web data mining (WSDM 2011), 9–12 February 2011. ACM 2011, Hong Kong, pp 287–296
8.
Zurück zum Zitat Gao W, Cai P, Wong K-F, Zhou AY (2010) Learning to rank only using training data from related domain. In: Crestani F, Marchand-Maillet S, Chen H-H, Efthimiadis EN, Savoy J (eds) Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval (SIGIR 2010), 19–23 July 2010. ACM 2010, Geneva, pp 162–169 Gao W, Cai P, Wong K-F, Zhou AY (2010) Learning to rank only using training data from related domain. In: Crestani F, Marchand-Maillet S, Chen H-H, Efthimiadis EN, Savoy J (eds) Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval (SIGIR 2010), 19–23 July 2010. ACM 2010, Geneva, pp 162–169
9.
Zurück zum Zitat Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Bartlett P, Mansour Y (eds) The eleventh annual conference on computational learning theory (COLT’98), ACM 1998, University of Wisconsin, Madison, pp 92–100 Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training. In: Bartlett P, Mansour Y (eds) The eleventh annual conference on computational learning theory (COLT’98), ACM 1998, University of Wisconsin, Madison, pp 92–100
10.
Zurück zum Zitat Rüping S, Scheffer T (2005) Learning with multiple views. In: De Raedt L, Wrobel S (eds) Machine learning, Proceedings of the twenty-second international conference (ICML 2005), ACM International Conference Proceeding Series 119, 7–11 August 2005. ACM 2005, Bonn Rüping S, Scheffer T (2005) Learning with multiple views. In: De Raedt L, Wrobel S (eds) Machine learning, Proceedings of the twenty-second international conference (ICML 2005), ACM International Conference Proceeding Series 119, 7–11 August 2005. ACM 2005, Bonn
11.
Zurück zum Zitat Abney S (2002) Bootstrapping. In: Charniak E, Lin D (eds) Proceedings of the 40th annual meeting of the association for computational linguistics, 6–12 July 2002. The Association for Computational Linguistics 2002, Philadelphia, pp 360–367 Abney S (2002) Bootstrapping. In: Charniak E, Lin D (eds) Proceedings of the 40th annual meeting of the association for computational linguistics, 6–12 July 2002. The Association for Computational Linguistics 2002, Philadelphia, pp 360–367
12.
Zurück zum Zitat Zhang D, He JR, Liu Y, Si L, Lawrence RD (2011) Multi-view transfer learning with a large margin approach. In: Apté C, Ghosh J, Smyth P (eds) Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, 21–24 August 2011. ACM 2011, San Diego, pp 1208–1216 Zhang D, He JR, Liu Y, Si L, Lawrence RD (2011) Multi-view transfer learning with a large margin approach. In: Apté C, Ghosh J, Smyth P (eds) Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, 21–24 August 2011. ACM 2011, San Diego, pp 1208–1216
13.
Zurück zum Zitat Quanz B, Huan J (2009) Large margin transductive transfer learning. In: Wai-Lok Cheung D, Song I-Y, Chu WW, Hu X, Lin JJ (eds) Proceedings of the 18th ACM conference on information and knowledge management (CIKM 2009), 2–6 November 2009. ACM 2009, Hong Kong, pp 1327–1336 Quanz B, Huan J (2009) Large margin transductive transfer learning. In: Wai-Lok Cheung D, Song I-Y, Chu WW, Hu X, Lin JJ (eds) Proceedings of the 18th ACM conference on information and knowledge management (CIKM 2009), 2–6 November 2009. ACM 2009, Hong Kong, pp 1327–1336
14.
Zurück zum Zitat Joachims T (1999) Transductive inference for text classification using support vector machines. In: Bratko I, Dzeroski S (eds) Proceedings of the sixteenth international conference on machine learning (ICML 1999), 27–30 June 1999. Morgan Kaufmann 1999, Bled, pp 200–209 Joachims T (1999) Transductive inference for text classification using support vector machines. In: Bratko I, Dzeroski S (eds) Proceedings of the sixteenth international conference on machine learning (ICML 1999), 27–30 June 1999. Morgan Kaufmann 1999, Bled, pp 200–209
15.
Zurück zum Zitat Jiang J, Zhai CX (2007) Instance weighting for domain adaptation in NLP. In: Carroll JA, van den Bosch A, Zaenen A (eds) Proceedings of the 45th annual meeting of the association for computational linguistics, 23–30 June 2007. The Association for Computational Linguistics 2007 (ACL 2007), Prague, pp 264–271 Jiang J, Zhai CX (2007) Instance weighting for domain adaptation in NLP. In: Carroll JA, van den Bosch A, Zaenen A (eds) Proceedings of the 45th annual meeting of the association for computational linguistics, 23–30 June 2007. The Association for Computational Linguistics 2007 (ACL 2007), Prague, pp 264–271
16.
Zurück zum Zitat Dai WY, Yang Q, Xue GR, Yu Y (2007) Boosting for transfer learning. In: Ghahramani Z (ed) Machine Learning, Proceedings of the twenty-fourth international conference (ICML 2007), 20–24 June 2007. ACM International Conference Proceeding Series 227, ACM 2007, Corvallis, pp 193–200 Dai WY, Yang Q, Xue GR, Yu Y (2007) Boosting for transfer learning. In: Ghahramani Z (ed) Machine Learning, Proceedings of the twenty-fourth international conference (ICML 2007), 20–24 June 2007. ACM International Conference Proceeding Series 227, ACM 2007, Corvallis, pp 193–200
17.
Zurück zum Zitat Dayanik AA, Lewis DD, Madigan D, Menkov V, Genkin A (2006) Constructing informative prior distributions from domain knowledge in text classification. In: Efthimiadis EN, Dumais ST, Hawking D, Järvelin K (eds) SIGIR 2006: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, 6–11 August 2006. ACM 2006, Seattle, pp 493–500 Dayanik AA, Lewis DD, Madigan D, Menkov V, Genkin A (2006) Constructing informative prior distributions from domain knowledge in text classification. In: Efthimiadis EN, Dumais ST, Hawking D, Järvelin K (eds) SIGIR 2006: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, 6–11 August 2006. ACM 2006, Seattle, pp 493–500
18.
Zurück zum Zitat Gretton A, Borgwardt KM, Rasch MJ, Schölkopf B, Smola AJ (2006) A kernel method for the two-sample problem. In: Schölkopf B, Platt JC, Hoffman T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, 4–7 December 2006. MIT Press 2007, Vancouver, British Columbia, pp 513–520 Gretton A, Borgwardt KM, Rasch MJ, Schölkopf B, Smola AJ (2006) A kernel method for the two-sample problem. In: Schölkopf B, Platt JC, Hoffman T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, 4–7 December 2006. MIT Press 2007, Vancouver, British Columbia, pp 513–520
19.
Zurück zum Zitat Huang JY, Smola AJ, Gretton A, Borgwardt KM, Schölkopf B (2006) Correcting sample selection bias by unlabeled data. In: Schölkopf B, Platt JC, Hoffman T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, 4–7 December 2006. MIT Press 2007, Vancouver, British Columbia, pp 601–608 Huang JY, Smola AJ, Gretton A, Borgwardt KM, Schölkopf B (2006) Correcting sample selection bias by unlabeled data. In: Schölkopf B, Platt JC, Hoffman T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, 4–7 December 2006. MIT Press 2007, Vancouver, British Columbia, pp 601–608
20.
Zurück zum Zitat Pan SJ, Kwok JT, Yang Q (2008) Transfer learning via dimensionality reduction. In: Fox D, Gomes CP (eds) Proceedings of the twenty-third AAAI conference on artificial intelligence, AAAI 2008, 13–17 July 2008. AAAI Press 2008, Chicago, pp 677–682 Pan SJ, Kwok JT, Yang Q (2008) Transfer learning via dimensionality reduction. In: Fox D, Gomes CP (eds) Proceedings of the twenty-third AAAI conference on artificial intelligence, AAAI 2008, 13–17 July 2008. AAAI Press 2008, Chicago, pp 677–682
21.
Zurück zum Zitat Dasgupta S, Littman ML, McAllester D (2001) PAC generalization bounds for co-training. In: Dietterich TG, Becker S, Ghahramani Z (eds) Advances in neural information processing systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001, 3–8 December 2001]. MIT Press 2001, Vancouver, British Columbia, pp 375–382 Dasgupta S, Littman ML, McAllester D (2001) PAC generalization bounds for co-training. In: Dietterich TG, Becker S, Ghahramani Z (eds) Advances in neural information processing systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001, 3–8 December 2001]. MIT Press 2001, Vancouver, British Columbia, pp 375–382
22.
Zurück zum Zitat Chen MM, Weinberger KQ, Blitzer J (2011) Co-training for domain adaptation. In: Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira FCN, Weinberger KQ (eds) Advances in neural information processing systems 24: 25th annual conference on neural information processing systems 2011, 12–14 December 2011. MIT Press 2011, Granada, pp 1–9 Chen MM, Weinberger KQ, Blitzer J (2011) Co-training for domain adaptation. In: Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira FCN, Weinberger KQ (eds) Advances in neural information processing systems 24: 25th annual conference on neural information processing systems 2011, 12–14 December 2011. MIT Press 2011, Granada, pp 1–9
23.
Zurück zum Zitat McCallum AK, Nigam K, Rennie J, Seymore K (2000) Automating the construction of internet portals with machine learning. Inf Retr 3(2):127–163CrossRef McCallum AK, Nigam K, Rennie J, Seymore K (2000) Automating the construction of internet portals with machine learning. Inf Retr 3(2):127–163CrossRef
24.
Zurück zum Zitat Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manag 24(5):513–523CrossRef Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manag 24(5):513–523CrossRef
Metadaten
Titel
Knowledge transfer across different domain data with multiple views
verfasst von
Qi Tan
Huifang Deng
Pei Yang
Publikationsdatum
01.07.2014
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 1/2014
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-013-1432-9

Weitere Artikel der Ausgabe 1/2014

Neural Computing and Applications 1/2014 Zur Ausgabe

Premium Partner