Genetic transfer learning

https://doi.org/10.1016/j.eswa.2010.03.019Get rights and content

Abstract

Transfer learning is a method which aims to improve “related” tasks performance. Transfer learning tries to use information gained from related tasks solutions to improve performance of learning strategy. Transfer learning addresses the problem of how to utilize plenty of labeled data in a source domain to solve related but different problems in a target domain, even when the training and testing problems have different distributions or features (Pan, Kwok, & Yang, 2008). In this paper we have used transfer learning to improve performance of genetic algorithms.

Introduction

Traditional machine learning techniques try to solve one problem at a time. And some of the complex problems take a lot of time to solve by classic machine learning techniques. Also if there is not enough information about the problem it’s sometimes difficult to get the exact solution. Fig. 1 shows the difference of learning process between traditional machine learning techniques and transfer learning techniques. Traditional machine learning techniques try to learn each task from scratch, while transfer learning techniques try to transfer knowledge from previous tasks to a new target task with a few training data (Pan & Yang, 2008). In real life a human can solve complex problems easily by using his/her past experiences in the problem domains. Transfer learning is a humanlike learning strategy which aims to help problem solving and learning mechanisms by using information gained from previously solved related tasks. Transfer learning is used in many learning methods like neural networks (Murre, 1995, Pratt, 1993), markov logic networks (Mihalkova and Huynh, 2007, Mihalkova and Mooney, 2006), text categorization (Gupta & Ratinov, 2008), web page classification (Ling et al., 2008).

The key problem of transfer learning is task relatedness. Determining whether the tasks are related or not which the information will be transferred among is very important or how they are correlated. And decide transfer or not information between two tasks, how many information to transfer and how the information will be transferred.

Transfer learning is first used to set initial weight of a “target” neural network by using another, trained, same architecture (e.g. same size input and same size output) “source” neural network (Pratt, 1993). When the information gained from source network used directly (e.g. weights) it performs worse in target network than random initial weights because, weights in source network was too big and reduced very slowly by back-propagation. So gained information from source network is reduced by the benefit, which tested by target networks training data.

Another transfer learning example is markov logic networks, which consist of a set of weighted formulae and provides a way of softening first-order logic by making situations, in which not all formulae are satisfied, less likely but not impossible (Mihalkova & Huynh, 2007). In this working three real-world relational domains IMDB, UW-CSE, and WebKB are used. The aim of the working is training faster by transferring obtained rules form cross domains.

Transfer learning is also used in classification problems. Tasks sampled from the environment are used to improve classification performance on future tasks. We consider situations in which the tasks can be divided into groups. Tasks within each group are related by sharing a low dimensional representation, which differs across the groups (Argyriou, Maurer, & Pontil, 2008).

Transfer learning mostly used in reinforcement learning and lifelong learning. One of the fundamental working is “Lifelong Robot Learning” (Thrun & Mitchell, 1995).

More detailed information about transfer learning can be found in (Pan & Yang, 2008).

All workings above acquired good performance improvements. So we would like to try if we can acquire good performance improvements in other learning strategies like genetic algorithms.

Section snippets

Genetic transfer

Genetic programming is a domain-independent problem solving approach in which computer programs are evolved to solve, or approximately solve, problems. Genetic programming is based on the Darwinian principle of reproduction and survival of the fittest and analogs of naturally occurring genetic operations such as crossover (sexual recombination) and mutation (Koza, 1998). More detailed information can be found in (Gen and Cheng, 2000, Goldberg, 1989).

The aim in genetic transfer is, like the

Experimental results

In our experiments we tried two different “individual pool” creation methods. All distance and difference values are normalized values by the Eqs. (7), (8). Because values of these variables are depend on individual lengthNormalized difference=differencemax_valueNormalized distance=distancemax_value

  • Single transfer (S.T.): Single transfer pool creation method is used to create “individual pool” by only one running of source function, so size of individual pool is 100 × 3 = 300 for iteration count

Conclusion and future work

The “goodness” value is the probability that genetic transfer algorithm will have better performance than classic genetic algorithm, calculated by 10 independent runs. And similarly “difference” value is the answer for question “how much genetic transfer algorithm is better than classic genetic algorithm?” calculated by best fitness values of 10 independent runs. As it seen in experimental result, genetic transfer algorithm is a good choice when you have two similar optimization problems. In

References (15)

  • V.J. Milenkovic et al.

    Translational polygon containment and minimal enclosure using mathematical programming

    International Transactions in Operational Research

    (1999)
  • Argyriou, A., Maurer, A., & Pontil, M. (2008). An algorithm for transfer learning in a heterogeneous environment. In...
  • M. Gen et al.

    Genetic algorithms and engineering optimization

    (2000)
  • D.E. Goldberg

    Genetic algorithms in search, optimization and machine learning

    (1989)
  • Gupta, R., & Ratinov, L. (2008). Text categorization with knowledge transfer from heterogeneous data sources. In...
  • Hassan Mahmud, M. M. (2007). On universal transfer learning. In Proceedings of the 18th international conference on...
  • J. Koza

    Genetic programming

    Encyclopedia of Computer Science and Technology

    (1998)
There are more references available in the full text version of this article.

Cited by (55)

  • Cross-domain feature selection and diagnosis of oil and gas pipeline defects based on transfer learning

    2023, Engineering Failure Analysis
    Citation Excerpt :

    To address cross-domain diagnosis problems, TL methods have been proposed by many researchers [21,22]. Combining TL with the feature selection method or ML method can relax the restriction that test and training data must come from the same domain [23]. In many TL-based feature selection methods, an attempt is made to create a new feature projection space, and projected target data can be classified by traditional ML algorithms [24,25].

View all citing articles on Scopus
View full text