Open access

An Overview of Overfitting and its Solutions

Published under licence by IOP Publishing Ltd

Citation: Xue Ying 2019 J. Phys.: Conf. Ser. 1168 022022. DOI: 10.1088/1742-6596/1168/2/022022


Abstract

Overfitting is a fundamental issue in supervised machine learning: it prevents a model from generalizing beyond the observed training data to unseen data in the testing set. Overfitting arises from the presence of noise, the limited size of the training set, and the complexity of classifiers. This paper discusses overfitting from the perspectives of its causes and its solutions. To reduce the effects of overfitting, various strategies are proposed to address these causes: 1) the "early-stopping" strategy prevents overfitting by halting training before performance stops improving; 2) the "network-reduction" strategy excludes noise in the training set; 3) the "data-expansion" strategy lets complicated models fine-tune their hyper-parameters with a larger amount of data; and 4) the "regularization" strategy preserves model performance on real-world problems through feature selection, distinguishing more useful features from less useful ones.
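To make two of these strategies concrete, the following is a minimal sketch combining early stopping (strategy 1) and L2 regularization (strategy 4). It is illustrative only and not taken from the paper: the use of scikit-learn, the synthetic dataset, the SGDClassifier model, and the hyper-parameter values (alpha, patience, epoch count) are all assumptions made for demonstration.

```python
# Illustrative sketch (not from the paper): early stopping on a validation set
# plus an L2 penalty, using scikit-learn on synthetic data.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import SGDClassifier

# Synthetic data standing in for a small, noisy training set (flip_y injects label noise).
X, y = make_classification(n_samples=500, n_features=20, flip_y=0.1, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

# Regularization: an L2 penalty (alpha) shrinks the weights of less useful features.
clf = SGDClassifier(loss="log_loss", penalty="l2", alpha=1e-3, random_state=0)

# Early stopping: halt training once validation accuracy has not improved
# for `patience` consecutive epochs.
best_val, patience, stale = -np.inf, 5, 0
classes = np.unique(y_train)
for epoch in range(200):
    clf.partial_fit(X_train, y_train, classes=classes)
    val_acc = clf.score(X_val, y_val)
    if val_acc > best_val:
        best_val, stale = val_acc, 0
    else:
        stale += 1
    if stale >= patience:
        break  # performance on held-out data has stopped improving

print(f"stopped after {epoch + 1} epochs, best validation accuracy {best_val:.3f}")
```

The held-out validation split plays the role of "unseen data": training stops when further optimization of the training loss no longer translates into better validation performance, which is the point at which overfitting would otherwise begin.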


Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
