Skip to main content
Top
Published in: The Journal of Supercomputing 2/2023

06-08-2022

Machine learning-based techniques for fault diagnosis in the semiconductor manufacturing process: a comparative study

Authors: Abubakar Abdussalam Nuhu, Qasim Zeeshan, Babak Safaei, Muhammad Atif Shahzad

Published in: The Journal of Supercomputing | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Industries are going through the fourth industrial revolution (Industry 4.0), where technologies like the Industrial Internet of things, big data analytics, and machine learning (ML) are extensively utilized to improve the productivity and efficiency of manufacturing systems and processes. This work aims to further investigate the applicability and improve the effectiveness of ML prediction models for fault diagnosis in the smart manufacturing process. Hence, we propose several methodologies and ML models for fault diagnosis for smart manufacturing process applications. A case study has been conducted on a real dataset from a semiconductor manufacturing (SECOM) process. However, this dataset contains missing values, noisy features, and class imbalance problem. This imbalance problem makes it so difficult to accurately predict the minority class, due to the majority class size difference. In the literature, efforts have been made to alleviate the class imbalance problem using several synthetic data generation techniques (SDGT) on the UCI machine learning repository SECOM dataset. In this work, to handle the imbalance problem, we employed, compared, and evaluated the feasibility of three SDGT on this dataset. To handle issues related to the missing values and noisy features, we implemented two missing values imputation techniques and feature selection techniques, respectively. We then developed and compared the performance of ten predictive ML models against these proposed methodologies. The results obtained across several evaluation metrics of performance were significant. A comparative analysis shows the feasibility and validate the effectiveness of these SDGT and the proposed methodologies. Some among the proposed methodologies could produce an accuracy in the range of 99.5% to 100%. Furthermore, based on a comparative analysis with similar models from the literature, our proposed models outpaced those proposed in the literature.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
3.
go back to reference Mccann M, Li Y, Maquire L, Johnston A (2010) Causality challenge: benchmarking relevant signal components for effective monitoring and process control. J Mach Learn Res Work Conf Proc 6:277–288 Mccann M, Li Y, Maquire L, Johnston A (2010) Causality challenge: benchmarking relevant signal components for effective monitoring and process control. J Mach Learn Res Work Conf Proc 6:277–288
10.
go back to reference Mack CA (2011) Fiftyyears of Moore’ s law. IEEE Fellow 24:2008 Mack CA (2011) Fiftyyears of Moore’ s law. IEEE Fellow 24:2008
16.
go back to reference Kerdprasop K, Kerdprasop N (2011) A data mining approach to automate fault detection model development in the semiconductor manufacturing process. Int J Mech 5:336–344 Kerdprasop K, Kerdprasop N (2011) A data mining approach to automate fault detection model development in the semiconductor manufacturing process. Int J Mech 5:336–344
19.
go back to reference Han H, Wang W-Y, Mao B-H (2005) Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning Han H, Wang W-Y, Mao B-H (2005) Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning
22.
go back to reference Bunkhumpornpat C, Sinapiromsaran K, Lursinsap C (2009) Safe-level-SMOTE: safe-level-synthetic minority over-sampling technique for handling the class imbalanced problem. Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics). 5476 LNAI, pp 475–482. https://doi.org/10.1007/978-3-642-01307-2_43 Bunkhumpornpat C, Sinapiromsaran K, Lursinsap C (2009) Safe-level-SMOTE: safe-level-synthetic minority over-sampling technique for handling the class imbalanced problem. Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics). 5476 LNAI, pp 475–482. https://​doi.​org/​10.​1007/​978-3-642-01307-2_​43
26.
go back to reference Wang Q, Luo Z, Huang J, Feng Y, Liu Z (2017) A novel ensemble method for imbalanced data learning. Comput Intell Neurosci 2017:1–11 Wang Q, Luo Z, Huang J, Feng Y, Liu Z (2017) A novel ensemble method for imbalanced data learning. Comput Intell Neurosci 2017:1–11
31.
go back to reference Moldovan D, Anghel I, Cioara T, Salomie I (2020) Particle swarm optimization based deep learning ensemble for manufacturing processes. In: Proceedings of 2020 IEEE 16th international conference on intelligent computer communication and processing ICCP 2020, pp 563–570. https://doi.org/10.1109/ICCP51029.2020.9266269 Moldovan D, Anghel I, Cioara T, Salomie I (2020) Particle swarm optimization based deep learning ensemble for manufacturing processes. In: Proceedings of 2020 IEEE 16th international conference on intelligent computer communication and processing ICCP 2020, pp 563–570. https://​doi.​org/​10.​1109/​ICCP51029.​2020.​9266269
37.
go back to reference Anghel I, Cioara T, Moldovan D, Salomie I, Tomus MM (2018) Prediction of manufacturing processes errors: gradient boosted trees versus deep neural networks. In Proceedings of the 16th international conference on embedded and ubiquitous computing EUC 2018, pp 29–36. https://doi.org/10.1109/EUC.2018.00012 Anghel I, Cioara T, Moldovan D, Salomie I, Tomus MM (2018) Prediction of manufacturing processes errors: gradient boosted trees versus deep neural networks. In Proceedings of the 16th international conference on embedded and ubiquitous computing EUC 2018, pp 29–36. https://​doi.​org/​10.​1109/​EUC.​2018.​00012
42.
43.
go back to reference Moldovan D, Chifu V, Pop C, Cioara T, Anghel I, Salomie I (2018) Chicken swarm optimization and deep learning for manufacturing processes. In: Proceedings of the 17th RoEduNet IEEE international conference networking in education and research RoEduNet 2018, pp 18–23. https://doi.org/10.1109/ROEDUNET.2018.8514152 Moldovan D, Chifu V, Pop C, Cioara T, Anghel I, Salomie I (2018) Chicken swarm optimization and deep learning for manufacturing processes. In: Proceedings of the 17th RoEduNet IEEE international conference networking in education and research RoEduNet 2018, pp 18–23. https://​doi.​org/​10.​1109/​ROEDUNET.​2018.​8514152
47.
go back to reference Batista GEAPA, Monard MC (2002) A study of k-nearest neighbour as an imputation method. Front Artif Intell Appl 87:251–260 Batista GEAPA, Monard MC (2002) A study of k-nearest neighbour as an imputation method. Front Artif Intell Appl 87:251–260
49.
52.
go back to reference Josse J, Prost N, Scornet E, Varoquaux G, Josse J, Prost N, Scornet E, Varoquaux G, Josse J (2020) On the consistency of supervised learning with missing values Josse J, Prost N, Scornet E, Varoquaux G, Josse J, Prost N, Scornet E, Varoquaux G, Josse J (2020) On the consistency of supervised learning with missing values
56.
go back to reference Arora M, Bhambhu L, Tech Scholar M (2014) Role of scaling in data classification using SVM. Int J Adv Res Comput Sci Softw Eng 4:2277 Arora M, Bhambhu L, Tech Scholar M (2014) Role of scaling in data classification using SVM. Int J Adv Res Comput Sci Softw Eng 4:2277
Metadata
Title
Machine learning-based techniques for fault diagnosis in the semiconductor manufacturing process: a comparative study
Authors
Abubakar Abdussalam Nuhu
Qasim Zeeshan
Babak Safaei
Muhammad Atif Shahzad
Publication date
06-08-2022
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 2/2023
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-022-04730-x

Other articles of this Issue 2/2023

The Journal of Supercomputing 2/2023 Go to the issue

Premium Partner