Skip to main content

2020 | OriginalPaper | Buchkapitel

2. Emergence of Statistical Methodologies with the Rise of BIG Data

verfasst von : Nedret Billor, Asuman S. Turkmen

Erschienen in: Women in Industrial and Systems Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Due to the acceleration of electronic computation and the generation of the “BIG”datasets at an unprecedented pace in many fields, there have been great advancements in the development of statistical/machine learning methodologies as we enter the twenty-first century. It is very important to be able to analyze such complex and high dimensional datasets yielding valuable information that deepens understanding, improves decision making, and enhances the performance of predictive models. For instance, the current problems encountered in manufacturing industry, such as quality improvement initiatives, determination of user expectations for a new product, and manufacturing cost estimation, become more difficult to solve as the high dimensional and complex structured data have become available. In order to overcome some of the today’s challenges of a complex manufacturing system, statistical/machine learning techniques have been utilized and found that these have been remarkably helpful to handle the problems arising in this field. We have three objectives in this chapter. The first is to highlight the developments in new statistical/machine learning algorithms, including—but not limited to—deep learning, random forests, support vector machine, dimension reduction, and sparse modeling. The second is to present how these new enormously ambitious data-driven algorithms play important role in analyzing datasets, generated continuously every second from various scientific disciplines such as engineering, biology, neuroscience, chemistry. As the complexity and size of generated data increased for the past two decades, the invention of data-driven algorithms has flourished and became more important than their inferential justifications which are an important aspect of a statistical analysis. Therefore, the third objective of the chapter is to explain how inferential analyses, the theories by which statisticians choose among competing methods, evolve in the twenty-first century as the invention of these new algorithms continues at a fast pace.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Aguirre-Urreta MI, Rönkkö M (2017) Statistical inference with PLSc using bootstrap confidence intervals. MIS Quarterly. Aguirre-Urreta MI, Rönkkö M (2017) Statistical inference with PLSc using bootstrap confidence intervals. MIS Quarterly.
Zurück zum Zitat Allaire JJ, Chollet F (2018) Keras: R interface to “Keras”. R package version 2.1.3 Allaire JJ, Chollet F (2018) Keras: R interface to “Keras”. R package version 2.1.3
Zurück zum Zitat Auret L, Aldrich C (2010) Unsupervised process fault detection with random forests. Ind Eng Chem Res 49(19):9184–9194CrossRef Auret L, Aldrich C (2010) Unsupervised process fault detection with random forests. Ind Eng Chem Res 49(19):9184–9194CrossRef
Zurück zum Zitat Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B 57(1):289–300MathSciNetMATH Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B 57(1):289–300MathSciNetMATH
Zurück zum Zitat Benkedjouh T, Medjaher K, Zerhouni N, Rechak S (2015) Health assessment and life prediction of cutting tools based on support vector regression. J Intell Manuf 26(2):213–223CrossRef Benkedjouh T, Medjaher K, Zerhouni N, Rechak S (2015) Health assessment and life prediction of cutting tools based on support vector regression. J Intell Manuf 26(2):213–223CrossRef
Zurück zum Zitat Bertino E, Catania B, Caglio E (1999) Applying data mining techniques to wafer manufacturing. In: Zytkow JM, Rauch J (eds) PKDD’99, LNAI, vol 1704. Springer, Berlin, pp 41–50 Bertino E, Catania B, Caglio E (1999) Applying data mining techniques to wafer manufacturing. In: Zytkow JM, Rauch J (eds) PKDD’99, LNAI, vol 1704. Springer, Berlin, pp 41–50
Zurück zum Zitat Biau G, Devroye L, Lugosi G (2008) Consistency of random forests and other averaging classifiers. J Mach Learn Res 9:2015–2033MathSciNetMATH Biau G, Devroye L, Lugosi G (2008) Consistency of random forests and other averaging classifiers. J Mach Learn Res 9:2015–2033MathSciNetMATH
Zurück zum Zitat Blanchard G, Bousquet O, Massart P (2004) Statistical performance of support vector machines. Technical Report Blanchard G, Bousquet O, Massart P (2004) Statistical performance of support vector machines. Technical Report
Zurück zum Zitat Caydas U, Ekici S (2010) Support vector machines models for surface roughness prediction in CNC turning of AISI 304 austenitic stainless steel. J Intell Manuf 23:639–650CrossRef Caydas U, Ekici S (2010) Support vector machines models for surface roughness prediction in CNC turning of AISI 304 austenitic stainless steel. J Intell Manuf 23:639–650CrossRef
Zurück zum Zitat Chang YC, Mastrangelo C (2011) Addressing multicollinearity in semiconductor manufacturing. Qual Reliab Eng Int 27:843–854CrossRef Chang YC, Mastrangelo C (2011) Addressing multicollinearity in semiconductor manufacturing. Qual Reliab Eng Int 27:843–854CrossRef
Zurück zum Zitat Chiang LH, Pell RJ, Seasholtz MB (2003) Exploring process data with the use of robust outlier detection algorithms. J Process Control 13(5):437–449CrossRef Chiang LH, Pell RJ, Seasholtz MB (2003) Exploring process data with the use of robust outlier detection algorithms. J Process Control 13(5):437–449CrossRef
Zurück zum Zitat Cho S, Asfour S, Onar A, Kaundinya N (2005) Tool breakage detection using support vector machine learning in a milling process. Int J Mach Tools Manuf 45(3):241–249CrossRef Cho S, Asfour S, Onar A, Kaundinya N (2005) Tool breakage detection using support vector machine learning in a milling process. Int J Mach Tools Manuf 45(3):241–249CrossRef
Zurück zum Zitat Dauxois J, Pousse A, Romain Y (1982) Asymptotic theory for the principal component analysis of a vector random function: some applications to statistical inference. J Multivar Anal 12(1):136–154MathSciNetMATHCrossRef Dauxois J, Pousse A, Romain Y (1982) Asymptotic theory for the principal component analysis of a vector random function: some applications to statistical inference. J Multivar Anal 12(1):136–154MathSciNetMATHCrossRef
Zurück zum Zitat de Jong S (1993) SIMPLS: An alternative approach to partial least squares regression. Chemome Intell Lab Syst 18:251–263CrossRef de Jong S (1993) SIMPLS: An alternative approach to partial least squares regression. Chemome Intell Lab Syst 18:251–263CrossRef
Zurück zum Zitat de Ketelaere K, Hubert M, Schmitt E (2015) Overview of PCA based statistical process monitoring methods for time-dependent, high dimensional data. J Qual Technol 47:318–335CrossRef de Ketelaere K, Hubert M, Schmitt E (2015) Overview of PCA based statistical process monitoring methods for time-dependent, high dimensional data. J Qual Technol 47:318–335CrossRef
Zurück zum Zitat Deng L, Seltzer M, Yu D, Acero A, Mohamed A, Hinton GE (2010) Binary coding of speech spectrograms using a deep auto-encoder. In” Proceedings of 11th annual conference of the international speech communication association, vol 3, pp 1692–1695 Deng L, Seltzer M, Yu D, Acero A, Mohamed A, Hinton GE (2010) Binary coding of speech spectrograms using a deep auto-encoder. In” Proceedings of 11th annual conference of the international speech communication association, vol 3, pp 1692–1695
Zurück zum Zitat Dijkstra TK, Henseler J (2015) Consistent partial least squares path modeling. MIS Q 39(2):297–316CrossRef Dijkstra TK, Henseler J (2015) Consistent partial least squares path modeling. MIS Q 39(2):297–316CrossRef
Zurück zum Zitat Dunia R, Edgar TF, Nixon M (2013) Process monitoring using principal components in parallel coordinates. AIChE J 59(2):445–456CrossRef Dunia R, Edgar TF, Nixon M (2013) Process monitoring using principal components in parallel coordinates. AIChE J 59(2):445–456CrossRef
Zurück zum Zitat Efron B (2010) Large-scale inference: Empirical Bayes methods for estimation, testing, and prediction. Institute of mathematical statistics onographs, Vol 1. Cambridge University Press, CambridgeCrossRef Efron B (2010) Large-scale inference: Empirical Bayes methods for estimation, testing, and prediction. Institute of mathematical statistics onographs, Vol 1. Cambridge University Press, CambridgeCrossRef
Zurück zum Zitat Efron B (2014) Estimation and accuracy after model selection (with discussion). J Am Stat Assoc 109(507):991–1007MATHCrossRef Efron B (2014) Estimation and accuracy after model selection (with discussion). J Am Stat Assoc 109(507):991–1007MATHCrossRef
Zurück zum Zitat Efron B, Hastie T (2016) Computer age statistical inference: algorithms, evidence, and data science. Institute of mathematical statistics monographs, 1st edn. Cambridge University Press, CambridgeMATHCrossRef Efron B, Hastie T (2016) Computer age statistical inference: algorithms, evidence, and data science. Institute of mathematical statistics monographs, 1st edn. Cambridge University Press, CambridgeMATHCrossRef
Zurück zum Zitat Efron B, Turnbull B, Narasimhan B (2015) locfdr: Computes local false discovery rates. R package version 1.1-8 Efron B, Turnbull B, Narasimhan B (2015) locfdr: Computes local false discovery rates. R package version 1.1-8
Zurück zum Zitat Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360MathSciNetMATHCrossRef Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360MathSciNetMATHCrossRef
Zurück zum Zitat Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1–22CrossRef Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33(1):1–22CrossRef
Zurück zum Zitat Ge Z, Song Z (2010) A comparative study of just-in-time-learning based methods for online soft sensor modeling. Chemom Intell Lab Syst 104(2):306–317CrossRef Ge Z, Song Z (2010) A comparative study of just-in-time-learning based methods for online soft sensor modeling. Chemom Intell Lab Syst 104(2):306–317CrossRef
Zurück zum Zitat Genuer R, Poggi JM, Tuleau C (2008) Random forests: some methodological insights. Technical report, INRIA Genuer R, Poggi JM, Tuleau C (2008) Random forests: some methodological insights. Technical report, INRIA
Zurück zum Zitat Hable R (2012) Asymptotic normality of support vector machine variants and other regularized kernel methods. J Multivar Anal 106:92–117MathSciNetMATHCrossRef Hable R (2012) Asymptotic normality of support vector machine variants and other regularized kernel methods. J Multivar Anal 106:92–117MathSciNetMATHCrossRef
Zurück zum Zitat Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: prediction, inference and data mining, 2nd edn. SpringerMATHCrossRef Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: prediction, inference and data mining, 2nd edn. SpringerMATHCrossRef
Zurück zum Zitat Hihi SE, Bengio Y (1996) Hierarchical recurrent neural networks for long-term dependencies. Adv Neural Inf Process Syst 8:493–499 Hihi SE, Bengio Y (1996) Hierarchical recurrent neural networks for long-term dependencies. Adv Neural Inf Process Syst 8:493–499
Zurück zum Zitat Hoerl A, Kennard RW (1970) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12:55–67MATHCrossRef Hoerl A, Kennard RW (1970) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12:55–67MATHCrossRef
Zurück zum Zitat Hyvorinen A, Karhunen J, Oja E (2001) Independent component analysis, 1st edn. Wiley, New YorkCrossRef Hyvorinen A, Karhunen J, Oja E (2001) Independent component analysis, 1st edn. Wiley, New YorkCrossRef
Zurück zum Zitat Irani KB, Cheng J, Fayyad UM, Qian Z (1993) Applying machine learning to semiconductor manufacturing. IEEE Exp 8:41–47CrossRef Irani KB, Cheng J, Fayyad UM, Qian Z (1993) Applying machine learning to semiconductor manufacturing. IEEE Exp 8:41–47CrossRef
Zurück zum Zitat Jain P, Rahman I, Kulkarni BD (2007) Development of a soft sensor for a batch distillation column using support vector regression techniques. Chem Eng Res Des 85(2):283–287CrossRef Jain P, Rahman I, Kulkarni BD (2007) Development of a soft sensor for a batch distillation column using support vector regression techniques. Chem Eng Res Des 85(2):283–287CrossRef
Zurück zum Zitat Janssens O, Slavkovikj V, Vervisch B, Stockman K, Loccufier M, Verstockt S, et al. (2016) Convolution neural network based fault detection for rotating machinery. J Sound Vib 377:331–345CrossRef Janssens O, Slavkovikj V, Vervisch B, Stockman K, Loccufier M, Verstockt S, et al. (2016) Convolution neural network based fault detection for rotating machinery. J Sound Vib 377:331–345CrossRef
Zurück zum Zitat Javanmard A, Montanari A (2014) Confidence intervals and hypothesis testing for high-dimensional regression. J Mach Learn Res 15:2869–2909MathSciNetMATH Javanmard A, Montanari A (2014) Confidence intervals and hypothesis testing for high-dimensional regression. J Mach Learn Res 15:2869–2909MathSciNetMATH
Zurück zum Zitat Jia F, Lei Y, Lin J, Zhou X, Lu N (2016) Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mech Syst Signal Process 72–73:303–315CrossRef Jia F, Lei Y, Lin J, Zhou X, Lu N (2016) Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mech Syst Signal Process 72–73:303–315CrossRef
Zurück zum Zitat Jolliffe IT (2002) Principal component analysis. Springer series in statistics, 2nd edn. Springer, New York Jolliffe IT (2002) Principal component analysis. Springer series in statistics, 2nd edn. Springer, New York
Zurück zum Zitat Kao LJ, Lee TS, Lu CJ (2016) A multi-stage control chart pattern recognition scheme based on independent component analysis and support vector machine. J Intell Manuf 27(3):653–664CrossRef Kao LJ, Lee TS, Lu CJ (2016) A multi-stage control chart pattern recognition scheme based on independent component analysis and support vector machine. J Intell Manuf 27(3):653–664CrossRef
Zurück zum Zitat Karoui N, Purdom E (2016) The bootstrap, covariance matrices and PCA in moderate and high-dimensions. arXiv:1608.00948 Karoui N, Purdom E (2016) The bootstrap, covariance matrices and PCA in moderate and high-dimensions. arXiv:1608.00948
Zurück zum Zitat Le S, Josse J, Husson F (2008) FactoMineR: An R package for multivariate analysis. J Stat Softw 25(1):1–18CrossRef Le S, Josse J, Husson F (2008) FactoMineR: An R package for multivariate analysis. J Stat Softw 25(1):1–18CrossRef
Zurück zum Zitat Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–324CrossRef Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–324CrossRef
Zurück zum Zitat Lee JM, Yoo C, Choi SW, Vanrolleghem PA, Lee IB (2004) Nonlinear process monitoring using kernel principal component analysis. Chem Eng Sci 59(1):223–234CrossRef Lee JM, Yoo C, Choi SW, Vanrolleghem PA, Lee IB (2004) Nonlinear process monitoring using kernel principal component analysis. Chem Eng Sci 59(1):223–234CrossRef
Zurück zum Zitat Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18–22 Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18–22
Zurück zum Zitat Lim HK, Kim Y, Kim MK (2017) Failure prediction using sequential pattern mining in the wire bonding process. IEEE Trans Semicond Manuf 30(3):285–292CrossRef Lim HK, Kim Y, Kim MK (2017) Failure prediction using sequential pattern mining in the wire bonding process. IEEE Trans Semicond Manuf 30(3):285–292CrossRef
Zurück zum Zitat Lin Y (2000) Some asymptotic properties of the support vector machine. Technical report 1029. Department of Statistics, University of Wisconsin-Madison Lin Y (2000) Some asymptotic properties of the support vector machine. Technical report 1029. Department of Statistics, University of Wisconsin-Madison
Zurück zum Zitat Malhi A, Yan R, Gao RX (2011) Prognosis of defect propagation based on recurrent neural networks. IEEE Trans Instrum Meas 60(3):703–711CrossRef Malhi A, Yan R, Gao RX (2011) Prognosis of defect propagation based on recurrent neural networks. IEEE Trans Instrum Meas 60(3):703–711CrossRef
Zurück zum Zitat Mallows CL (1973) Some comments on C P. Technometrics 15(4):661–675MATH Mallows CL (1973) Some comments on C P. Technometrics 15(4):661–675MATH
Zurück zum Zitat Marchini JL, Heaton C, Ripley BD (2017) fastICA: FastICA algorithms to perform ICA and projection pursuit. R package version 1.2–1 Marchini JL, Heaton C, Ripley BD (2017) fastICA: FastICA algorithms to perform ICA and projection pursuit. R package version 1.2–1
Zurück zum Zitat Melhem M, Ananou B, Ouladsine M, Pinaton J (2016) Regression methods for predicting the product’s quality in the semiconductor manufacturing process. IFAC-papers online, vol 49, pp 83–88CrossRef Melhem M, Ananou B, Ouladsine M, Pinaton J (2016) Regression methods for predicting the product’s quality in the semiconductor manufacturing process. IFAC-papers online, vol 49, pp 83–88CrossRef
Zurück zum Zitat Mentch L, Hooker G (2016) Quantifying uncertainty in random forests via confidence intervals and hypothesis tests. J Mach Learn Res 17:1–41MathSciNetMATH Mentch L, Hooker G (2016) Quantifying uncertainty in random forests via confidence intervals and hypothesis tests. J Mach Learn Res 17:1–41MathSciNetMATH
Zurück zum Zitat Mentch L, Hooker G (2014) Ensemble trees and CLTs: statistical inference for supervised learning. arXiv preprint arXiv:1404.6473 Mentch L, Hooker G (2014) Ensemble trees and CLTs: statistical inference for supervised learning. arXiv preprint arXiv:1404.6473
Zurück zum Zitat Mevik BH, Wehrens R, Liland KH (2016) pls: Partial least squares and principal component regression. R package version 2.6-0 Mevik BH, Wehrens R, Liland KH (2016) pls: Partial least squares and principal component regression. R package version 2.6-0
Zurück zum Zitat Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F (2017) e1071: Misc functions of the Department of Statistics, Probability Theory Group, (Formerly: E1071), TU Wien. R package version 1.6-8 Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F (2017) e1071: Misc functions of the Department of Statistics, Probability Theory Group, (Formerly: E1071), TU Wien. R package version 1.6-8
Zurück zum Zitat Miller Jr RG (1981) Simultaneous statistical inference. Springer series in statistics, 2nd edn. Springer, New YorkMATHCrossRef Miller Jr RG (1981) Simultaneous statistical inference. Springer series in statistics, 2nd edn. Springer, New YorkMATHCrossRef
Zurück zum Zitat Oksanen J, Blanchet GF, Friendly M, Kindt R, Legendre P, McGlinn D, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHH, Szoecs E, Wagner H (2017) vegan: Community ecology package. R package version 2.4-5 Oksanen J, Blanchet GF, Friendly M, Kindt R, Legendre P, McGlinn D, Minchin PR, O’Hara RB, Simpson GL, Solymos P, Stevens MHH, Szoecs E, Wagner H (2017) vegan: Community ecology package. R package version 2.4-5
Zurück zum Zitat Pardo M, Sberveglieri G (2008) Random forests and nearest Shrunken centroids for the classification of sensor array data. Sens Actuators B Chem 131:93–99CrossRef Pardo M, Sberveglieri G (2008) Random forests and nearest Shrunken centroids for the classification of sensor array data. Sens Actuators B Chem 131:93–99CrossRef
Zurück zum Zitat Puggini L, Doyle J, McLoone S (2016) Fault detection using random forest similarity distance. IFAC-Safe Process 49(5):132–137 Puggini L, Doyle J, McLoone S (2016) Fault detection using random forest similarity distance. IFAC-Safe Process 49(5):132–137
Zurück zum Zitat Qin SJ (2003) Statistical process monitoring: basics and beyond. J Chemom 17:480–502CrossRef Qin SJ (2003) Statistical process monitoring: basics and beyond. J Chemom 17:480–502CrossRef
Zurück zum Zitat Ribeiro B (2005) Support vector machines for quality monitoring in a plastic injection molding process. IEEE Trans Syst Man Cybern C (Appl Rev) 35:401–410CrossRef Ribeiro B (2005) Support vector machines for quality monitoring in a plastic injection molding process. IEEE Trans Syst Man Cybern C (Appl Rev) 35:401–410CrossRef
Zurück zum Zitat Saidi L, Ail JB, Friaiech F (2015) Application of higher order spectral features and support vector machines for bearing faults classification. ISA Trans 54:193–206CrossRef Saidi L, Ail JB, Friaiech F (2015) Application of higher order spectral features and support vector machines for bearing faults classification. ISA Trans 54:193–206CrossRef
Zurück zum Zitat Saybani MR, Wah TY, Amini A, Yazdi S, Lahsasna A (2011) Applications of support vector machines in oil refineries: A survey. Int J Phys Sci 6(27):6295–6302 Saybani MR, Wah TY, Amini A, Yazdi S, Lahsasna A (2011) Applications of support vector machines in oil refineries: A survey. Int J Phys Sci 6(27):6295–6302
Zurück zum Zitat Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85–117CrossRef Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85–117CrossRef
Zurück zum Zitat Schölkopf B, Burges C, Smola A (1999) Advances in kernel methods: support vector learning. MIT Press, CambridgeMATH Schölkopf B, Burges C, Smola A (1999) Advances in kernel methods: support vector learning. MIT Press, CambridgeMATH
Zurück zum Zitat Scovel JC, Steinwart I (2004) Fast rates for support vector machines using gaussian kernels. Technical report LA-UR04-8796, Los Alamos National Laboratory Scovel JC, Steinwart I (2004) Fast rates for support vector machines using gaussian kernels. Technical report LA-UR04-8796, Los Alamos National Laboratory
Zurück zum Zitat Smolensky PI (1986) Information processing in dynamical systems: foundations of harmony theory, parallel distributed processing: explorations in the micro structure of cognition. MIT Press, Cambridge Smolensky PI (1986) Information processing in dynamical systems: foundations of harmony theory, parallel distributed processing: explorations in the micro structure of cognition. MIT Press, Cambridge
Zurück zum Zitat Sokol A, Maathuis MH, Falkeborg B (2014) Quantifying identifiability in independent component analysis. Electron J Stat 8:1438–1459MathSciNetMATHCrossRef Sokol A, Maathuis MH, Falkeborg B (2014) Quantifying identifiability in independent component analysis. Electron J Stat 8:1438–1459MathSciNetMATHCrossRef
Zurück zum Zitat Steinwart I (2005) Consistency of support vector machines and other regularized kernel machines. IEEE Trans Inform Theory 51:128–142MathSciNetMATHCrossRef Steinwart I (2005) Consistency of support vector machines and other regularized kernel machines. IEEE Trans Inform Theory 51:128–142MathSciNetMATHCrossRef
Zurück zum Zitat Susto GA, Beghi A (2013) A virtual metrology system based on least angle regression and statistical clustering. Appl Stoch Models Bus Ind 29:362–376MathSciNetCrossRef Susto GA, Beghi A (2013) A virtual metrology system based on least angle regression and statistical clustering. Appl Stoch Models Bus Ind 29:362–376MathSciNetCrossRef
Zurück zum Zitat Tenenbaum JB, Silva VD, Langford JC (2010) A global geometric framework for nonlinear dimensionality reduction. Science 290:2319–2323CrossRef Tenenbaum JB, Silva VD, Langford JC (2010) A global geometric framework for nonlinear dimensionality reduction. Science 290:2319–2323CrossRef
Zurück zum Zitat Tian Y, Fu M, Wu F (2015) Steel plates fault diagnosis on the basis of support vector machines. Neurocomputing 151:296–303CrossRef Tian Y, Fu M, Wu F (2015) Steel plates fault diagnosis on the basis of support vector machines. Neurocomputing 151:296–303CrossRef
Zurück zum Zitat Tibshirani R (1996) Regression shrinkage and selection via the LASSO. J R Stat Soc Ser B 58(1):267–288MathSciNetMATH Tibshirani R (1996) Regression shrinkage and selection via the LASSO. J R Stat Soc Ser B 58(1):267–288MathSciNetMATH
Zurück zum Zitat Tibshirani R, Taylor J, Loftus J, Reid S (2016) selectiveInference: tools for post-selection inference, R package version 1.1.3 Tibshirani R, Taylor J, Loftus J, Reid S (2016) selectiveInference: tools for post-selection inference, R package version 1.1.3
Zurück zum Zitat Thornhill NF, Shah SL, Huang B, Vishnubhotla A (2002) Spectral principal component analysis of dynamic process data. Control Eng Pract 10(8):833–846CrossRef Thornhill NF, Shah SL, Huang B, Vishnubhotla A (2002) Spectral principal component analysis of dynamic process data. Control Eng Pract 10(8):833–846CrossRef
Zurück zum Zitat van de Geer S, Bühlmann P, Ritov Y, Dezeure R (2014) On asymptotically optimal confidence regions and tests for high-dimensional models. Ann Stat 42(3):1166–1202MathSciNetMATHCrossRef van de Geer S, Bühlmann P, Ritov Y, Dezeure R (2014) On asymptotically optimal confidence regions and tests for high-dimensional models. Ann Stat 42(3):1166–1202MathSciNetMATHCrossRef
Zurück zum Zitat Wager S, Hastie T, Efron B (2014) Confidence intervals for random forests: The Jackknife and the infinitesimal Jackknife. J Mach Learn Res 15:1625–1651MathSciNetMATH Wager S, Hastie T, Efron B (2014) Confidence intervals for random forests: The Jackknife and the infinitesimal Jackknife. J Mach Learn Res 15:1625–1651MathSciNetMATH
Zurück zum Zitat Wang XZ, McGreavy C (1998) Automatic classification for mining process operational data. Ind Eng Chem Res 37(6):2215–2222CrossRef Wang XZ, McGreavy C (1998) Automatic classification for mining process operational data. Ind Eng Chem Res 37(6):2215–2222CrossRef
Zurück zum Zitat Wang P, Gao RX, Yan R (2017) A deep learning-based approach to material removal rate prediction in polishing. CIRP Ann Manuf Technol 66:429–432CrossRef Wang P, Gao RX, Yan R (2017) A deep learning-based approach to material removal rate prediction in polishing. CIRP Ann Manuf Technol 66:429–432CrossRef
Zurück zum Zitat Wang J, Ma Y, Zhang L, Gao RX, Wu D (2018) Deep learning for smart manufacturing: methods and applications. J Manuf Syst 48(Part C):144–156CrossRef Wang J, Ma Y, Zhang L, Gao RX, Wu D (2018) Deep learning for smart manufacturing: methods and applications. J Manuf Syst 48(Part C):144–156CrossRef
Zurück zum Zitat Wei T (2015) The convergence and asymptotic analysis of the generalized symmetric fast ICA algorithm. IEEE Trans Signal Process 63(24):6445–6458MathSciNetMATHCrossRef Wei T (2015) The convergence and asymptotic analysis of the generalized symmetric fast ICA algorithm. IEEE Trans Signal Process 63(24):6445–6458MathSciNetMATHCrossRef
Zurück zum Zitat Weimer D, Scholz-Reiter B, Shpitalni M (2016) Design of deep convolution neural network architectures for automated feature extraction in industrial inspection. CIRP Ann Manuf Technol 65(1):417–420CrossRef Weimer D, Scholz-Reiter B, Shpitalni M (2016) Design of deep convolution neural network architectures for automated feature extraction in industrial inspection. CIRP Ann Manuf Technol 65(1):417–420CrossRef
Zurück zum Zitat Westfall P, Young S (1993) Resampling-based multiple testing: examples and methods for p-value adjustment. Wiley series in probability and statistics. Wiley-InterscienceMATH Westfall P, Young S (1993) Resampling-based multiple testing: examples and methods for p-value adjustment. Wiley series in probability and statistics. Wiley-InterscienceMATH
Zurück zum Zitat Widodo A, Yang BS (2007) Support vector machine in machine condition monitoring and fault diagnosis. Mech Syst Signal Process 21:2560–2574CrossRef Widodo A, Yang BS (2007) Support vector machine in machine condition monitoring and fault diagnosis. Mech Syst Signal Process 21:2560–2574CrossRef
Zurück zum Zitat Wold H (1975) Path models with latent variables: the NIPALS approach. In: Quantitative sociology international perspectives on mathematical and statistical model building, pp 307–357. Academic Press Wold H (1975) Path models with latent variables: the NIPALS approach. In: Quantitative sociology international perspectives on mathematical and statistical model building, pp 307–357. Academic Press
Zurück zum Zitat Wu D, Jennings C, Terpenny J, Gao RX, Kumara S (2017) a comparative study on machine learning algorithms for smart manufacturing: tool wear prediction using random forests. J Manuf Sci Eng 139:071018–071027CrossRef Wu D, Jennings C, Terpenny J, Gao RX, Kumara S (2017) a comparative study on machine learning algorithms for smart manufacturing: tool wear prediction using random forests. J Manuf Sci Eng 139:071018–071027CrossRef
Zurück zum Zitat Xanthopoulos P, Razzaghi T (2013) A weighted support vector machine method for control chart pattern recognition. Comput Ind Eng 66:683–695CrossRef Xanthopoulos P, Razzaghi T (2013) A weighted support vector machine method for control chart pattern recognition. Comput Ind Eng 66:683–695CrossRef
Zurück zum Zitat Xiao Y, Wang H, Zhang L (2014) Two methods of selecting gaussian kernel parameters for one-class SVM and their application to fault detection. Knowl-Based Syst 59:75–84CrossRef Xiao Y, Wang H, Zhang L (2014) Two methods of selecting gaussian kernel parameters for one-class SVM and their application to fault detection. Knowl-Based Syst 59:75–84CrossRef
Zurück zum Zitat Yang B, Di X, Han T (2008) Random forests classifier for machine fault diagnosis. J Mech Sci Technol 22:1716–1725CrossRef Yang B, Di X, Han T (2008) Random forests classifier for machine fault diagnosis. J Mech Sci Technol 22:1716–1725CrossRef
Zurück zum Zitat Yao M, Wang H (2015) On-line monitoring of batch processes using generalized additive kernel principal component analysis. J Process Control 103:338–351MathSciNet Yao M, Wang H (2015) On-line monitoring of batch processes using generalized additive kernel principal component analysis. J Process Control 103:338–351MathSciNet
Zurück zum Zitat Yarin G (2016) Uncertainty in deep learning. Ph.D. thesis, Cambridge University Yarin G (2016) Uncertainty in deep learning. Ph.D. thesis, Cambridge University
Zurück zum Zitat You D, Gao X, Katayama S (2015) WPD-PCA-based laser welding process monitoring and defects diagnosis by using FNN and SVM. IEEE Trans Ind Electron 62(1):628–636CrossRef You D, Gao X, Katayama S (2015) WPD-PCA-based laser welding process monitoring and defects diagnosis by using FNN and SVM. IEEE Trans Ind Electron 62(1):628–636CrossRef
Zurück zum Zitat Yu J (2012) A Bayesian inference based two-stage support vector regression framework for soft sensor development in batch bioprocesses. Comput Chem Eng 41:134–144CrossRef Yu J (2012) A Bayesian inference based two-stage support vector regression framework for soft sensor development in batch bioprocesses. Comput Chem Eng 41:134–144CrossRef
Zurück zum Zitat Yu H, Khan F, Garaniya V (2015) Nonlinear Gaussian belief network based fault diagnosis for industrial processes. J Process Control 35:178–200CrossRef Yu H, Khan F, Garaniya V (2015) Nonlinear Gaussian belief network based fault diagnosis for industrial processes. J Process Control 35:178–200CrossRef
Zurück zum Zitat Zhang T (2004) Statistical behavior and consistency of classification methods based on convex risk minimization. Ann Stat 32:56–84MathSciNetMATHCrossRef Zhang T (2004) Statistical behavior and consistency of classification methods based on convex risk minimization. Ann Stat 32:56–84MathSciNetMATHCrossRef
Zurück zum Zitat Zhang Y, Teng Y, Zhang Y (2010) Complex process quality prediction using modified kernel partial least squares. Chem Eng Sci 65(6):2153–2158CrossRef Zhang Y, Teng Y, Zhang Y (2010) Complex process quality prediction using modified kernel partial least squares. Chem Eng Sci 65(6):2153–2158CrossRef
Zurück zum Zitat Zhang Y (2008) Fault detection and diagnosis of nonlinear processes using improved kernel independent component analysis (KICA) and support vector machine (SVM). Ind Eng Chem Res 47(18):6961–6971CrossRef Zhang Y (2008) Fault detection and diagnosis of nonlinear processes using improved kernel independent component analysis (KICA) and support vector machine (SVM). Ind Eng Chem Res 47(18):6961–6971CrossRef
Zurück zum Zitat Zhang W, He D, Jia R (2013) Online quality prediction for cobalt oxalate synthesis process using least squares support vector regression approach with dual updating. Control Eng Pract 21(10):1267–1276CrossRef Zhang W, He D, Jia R (2013) Online quality prediction for cobalt oxalate synthesis process using least squares support vector regression approach with dual updating. Control Eng Pract 21(10):1267–1276CrossRef
Zurück zum Zitat Zhang Y, Li S, Teng Y (2012) Dynamic processes monitoring using recursive kernel principal component analysis. Chem Eng Sci 72:78–86CrossRef Zhang Y, Li S, Teng Y (2012) Dynamic processes monitoring using recursive kernel principal component analysis. Chem Eng Sci 72:78–86CrossRef
Zurück zum Zitat Zhang C-H, Zhang S (2014) Confidence intervals for low-dimensional parameters with high-dimensional data. J R Stat Soc Ser B 76(1):217–242MathSciNetMATHCrossRef Zhang C-H, Zhang S (2014) Confidence intervals for low-dimensional parameters with high-dimensional data. J R Stat Soc Ser B 76(1):217–242MathSciNetMATHCrossRef
Zurück zum Zitat Zou C, Tseng ST, Wang Z (2014) Outlier detection in general profiles using penalized regression method. IIE Trans J Inst Ind Syst Eng 46(2):106–117 Zou C, Tseng ST, Wang Z (2014) Outlier detection in general profiles using penalized regression method. IIE Trans J Inst Ind Syst Eng 46(2):106–117
Metadaten
Titel
Emergence of Statistical Methodologies with the Rise of BIG Data
verfasst von
Nedret Billor
Asuman S. Turkmen
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-11866-2_2

Neuer Inhalt