Skip to main content

2002 | Buch

Smoothing Spline ANOVA Models

verfasst von: Chong Gu

Verlag: Springer New York

Buchreihe : Springer Series in Statistics

insite
SUCHEN

Über dieses Buch

Nonparametric function estimation with stochastic data, otherwise known as smoothing, has been studied by several generations of statisticians. Assisted by the recent availability of ample desktop and laptop computing power, smoothing methods are now finding their ways into everyday data analysis by practitioners.
While scores of methods have proved successful for univariate smoothing, ones practical in multivariate settings number far less. Smoothing spline ANOVA models are a versatile family of smoothing methods derived through roughness penalties that are suitable for both univariate and multivariate problems.
In this book, the author presents a comprehensive treatment of penalty smoothing under a unified framework. Methods are developed for (i) regression with Gaussian and non-Gaussian responses as well as with censored life time data; (ii) density and conditional density estimation under a variety of sampling schemes; and (iii) hazard rate estimation with censored life time data and covariates. The unifying themes are the general penalized likelihood method and the construction of multivariate models with built-in ANOVA decompositions. Extensive discussions are devoted to model construction, smoothing parameter selection, computation, and asymptotic convergence. Most of the computational and data analytical tools discussed in the book are implemented in R, an open-source clone of the popular S/S- PLUS language. Code for regression has been distributed in the R package gss freely available through the Internet on CRAN, the Comprehensive R Archive Network. The use of gss facilities is illustrated in the book through simulated and real data examples.

Inhaltsverzeichnis

Frontmatter
1. Introduction
Abstract
Data and models are two sources of information in a statistical analysis. Data carry noise but are “unbiased,” whereas models, effectively a set of constraints, help to reduce noise but are responsible for “biases.” Representing the two extremes on the spectrum of “bias-variance” trade-off are standard parametric models and constraint-free nonparametric “models” such as the empirical distribution for a probability density. In between the two extremes, there exist scores of nonparametric or semiparametric models, of which most are also known as smoothing methods. A family of such nonparametric models in a variety of stochastic settings can be derived through the penalized likelihood method, forming the subject of this book.
Chong Gu
2. Model Construction
Abstract
The two basic components of a statistical model, the deterministic part and the stochastic part, are well separated in the penalized likelihood score L(f) + (λ/2)J(f) of (1.3). The deterministic part is specified via J(f), which defines the notion of smoothness for functions on the domain X. The stochastic part is characterized by L(f),which reflects the sampling structure of the data.
Chong Gu
3. Regression with Gaussian-Type Responses
Abstract
For regression with Gaussian responses, L( f ) + (λ/2)J( f ) reduces to the familiar penalized least squares functional. Among topics of primary interest are the selection of smoothing parameters, the computation of the estimates, the asymptotic convergence of the estimates, and various data analytical tools.
Chong Gu
4. More Splines
Abstract
The framework for model construction as laid out in Chapter 2 takes as building blocks any reproducing kernel. The polynomial splines of §2.3 are the standard choices on continuous domains, but generalizations or restrictions are sometimes called for by the nature of the applications. The technical underpinnings of the variants are generally different from that of polynomial splines, but once the reproducing kernels are specified, everything else remains largely intact.
Chong Gu
5. Regression with Responses from Exponential Families
Abstract
For responses from exponential family distributions, (1.4) of Example 1.1 defines penalized likelihood regression. Among topics of primary interest are the selection of smoothing parameters, the computation of the estimates, the asymptotic behavior of the estimates, and various data analytical tools.
Chong Gu
6. Probability Density Estimation
Abstract
For observational data, (1.5) of Example 1.2 defines penalized likelihood density estimation. Of interest are the selection of smoothing parameters, the computation of the estimates, and the asymptotic behavior of the estimates. Variants of (1.5) are also called for to accommodate data subject to selection bias and data from conditional distributions.
Chong Gu
7. Hazard Rate Estimation
Abstract
For right-censored lifetime data with possible left-truncation, (1.6) of Example 1.3 defines penalized likelihood hazard estimation. Of interest are the selection of smoothing parameters, the computation of the estimates, and the asymptotic behavior of the estimates.
Chong Gu
8. Asymptotic Convergence
Abstract
In this chapter, we develop an asymptotic theory concerning the rates of convergence of penalized likelihood estimates to the target functions as the sample size goes to infinity. The rates are calculated in terms of problem-specific loss functions derived from the respective stochastic settings.
Chong Gu
Backmatter
Metadaten
Titel
Smoothing Spline ANOVA Models
verfasst von
Chong Gu
Copyright-Jahr
2002
Verlag
Springer New York
Electronic ISBN
978-1-4757-3683-0
Print ISBN
978-1-4419-2966-2
DOI
https://doi.org/10.1007/978-1-4757-3683-0