Skip to main content

2003 | OriginalPaper | Buchkapitel

Smoothing Techniques for Tree-k-Grammar-Based Natural Language Modeling

verfasst von : Jose L. Verdú-Mas, Jorge Calera-Rubio, Rafael C. Carrasco

Erschienen in: Pattern Recognition and Image Analysis

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

In a previous work, a new probabilistic context-free grammar (PCFG) model for natural language parsing derived from a tree bank corpus has been introduced. The model estimates the probabilities according to a generalized k-grammar scheme for trees. It allows for faster parsing, decreases considerably the perplexity of the test samples and tends to give more structured and refined parses. However, it suffers from the problem of incomplete coverage. In this paper, we compare several smoothing techniques such as backing-off or interpolation that are used to avoid assigning zero probability to any sentence.

Metadaten
Titel
Smoothing Techniques for Tree-k-Grammar-Based Natural Language Modeling
verfasst von
Jose L. Verdú-Mas
Jorge Calera-Rubio
Rafael C. Carrasco
Copyright-Jahr
2003
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-540-44871-6_122

Premium Partner