Article

Free Access

Text chunking using regularized Winnow

Authors:
Tong Zhang

IBM T.J. Watson Research Center, Yorktown Heights, New York

IBM T.J. Watson Research Center, Yorktown Heights, New York
View Profile

,
Fred Damerau

IBM T.J. Watson Research Center, Yorktown Heights, New York

IBM T.J. Watson Research Center, Yorktown Heights, New York
View Profile

,
David Johnson

IBM T.J. Watson Research Center, Yorktown Heights, New York

IBM T.J. Watson Research Center, Yorktown Heights, New York
View Profile

ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational LinguisticsJuly 2001Pages 539–546https://doi.org/10.3115/1073012.1073081

Published:06 July 2001Publication History

ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics

Pages 539–546

ABSTRACT

Many machine learning methods have recently been applied to natural language processing tasks. Among them, the Winnow algorithm has been argued to be particularly suitable for NLP problems, due to its robustness to irrelevant features. However in theory, Winnow may not converge for non-separable data. To remedy this problem, a modification called regularized Winnow has been proposed. In this paper, we apply this new method to text chunking. We show that this method achieves state of the art performance with significantly less computation than previous approaches.

References

S. P. Abney. 1991. Parsing by chunks. In R. C. Berwick, S. P. Abney, and C. Tenny, editors, Principle-Based Parsing: Computation and Psycholinguistics, pages 257--278. Kluwer, Dordrecht.]] Google ScholarDigital Library
Eric Brill. 1994. Some advances in rule-based part of speech tagging. In Proc. AAAI 94, pages 722--727.]] Google ScholarDigital Library
I. Dagan, Y. Karov, and D. Roth. 1997. Mistake-driven learning in text categorization. In Proceedings of the Second Conference on Empirical Methods in NLP.]]Google Scholar
C. Gentile and M. K. Warmuth. 1998. Linear hinge loss and average margin. In Proc. NIPS'98.]] Google ScholarDigital Library
A. Grove and D. Roth. 2001. Linear concepts and hidden variables. Machine Learning, 42:123--141.]]Google ScholarDigital Library
R. Khardon, D. Roth, and L. Valiant. 1999. Relational learning for NLP using linear threshold elements. In Proceedings IJCAI-99.]] Google ScholarDigital Library
Taku Kudoh and Yuji Matsumoto. 2000. Use of support vector learning for chunk identification. In Proc. CoNLL-2000 and LLL-2000, pages 142--144.]] Google ScholarDigital Library
N. Littlestone. 1988. Learning quickly when irrelevant attributes abound: a new linear-threshold algorithm. Machine Learning, 2:285--318.]] Google ScholarCross Ref
Michael McCord. 1989. Slot grammar: a system for simple construction of practical natural language grammars. Natural Language and Logic, pages 118--145.]] Google ScholarDigital Library
Vasin Punyakanok and Dan Roth. 2001. The use of classifiers in sequential inference. In Todd K. Leen, Thomas G. Dietterich, and Volker Tresp, editors, Advances in Neural Information Processing Systems 13, pages 995--1001. MIT Press.]]Google Scholar
Erik F. Tjong Kim Sang and Sabine Buchholz. 2000. Introduction to the conll-2000 shared tasks: Chunking. In Proc. CoNLL-2000 and LLL-2000, pages 127--132.]] Google ScholarDigital Library
Hans van Halteren. 2000. Chunking with wpdv models. In Proc. CoNLL-2000 and LLL-2000, pages 154--156.]] Google ScholarDigital Library
Tong Zhang. 2001. Regularized winnow methods. In Advances in Neural Information Processing Systems 13, pages 703--709.]]Google Scholar

Text chunking using regularized Winnow

Recommendations

Hybrid text chunking
ConLL '00: Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7

This paper proposes an error-driven HMM-based text chunk tagger with context-dependent lexicon. Compared with standard HMM-based tagger, this tagger incorporates more contextual information into a lexical entry. Moreover, an error-driven learning ...
Read More
Text chunking by combining hand-crafted rules and memory-based learning
ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1

This paper proposes a hybrid of hand-crafted rules and a machine learning method for chunking Korean. In the partially free word-order languages such as Korean and Japanese, a small number of rules dominate the performance due to their well-developed ...
Read More
Noun phrase chunking in Hebrew: influence of lexical and morphological features
ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics

We present a method for Noun Phrase chunking in Hebrew. We show that the traditional definition of base-NPs as non-recursive noun phrases does not apply in Hebrew, and propose an alternative definition of Simple NPs. We review syntactic properties of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
July 2001
562 pages
General Chair:
Bonnie Lynn Webber
Sponsors
In-Cooperation
Publisher
Association for Computational Linguistics
United States
Publication History
- Published: 6 July 2001
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate85of443submissions,19%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 14
  Total Citations
  View Citations
- 420
  Total Downloads
- Downloads (Last 12 months)39
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Text chunking using regularized Winnow

ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

Hybrid text chunking

Text chunking by combining hand-crafted rules and memory-based learning

Noun phrase chunking in Hebrew: influence of lexical and morphological features

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Text chunking using regularized Winnow

ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

Hybrid text chunking

Text chunking by combining hand-crafted rules and memory-based learning

Noun phrase chunking in Hebrew: influence of lexical and morphological features

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media