ABSTRACT
Many machine learning methods have recently been applied to natural language processing tasks. Among them, the Winnow algorithm has been argued to be particularly suitable for NLP problems because of its robustness to irrelevant features. In theory, however, Winnow may not converge on non-separable data. To remedy this problem, a modification called regularized Winnow has been proposed. In this paper, we apply this new method to text chunking. We show that this method achieves state-of-the-art performance with significantly less computation than previous approaches.
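For readers unfamiliar with Winnow, the following is a minimal sketch of the basic multiplicative-update algorithm, in the style of Littlestone's Winnow2, on binary features. It is not the regularized variant applied in this paper, and it is not the authors' implementation. The function names, the promotion factor `alpha = 2.0`, and the threshold equal to the number of features are illustrative assumptions.

```python
from typing import List, Sequence, Tuple

Example = Tuple[Sequence[int], int]  # (binary feature vector, label in {0, 1})


def winnow_score(weights: Sequence[float], x: Sequence[int]) -> float:
    """Sum the weights of the features that are active (non-zero) in x."""
    return sum(w for w, xi in zip(weights, x) if xi)


def winnow_train(
    examples: Sequence[Example],
    n_features: int,
    alpha: float = 2.0,   # assumed promotion/demotion factor
    epochs: int = 100,    # assumed cap on passes over the data
) -> Tuple[List[float], float]:
    """Basic Winnow: multiplicative updates, applied only on mistakes."""
    weights = [1.0] * n_features
    threshold = float(n_features)  # assumed threshold choice
    for _ in range(epochs):
        mistakes = 0
        for x, y in examples:
            pred = 1 if winnow_score(weights, x) >= threshold else 0
            if pred == y:
                continue
            mistakes += 1
            # Promote active features on a missed positive,
            # demote them on a false positive.
            factor = alpha if y == 1 else 1.0 / alpha
            weights = [w * factor if xi else w for w, xi in zip(weights, x)]
        if mistakes == 0:
            break  # a full pass with no mistakes: the data has been separated
    return weights, threshold


def winnow_predict(weights: Sequence[float], threshold: float, x: Sequence[int]) -> int:
    return 1 if winnow_score(weights, x) >= threshold else 0
```

Because inactive features never change a weight and updates happen only on mistakes, features unrelated to the label tend to have little influence on the learned classifier. On non-separable data the loop above may never complete a mistake-free pass, which is the convergence issue that the regularized variant is meant to address.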
- Text chunking using regularized Winnow