skip to main content
10.3115/1220835.1220873dlproceedingsArticle/Chapter ViewAbstractPublication PageshltConference Proceedingsconference-collections
Article
Free Access

Integrating probabilistic extraction models and data mining to discover relations and patterns in text

Published:04 June 2006Publication History

ABSTRACT

In order for relation extraction systems to obtain human-level performance, they must be able to incorporate relational patterns inherent in the data (for example, that one's sister is likely one's mother's daughter, or that children are likely to attend the same college as their parents). Hand-coding such knowledge can be time-consuming and inadequate. Additionally, there may exist many interesting, unknown relational patterns that both improve extraction performance and provide insight into text. We describe a probabilistic extraction model that provides mutual benefits to both "top-down" relational pattern discovery and "bottom-up" relation extraction.

References

  1. Eugene Agichtein and Luis Gravano. 2000. Snowball: Extracting relations from large plain-text collections. In Proceedings of the Fifth ACM International Conference on Digital Libraries. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Sergey Brin. 1998. Extracting patterns and relations from the world wide web. In WebDB Workshop at 6th International Conference on Extending Database Technology. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Razvan Bunescu and Raymond Mooney. 2006. Subsequence kernels for relation extraction. In Y. Weiss, B. Schölkopf, and J. Platt, editors, Advances in Neural Information Processing Systems 18. MIT Press, Cambridge, MA.Google ScholarGoogle Scholar
  4. Mark Craven, Dan DiPasquo, Dayne Freitag, Andrew K. McCallum, Tom M. Mitchell, Kamal Nigam, and Seán Slattery. 1998. Learning to extract symbolic knowledge from the World Wide Web. In Proceedings of AAAI-98, 15th Conference of the American Association for Artificial Intelligence, pages 509--516, Madison, US. AAAI Press, Menlo Park, US. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Aron Culotta and Andrew McCallum. 2004. Confidence estimation for information extraction. In Human Langauge Technology Conference (HLT 2004), Boston, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Aron Culotta and Jeffrey Sorensen. 2004. Dependency tree kernels for relation extraction. In ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Dehaspe. 1997. Maximum entropy modeling with clausal constraints. In Proceedings of the Seventh International Workshop on Inductive Logic Programming, pages 109--125, Prague, Czech Republic. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Hearst. 1999. Untangling text data mining. In 37th Annual Meeting of the Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Nanda Kambhatla. 2004. Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations. In ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. John Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. 18th International Conf. on Machine Learning, pages 282--289. Morgan Kaufmann, San Francisco, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Gideon Mann and David Yarowsky. 2005. Multi-field information extraction and cross-document fusion. In ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Masterson and N. Kushmerik. 2003. Information extraction from multi-document threads. In ECML-2003: Workshop on Adaptive Text Extraction and Mining, pages 34--41.Google ScholarGoogle Scholar
  13. Andrew McCallum and David Jensen. 2003. A note on the unification of information extraction and data mining using conditional-probability, relational models. In IJCAI03 Workshop on Learning Statistical Models from Relational Data.Google ScholarGoogle Scholar
  14. Andrew McCallum. 2002. Mallet: A machine learning for language toolkit, http://mallet.cs.umass.edu.Google ScholarGoogle Scholar
  15. Andrew McCallum. 2003. Efficiently inducing features of conditional random fields. In Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI03). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Scott Miller, Heidi Fox, Lance A. Ramshaw, and Ralph Weischedel. 2000. A novel use of statistical parsing to extract information from text. In ANLP. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Raymond J. Mooney and Razvan Bunescu. 2005. Mining knowledge from text using information extraction. SigKDD Explorations on Text Mining and Natural Language Processing. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Un Yong Nahm and Raymond J. Mooney. 2000. A mutually beneficial integration of data mining and information extraction. In AAAI/IAAI.Google ScholarGoogle Scholar
  19. Bradley L. Richards and Raymond J. Mooney. 1992. Learning relations by pathfinding. In Proceedings of the Tenth National Conference on Artificial Intelligence (AAAI-92), pages 50--55, San Jose, CA.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Dan Roth and Wen tau Yih. 2002. Probabilistic reasoning for entity and relation recognition. In COLING. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Sunita Sarawagi and William W. Cohen. 2004. Semi-markov conditional random fields for information extraction. In NIPS 04.Google ScholarGoogle Scholar
  22. Charles Sutton and Andrew McCallum. 2004. Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data. In Proceedings of the Twenty-First International Conference on Machine Learning (ICML). Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Charles Sutton and Andrew McCallum. 2006. An introduction to conditional random fields for relational learning. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning. MIT Press. To appear.Google ScholarGoogle Scholar
  24. Dmitry Zelenko, Chinatsu Aone, and Anthony Richardella. 2003. Kernel methods for relation extraction. Journal of Machine Learning Research, 3:1083--1106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Integrating probabilistic extraction models and data mining to discover relations and patterns in text

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image DL Hosted proceedings
        HLT-NAACL '06: Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
        June 2006
        522 pages

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        • Published: 4 June 2006

        Qualifiers

        • Article

        Acceptance Rates

        HLT-NAACL '06 Paper Acceptance Rate62of257submissions,24%Overall Acceptance Rate240of768submissions,31%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader