- H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: An Architecture for Development of Robust HLT Applications. In ACL, pages 168--175, 2002. Google ScholarDigital Library
- D. Ferrucci and A. Lally. UIMA: An Architectural Approach to Unstructured Information Processing in the Corporate Research Environment. Nat. Lang. Eng., 10:327--348, 2004. Google ScholarDigital Library
- A. Y. Halevy. Answering Queries Using Views: A Survey. VLDB Journal, 10(4):270--294, 2001. Google ScholarDigital Library
- A. Jain, A. Doan, and L. Gravano. Optimizing SQL Queries over Text Databases. In ICDE, pages 636--645, 2008. Google ScholarDigital Library
- A. Jain, P. G. Ipeirotis, A. Doan, and L. Gravano. Join Optimization of Information Extraction Output: Quality Matters! In ICDE, pages 186--197, 2009. Google ScholarDigital Library
- A. Jain, P. G. Ipeirotis, and L. Gravano. Building Query Optimizers for Information Extraction: The SQoUT Project. SIGMOD Record, 37(4):28--34, 2008. Google ScholarDigital Library
- F. Reiss, S. Raghavan, R. Krishnamurthy, H. Zhu, and S. Vaithyanathan. An Algebraic Approach to Rule-Based Information Extraction. In ICDE, pages 933--942, 2008. Google ScholarDigital Library
- W. Shen, A. Doan, J. F. Naughton, and R. Ramakrishnan. Declarative Information Extraction Using Datalog with Embedded Extraction Predicates. In VLDB, pages 1033--1044, 2007. Google ScholarDigital Library
Index Terms
- Just-in-time information extraction using extraction views
Recommendations
Systematic Feature Extraction
A systematic feature extraction procedure is proposed. It is based on successive extractions of features. At each stage a dimensionality reduction is made and a new feature is extracted. A specific example is given using the Gaussian minus-log-...
Managing information extraction: state of the art and research directions
SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of dataThis tutorial makes the case for developing a unified framework that manages information extraction from unstructured data (focusing in particular on text). We first survey research on information extraction in the database, AI, NLP, IR, and Web ...
Accurate keyphrase extraction by discriminating overlapping phrases
In this paper we define the document phrase maximality index DPM-index, a new measure to discriminate overlapping keyphrase candidates in a text document. As an application we developed a supervised learning system that uses 18 statistical features, ...
Comments