Elsevier

Journal of Web Semantics

Volume 30, January 2015, Pages 3-21
Journal of Web Semantics

Global machine learning for spatial ontology population

https://doi.org/10.1016/j.websem.2014.06.001Get rights and content

Abstract

Understanding spatial language is important in many applications such as geographical information systems, human computer interaction or text-to-scene conversion. Due to the challenges of designing spatial ontologies, the extraction of spatial information from natural language still has to be placed in a well-defined framework. In this work, we propose an ontology which bridges between cognitive–linguistic spatial concepts in natural language and multiple qualitative spatial representation and reasoning models. To make a mapping between natural language and the spatial ontology, we propose a novel global machine learning framework for ontology population. In this framework we consider relational features and background knowledge which originate from both ontological relationships between the concepts and the structure of the spatial language. The advantage of the proposed global learning model is the scalability of the inference, and the flexibility for automatically describing text with arbitrary semantic labels that form a structured ontological representation of its content. The machine learning framework is evaluated with SemEval-2012 and SemEval-2013 data from the spatial role labeling task.

Introduction

An essential function of natural language is to talk about the location and translocation of objects in space. Understanding spatial language is important in many applications such as geographical information systems (GIS), human computer interaction, text-to-scene conversion, and representation and extraction of spatial information from web resources such as travelers blogs or websites about tourism. Due to the complexity of spatial primitives and notions, and the challenges of designing ontologies for formal spatial representation, the extraction of the spatial semantics from natural language still has to be placed in a well-defined framework.

We have two main contributions toward solving this problem. The first contribution is that we propose a spatial ontology based on two layers of semantics. This ontology is based on a previously proposed spatial annotation scheme by the authors  [1]. Its first layer is based on commonly accepted cognitive spatial notions and the second is based on multiple well-known qualitative spatial reasoning models. An automatic mapping to such an ontology bridges between natural language and qualitative spatial representation and reasoning models, which makes automatic spatial reasoning based on spatial information in linguistic expressions feasible. This ontology can be integrated in larger ontologies, for example, to represent spatial meaning in unstructured data in the context of the Semantic Web.

The second contribution of this work is that we propose a novel global supervised machine learning model for spatial ontology population. For this supervised learning framework, we build rich annotated corpora and an evaluation scheme. We point to the linguistic features and structural characteristics of spatial language that aid the use of machine learning. We view ontology population as a means for creating meaning representations from text. In this model the segments of the input text are described by semantic abstractions or concepts and their relationships defined by the ontology, which form the output space of the learning problem  [2]. In the proposed global learning framework, the ontology components including spatial roles and their relations, and multiple formal semantic types are learned while taking into account the ontological constraints and the structural characteristics of the spatial language.

Learning a model that considers the global correlations between the output components usually becomes computationally complex. To deal with the complexity in training and prediction phases, we use an efficient inference approach based upon combinatorial optimization techniques for both phases. This approach can deal with a large number of variables and constraints, and makes building a structured machine learning model for ontology population, feasible.

We decompose the learning problem into simpler problems that are jointly optimized. We propose a technique which we call communicative inference based on the ideas of alternating optimization for solving smaller subproblems of the main objective function  [3]. Each subproblem is solved by using linear programming (LP) solvers and the subproblems communicate to each other by passing the local solutions. We show that the suggested framework is beneficial compared to local learning as well as compared to pipelining the independently learned models for the concepts in the ontology. The proposed inference approach makes the global learning scalable.

The application of the global machine learning model for ontology population is not limited to the extraction of spatial semantics; it could be used to populate any ontology. Moreover, due to decomposing the ontology to its solvable parts, this approach is scalable to be applied for approximate global learning for large ontologies of the Semantic Web. We argue therefore that this work is an important step towards automatically describing text with semantic labels that form a structured ontological representation of the content.

Our extensive experimental study using the spatial ontology indicates the advantage of global learning while considering ontological constraints and structural characteristics of the spatial language compared to learning local models for the various parts of the ontology independently. The experiments are performed using the corpora provided by the SemEval-2012 and SemEval-2013 shared task on spatial role labeling.

In Section  2, we provide the problem definition and the spatial ontology population task in its two layers of semantics. In Section  3, we discuss the features and constraints that are useful for learning the spatial ontology population. A background to structured learning is provided in Section  4. The proposed structured learning model for spatial ontology population is described in Section  5. The proposed inference approach is explained in Section  6. Section  7 specifies the details of the components of the spatial ontology population model. The various designed local and global models are clarified in Section  8. Section  9 presents the experimental results. An overview of the related research is provided in Section  10. We draw conclusions, set our work in a broader context, and point to the future extensions in Section  11.

Section snippets

General problem definition

We define a framework for mapping natural language to spatial ontologies. Although pragmatic, our proposed framework is based on the theoretical cognitive and linguistic foundations, as well as on cognitively adequate formal spatial models. The task is formulated as an ontology population to be performed via supervised machine learning models. We aim at learning to assign the segments in the sentence to the concepts in the ontology. The considered concepts form a light weight ontology which is

Constraints and features for the machine learning models

As in other computational linguistic tasks, the lexical, syntactic and semantic features of language can help with the extraction of spatial semantics. There is also linguistic and commonsense background knowledge on the spatial language to be exploited when designing an intelligent model for automatic spatial semantic extraction. In this section we aim to specify all types of information that can be useful for the machine learning models that we design. We divide these characteristics in two

Structured learning setting

In learning models for structured output prediction, given a set of N input–output pairs of training examples E={(xi,yi)X×Y:i=1N}, we learn an objective function g(x,y;W) which is a linear discriminant function defined over the combined feature representation of the inputs and outputs denoted by f(x,y)   [23]: g(x,y;W)=W,f(x,y).W denotes a weight vector and , denotes a dot product between two vectors. A popular discriminative training approach is to minimize the following convex upper

Link-And-Label model

We aim to provide a simple and useful abstraction for designing global structured learning models for ontology population from text that is easily integrated in the above non-probabilistic structured output prediction models. We specify the learning components including input, output, joint feature function, global constraints, loss and inference in a framework which we name Link-And-Label (LAL) framework. The Link-And-Label name is inspired by the conceptualization process that a human does

Communicative inference

Solving the LAL objective function, given in Eq. (2), during training of the model can become highly inefficient for most relational data domains. This is because the objective function and the constraints are in fact expressed in a first order representation (i.e. templates and types), and the corresponding ontologies or output label structures often produce a large number of output labels and constraints when instantiated for each training example. To solve this problem we propose an

Model specification

In this section, we formulate the problem of mapping natural language to spatial ontologies. We represent the supervised structured learning model designed for solving this problem using the Link-And-Label model described in Section  5 and specify: (a) the input components and types; (b) the output single labels, linked labels and global constraints over the output structure; (c) the joint feature templates, candidate generation for the templates and the main objective function.

Local–global training and prediction models

In this section we collect the required pieces from the last sections and discuss the model variations belonging to the spectrum of local and global training and prediction models that we design.

The global loss augmented objective function of our problem is built by adding the components of Eqs. (19), (20). We train the parameters W of the function g in the framework of discriminative inference-based structured prediction models such as structured SVM, structured perceptron and average

Experiments

For the extraction of the linguistic features we use the LTH2 tool that produces features in the CoNLL-08 format3 The applied machine learning techniques are the structured SVM using the SVM-struct Matlab wrapper  [34] (coded as SSVM) and our implementation of the structured perceptron (coded as SPerc) and the averaged structured perceptron (coded as AvGSPerc). For local learning settings a

Related work

The ontology we use in this work is based on the spatial annotation scheme that we have proposed in a previous work  [1]. We discuss the two layers of semantics and the adequacy of mapping to qualitative spatial representation and reasoning models in [12], [8], [9]. We have previously developed machine learning models, but they were restricted to the annotation of text with the concepts of the SpRL layer  [10], [38]. The SpRL layer has been worked out by the participants of a semantic

Conclusions

We have proposed a framework for representing the spatial semantics in natural language in terms of multiple calculi models. Moreover, a novel structured machine learning framework for mapping natural language to ontologies is provided. We propose a framework that we call Link-And-Label which is able to deal with relational data both in the input and in the output and is able to consider ontological relationships and background knowledge about the task during training and prediction. Using the

Acknowledgments

The research was funded by the KU Leuven grant DBOF/08/043, the EU FP7-296703 project MUSE (Machine Understanding for interactive Story tElling) and by the KU Leuven Postdoctoral grant PDMK/13/115.

References (53)

  • L.A. Carlson et al.

    The space in spatial language

    J. Mem. Lang.

    (2004)
  • B. Kuipers

    The spatial semantic hierarchy

    Artif. Intell.

    (2000)
  • Y. Cao et al.

    A structural support vector method for extracting contexts and answers of questions from online forums

    Inf. Process. Manage.

    (2011)
  • P. Kordjamshidi, M. van Otterlo, M.F. Moens, Spatial role labeling: task definition and annotation scheme, in: N....
  • G. Petasis et al.

    Knowledge-Driven Multimedia Information Extraction and Ontology Evolution

    (2011)
  • J.C. Bezdek et al.

    Some notes on alternating optimization

  • J. Hois et al.

    Natural language meets spatial calculi

  • J. Renz et al.

    Qualitative spatial reasoning using constraint calculi

  • J.A. Bateman

    Language and space: a two-level semantic approach based on principles of ontological engineering

    Int. J. Speech Technol.

    (2010)
  • P. Kordjamshidi, M. van Otterlo, M.F. Moens, From language towards formal spatial calculi, in: R. J. Ross, J. Hois, J....
  • P. Kordjamshidi, J. Hois, M. van Otterlo, M.F. Moens, Machine learning for interpretation of spatial natural language...
  • P. Kordjamshidi et al.

    Spatial role labeling: towards extraction of spatial relations from natural language

    ACM Trans. Speech Lang. Process.

    (2011)
  • A. Galton

    Spatial and temporal knowledge representation

    J. Earth Sci Inform.

    (2009)
  • P. Kordjamshidi et al.

    Learning to interpret spatial natural language in terms of qualitative spatial relations

  • M. MacMahon, B. Stankiewicz, B. Kuipers, Walk the talk: connecting language, knowledge, and action in route...
  • W. Wong et al.

    Ontology learning from text: a look back and into the future

    ACM Comput. Surv.

    (2012)
  • J. Zlatev, Holistic spatial semantics of Thai, Cognitive Linguistics and Non-Indo-European Languages (2003)...
  • J. Zlatev

    Spatial semantics

  • P. Kordjamshidi

    Structured machine learning for mapping natural language to spatial ontologies

    (2013)
  • I. Mani et al.

    Interpreting Motion: Grounded Representations for Spatial Language, Explorations in language and space

    (2012)
  • D.A. Randell, Z. Cui, A.G. Cohn, A spatial logic based on regions and connection, in: Proceedings of the 3rd...
  • A. Klippel et al.

    The endpoint hypothesis: a topological-cognitive assessment of geographic scale movement patterns

  • M.W. Chang et al.

    Structured learning with constrained conditional models

    Mach. Learn.

    (2012)
  • I. Tsochantaridis et al.

    Large margin methods for structured and interdependent output variables

    J. Mach. Learn. Res.

    (2006)
  • M. Collins

    Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms

  • M. Collins

    Parameter estimation for statistical parsing models: theory and practice of distribution-free methods

  • Cited by (44)

    • ISEE: A heterogeneous information system for event explainability in smart connected environments

      2021, Internet of Things (Netherlands)
      Citation Excerpt :

      These two instances could be extracted, for example, from the employee record of the building office. Two main approaches are used in the literature for ontology instantiation: (1) algorithmic/heuristic based approaches [10–12] which usually involve NLP techniques, predefined mapping rules and human intervention for control and validation; (2) machine learning-based approaches [13–15] which also use NLP techniques associated with annotation methods from the semantic web. In our context, we used an algorithm-based approach, further details are presented in Section 4.

    • Ontology and rule-based natural language processing approach for interpreting textual regulations on underground utility infrastructure

      2021, Advanced Engineering Informatics
      Citation Excerpt :

      Two types of mappings – pre-learned and hand-crafted – were used to map spatial indicators to spatial relation types. The pre-learned mapping relies on the connections between the spatial language (mostly spatial prepositions) to formal spatial relations learned through ML[48], e.g., the match between the spatial prepositions of “under” or “below” to the spatial relation of “Below”. The hand-crafted mapping is domain-specific.

    • Understanding spatial language in radiology: Representation framework, annotation, and spatial relation extraction from chest X-ray reports using deep learning

      2020, Journal of Biomedical Informatics
      Citation Excerpt :

      Motivated by this, we utilize contextualized embedding models based on transformers by applying BERT- and XLNet-based models for extracting spatial relations. Understanding spatial relations relies on the syntactic structure of a sentence as demonstrated in previous works where various syntactic features and rules based on lexico-syntactic patterns and syntactic parse trees were employed [14–17]. BERT [18], based on a deep bi-directional transformer architecture, encodes rich linguistic information in a hierarchical manner with syntactic features in the middle layers.

    View all citing articles on Scopus
    View full text