
2020 | Book

A Practical Guide to Hybrid Natural Language Processing

Combining Neural Models and Knowledge Graphs for NLP


About this book

This book provides readers with a practical guide to the principles of hybrid approaches to natural language processing (NLP) involving a combination of neural methods and knowledge graphs. To this end, it first introduces the main building blocks and then describes how they can be integrated to support the effective implementation of real-world NLP applications. To illustrate the ideas described, the book also includes a comprehensive set of experiments and exercises involving different algorithms over a selection of domains and corpora in various NLP tasks.

Throughout, the authors show how to leverage complementary representations stemming from the analysis of unstructured text corpora as well as the entities and relations described explicitly in a knowledge graph, how to integrate such representations, and how to use the resulting features to effectively solve NLP tasks in a range of domains. In addition, the book offers access to executable code with examples, exercises and real-world applications in key domains, like disinformation analysis and machine reading comprehension of scientific literature. All the examples and exercises proposed in the book are available as executable Jupyter notebooks in a GitHub repository. They are all ready to be run on Google Colaboratory or, if preferred, in a local environment.

A valuable resource for anyone interested in the interplay between neural and knowledge-based approaches to NLP, this book is a useful guide for readers with a background in structured knowledge representations as well as those whose main approach to AI is fundamentally based on logic. Further, it will appeal to those whose main background is in machine and deep learning and who are looking for ways to leverage structured knowledge bases to improve results on downstream NLP tasks.

Table of Contents

Frontmatter

Preliminaries and Building Blocks

Frontmatter
Chapter 1. Hybrid Natural Language Processing: An Introduction
Abstract
The proliferation of knowledge graphs and recent advances in artificial intelligence have raised great expectations related to the combination of symbolic and data-driven approaches in cognitive tasks. This is particularly the case for knowledge-based approaches to natural language processing, as near-human symbolic understanding relies on expressive, structured knowledge representations. Engineered by humans, knowledge graphs are frequently well curated and of high quality, but they can also be labor-intensive to build, rely on rigid formalisms, and sometimes be biased towards the specific viewpoint of their authors. This book aims to provide the reader with means to address such limitations by bringing together bottom-up, data-driven models and top-down, structured knowledge graphs. To this end, the book explores how to reconcile both views and enrich the resulting representations beyond the possibilities of each individual approach. Throughout this book, we delve into this idea and show how such a hybrid approach can be used with great effectiveness in a variety of natural language processing tasks.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
Chapter 2. Word, Sense, and Graph Embeddings
Abstract
Distributed word representations in the form of dense vectors, known as word embeddings, are the basic building blocks of machine learning-based natural language processing. Such embeddings play an important role in tasks such as part-of-speech tagging, chunking, named entity recognition, and semantic role labeling, as well as in downstream tasks including sentiment analysis and, more generally, text classification. However, early word embeddings were static, context-independent representations that fail to capture the multiple meanings of polysemous words. This chapter presents an overview of such traditional word embeddings, as well as of alternative approaches that produce sense and concept embeddings using disambiguated corpora or directly from knowledge graphs. As a result, this chapter serves as a conceptual framework for the rest of the book.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
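
As a quick illustration of the static embeddings this chapter surveys, the following minimal sketch trains word2vec on a toy corpus using the gensim library; the corpus and hyperparameters here are illustrative assumptions, not taken from the book:

    # Minimal sketch: static word embeddings with gensim's word2vec.
    from gensim.models import Word2Vec

    corpus = [
        ["knowledge", "graphs", "encode", "structured", "knowledge"],
        ["word", "embeddings", "encode", "distributional", "knowledge"],
    ]
    model = Word2Vec(sentences=corpus, vector_size=50, window=2,
                     min_count=1, epochs=50)

    # Every occurrence of "knowledge" maps to the same vector, regardless
    # of context -- the polysemy limitation discussed in the abstract.
    print(model.wv["knowledge"][:5])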
Chapter 3. Understanding Word Embeddings and Language Models
Abstract
Early word embedding algorithms like word2vec and GloVe generate static distributional representations for words regardless of the context and sense in which each word is used in a given sentence, offering poor modeling of ambiguous words and lacking coverage for out-of-vocabulary words. Hence, a new wave of algorithms based on training language models, such as OpenAI GPT and BERT, has been proposed to generate contextual word embeddings. These models take word constituents as input, allowing them to generate representations for out-of-vocabulary words by combining their word pieces. Recently, fine-tuning language models pre-trained on large corpora has consistently advanced the state of the art for many NLP tasks.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
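
To make the contrast with static embeddings concrete, here is a minimal sketch, assuming the Hugging Face transformers library, that produces contextual word-piece embeddings with BERT; note how "bank" receives a different vector in each sentence:

    # Minimal sketch: contextual embeddings from a pre-trained BERT model.
    from transformers import AutoTokenizer, AutoModel
    import torch

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModel.from_pretrained("bert-base-uncased")

    sentences = ["The bank approved the loan.", "We sat on the river bank."]
    for text in sentences:
        inputs = tokenizer(text, return_tensors="pt")
        with torch.no_grad():
            outputs = model(**inputs)
        # One contextual vector per word piece; "bank" gets a different
        # representation in each sentence.
        print(tokenizer.tokenize(text), outputs.last_hidden_state.shape)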
Chapter 4. Capturing Meaning from Text as Word Embeddings
Abstract
This chapter provides a hands-on guide to learning word embeddings from text corpora. To this end we choose Swivel, whose extension is the basis for the Vecsigrafo algorithm, described in Chap. 6. As introduced in Chap. 2, word embedding algorithms like Swivel are not contextual, i.e. they do not provide different representations for the different meanings a polysemous word may have. As we will see in subsequent chapters, this can be addressed in a variety of ways. For the purpose of this chapter, we focus on a basic way to represent words using embeddings.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
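
Swivel itself factorizes a matrix of co-occurrence statistics; the following simplified sketch illustrates the underlying count-based idea with a PPMI matrix and a truncated SVD. This is an approximation for illustration only, not the book's Swivel code:

    # Simplified sketch of count-based embedding learning: build a word
    # co-occurrence matrix, weight it by positive PMI, and factorize it.
    import numpy as np

    corpus = "the cat sat on the mat the dog sat on the rug".split()
    vocab = sorted(set(corpus))
    idx = {w: i for i, w in enumerate(vocab)}

    # Symmetric window-1 co-occurrence counts.
    counts = np.zeros((len(vocab), len(vocab)))
    for a, b in zip(corpus, corpus[1:]):
        counts[idx[a], idx[b]] += 1
        counts[idx[b], idx[a]] += 1

    # Positive PMI weighting, then a rank-2 factorization as embeddings.
    total = counts.sum()
    pw = counts.sum(axis=1) / total
    pmi = np.log((counts / total + 1e-12) / (np.outer(pw, pw) + 1e-12))
    ppmi = np.maximum(pmi, 0)
    u, s, _ = np.linalg.svd(ppmi)
    embeddings = u[:, :2] * np.sqrt(s[:2])  # one 2-d vector per word
    print(dict(zip(vocab, embeddings.round(2))))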
Chapter 5. Capturing Knowledge Graph Embeddings
Abstract
In this chapter we focus on knowledge graph embeddings, an approach to producing embeddings for the concepts and names that constitute the main nodes of a knowledge graph, as well as for the relations between them. The resulting embeddings aim to capture the knowledge encoded in the structure of the graph, in terms of how nodes are related to one another. This technique translates the symbolic representation of a graph into a format that simplifies manipulation without altering the graph's inherent structure. Several algorithms for creating knowledge graph embeddings have been proposed. In this chapter we give a brief overview of the most important models and the libraries and tools that implement them. Lastly, we select one such approach, HolE, and provide practical guidance for learning embeddings based on WordNet.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
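
For orientation, HolE scores a triple via the circular correlation of its entity embeddings; a minimal sketch of the scoring function, with random vectors standing in for trained embeddings, looks as follows:

    # Minimal sketch of HolE's scoring function: the plausibility of a
    # triple (subject, relation, object) is the sigmoid of the relation
    # vector dotted with the circular correlation of the entity vectors,
    # which can be computed efficiently with the FFT.
    import numpy as np

    def circular_correlation(a, b):
        return np.fft.ifft(np.conj(np.fft.fft(a)) * np.fft.fft(b)).real

    rng = np.random.default_rng(0)
    dim = 8
    e_subj, e_obj, r = (rng.normal(size=dim) for _ in range(3))

    score = 1.0 / (1.0 + np.exp(-r @ circular_correlation(e_subj, e_obj)))
    print(score)  # training pushes this towards 1 for true triples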

Combining Neural Architectures and Knowledge Graphs

Frontmatter
Chapter 6. Building Hybrid Representations from Text Corpora, Knowledge Graphs, and Language Models
Abstract
In the previous chapter we saw how knowledge graph embedding algorithms can capture the structured knowledge about concepts and relations in a graph as embeddings in a vector space, which can then be used in downstream tasks. However, such approaches can only capture the knowledge that is explicitly represented in the graph, and therefore lack recall and domain coverage. In this chapter, we focus on algorithms that address this limitation by combining information from unstructured text corpora and structured knowledge graphs. The first approach is Vecsigrafo, which jointly learns word, lemma, and concept embeddings from large disambiguated corpora, bringing together textual and symbolic knowledge representations in a single, unified formalism for use in neural natural language processing architectures. The second and more recent approach, called Transigrafo, adopts Transformer-based language models to derive concept-level contextual embeddings, providing state-of-the-art performance in word-sense disambiguation with reduced complexity.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
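
To convey the core idea, the following illustrative sketch (not the book's implementation, which extends Swivel) learns a single embedding space over surface forms, lemmas, and concept identifiers from a toy disambiguated corpus; the lem_ and wn31_ identifiers below are made up for the example:

    # Illustrative sketch: a disambiguated corpus interleaves surface
    # forms, lemmas, and concept IDs so that one embedding space covers
    # all three token types. Identifiers here are hypothetical.
    from gensim.models import Word2Vec

    disambiguated_corpus = [
        ["banks", "lem_bank", "wn31_bank.n.01",
         "lend", "lem_lend", "wn31_lend.v.01", "money"],
        ["rivers", "lem_river", "wn31_river.n.01",
         "have", "lem_have", "wn31_bank.n.09", "banks"],
    ]
    model = Word2Vec(disambiguated_corpus, vector_size=50, window=5,
                     min_count=1, epochs=100)
    # Words, lemmas, and concepts now live in the same vector space.
    print(model.wv.most_similar("wn31_bank.n.01", topn=3))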
Chapter 7. Quality Evaluation
Abstract
In the previous chapters we have discussed various methods for generating embeddings for both words and concepts. Once you have applied some embedding learning mechanism, you may wonder how good the resulting embeddings are. In this chapter we look at methods for assessing their quality: from visualizations, to intrinsic evaluations such as predicting alignment with human-rated word similarity, to extrinsic evaluations based on downstream tasks. As in the previous chapters, we provide hands-on practical sections for gaining experience in applying these evaluation methods. We also discuss the methodology and results of a real-world evaluation comparing Vecsigrafo against various other methods, which gives a sense of how thorough real-world evaluations can be performed.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
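
A typical intrinsic evaluation correlates model similarities with human judgements; the following minimal sketch computes a Spearman correlation over word pairs and ratings in the style of WordSim-353-type benchmarks (the pairs, ratings, and vectors below are made up):

    # Minimal sketch: intrinsic evaluation via word-similarity correlation.
    from scipy.stats import spearmanr
    import numpy as np

    def cosine(u, v):
        return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

    # (word1, word2, human_rating) -- hypothetical entries
    pairs = [("car", "automobile", 9.2), ("car", "banana", 1.1),
             ("cat", "dog", 7.5)]
    rng = np.random.default_rng(0)
    vectors = {w: rng.normal(size=50) for pair in pairs for w in pair[:2]}

    model_scores = [cosine(vectors[a], vectors[b]) for a, b, _ in pairs]
    human_scores = [h for _, _, h in pairs]
    rho, _ = spearmanr(model_scores, human_scores)
    print(f"Spearman correlation with human judgements: {rho:.2f}")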
Chapter 8. Capturing Lexical, Grammatical, and Semantic Information with Vecsigrafo
Abstract
Embedding algorithms work by optimizing the distance between a word and its context(s), generating an embedding space that encodes their distributional representation. In addition to single words or word pieces, other features, resulting from a deeper analysis of the text, can be used to enrich such representations with additional information. These features are influenced by the tokenization strategy used to chunk the text and can include not only lexical and part-of-speech information but also annotations about the disambiguated sense of a word according to a structured knowledge graph. In this chapter we analyze the impact that explicitly adding lexical, grammatical, and semantic information during the training of Vecsigrafo has on the resulting representations, and whether this can enhance downstream performance. To illustrate this analysis we focus on corpora from the scientific domain, where rich, multi-word expressions are frequent and therefore require advanced tokenization strategies.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
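
As a small taste of such tokenization strategies, the following sketch uses gensim's Phrases module (one possible tool, not necessarily the one used in the chapter) to merge a frequent multi-word expression into a single token before embedding training:

    # Minimal sketch: detect multi-word expressions so that, e.g.,
    # "knowledge graph" becomes one token prior to training.
    from gensim.models.phrases import Phrases

    sentences = [
        ["knowledge", "graph", "embeddings", "capture", "structure"],
        ["a", "knowledge", "graph", "encodes", "entities"],
        ["the", "knowledge", "graph", "is", "curated"],
    ]
    phrases = Phrases(sentences, min_count=1, threshold=0.1)
    print(phrases[sentences[0]])  # ['knowledge_graph', 'embeddings', ...]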
Chapter 9. Aligning Embedding Spaces and Applications for Knowledge Graphs
Abstract
In previous chapters we have seen a variety of ways to train models that derive embedding spaces for words, concepts, and other nodes in knowledge graphs. As you often do not have control over the full training procedure, you may find yourself with several embedding spaces whose vocabularies (conceptually) overlap. How can you best combine such embedding spaces? In this chapter we look at various techniques for aligning disparate embedding spaces. This is particularly useful in hybrid settings, such as when using embedding spaces for knowledge graph curation and interlinking.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
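
One widely used alignment technique is orthogonal Procrustes; the following minimal sketch, on synthetic data, recovers the rotation that maps one embedding space onto another given matching rows for a shared vocabulary:

    # Minimal sketch: orthogonal Procrustes alignment of two embedding
    # spaces X and Y whose rows correspond to the same shared vocabulary.
    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 50))            # space A, rows = shared vocab
    true_rotation, _ = np.linalg.qr(rng.normal(size=(50, 50)))
    Y = X @ true_rotation                     # space B, same words, rotated

    # W = U V^T, where U S V^T = SVD(X^T Y), minimizes ||X W - Y||
    # over all orthogonal matrices W.
    u, _, vt = np.linalg.svd(X.T @ Y)
    W = u @ vt
    print(np.allclose(X @ W, Y, atol=1e-8))   # True: spaces are aligned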

Applications

Frontmatter
Chapter 10. A Hybrid Approach to Disinformation Analysis
Abstract
Disinformation and fake news are complex and pressing problems in which natural language processing can play an important role in helping people navigate online content. In this chapter, we provide several practical tutorials where we apply hybrid NLP techniques introduced in earlier chapters, involving neural models and knowledge graphs, to build prototypes that address some of the key issues posed by disinformation.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
Chapter 11. Jointly Learning Text and Visual Information in the Scientific Domain
Abstract
In this chapter we address multi-modality in domains where not only text but also images or, as we will see next, scientific figures and diagrams are important sources of information for the task at hand. Compared to natural images, understanding scientific figures is particularly hard for machines. However, there is a valuable source of information in scientific literature that has until now remained untapped: the correspondence between a figure and its caption. In this chapter we show what can be learnt by looking at a large number of figures and reading their captions, and describe a figure-caption correspondence learning task that makes use of this observation. Training visual and language networks without supervision other than pairs of unconstrained figures and captions is shown to successfully solve this task. We also follow up on previous chapters and illustrate how transferring lexical and semantic knowledge from a knowledge graph significantly enriches the resulting features. Finally, the positive impact of such hybrid, semantically enriched features is demonstrated in two transfer learning experiments involving scientific text and figures: multi-modal classification and machine comprehension for question answering.
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
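
Schematically, the figure-caption correspondence task can be framed as two encoders trained with a contrastive objective; the following PyTorch sketch uses stand-in encoders and random data, and is our simplification rather than the architecture described in the chapter:

    # Schematic sketch: two-tower figure/caption matching with a
    # contrastive objective. Encoders and data here are placeholders.
    import torch
    import torch.nn as nn

    figure_encoder = nn.Sequential(nn.Flatten(), nn.Linear(32 * 32, 64))
    caption_encoder = nn.EmbeddingBag(num_embeddings=1000, embedding_dim=64)

    figures = torch.randn(8, 1, 32, 32)            # a batch of 8 figures
    captions = torch.randint(0, 1000, (8, 12))     # their captions (token ids)

    f = nn.functional.normalize(figure_encoder(figures), dim=1)
    c = nn.functional.normalize(caption_encoder(captions), dim=1)
    logits = f @ c.T                               # pairwise similarities
    # The correct caption for figure i is caption i, so matching pairs
    # should score higher than all mismatches in the batch.
    loss = nn.functional.cross_entropy(logits, torch.arange(8))
    print(loss.item())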
Chapter 12. Looking into the Future of Natural Language Processing
Abstract
It has been a long journey, from theory to methods to code. We hope the book took you through experiments and real NLP exercises as painlessly as possible, and that you enjoyed it as much as we did. Now, it is time to wrap up. Here, we provide guidelines for future directions in hybrid natural language processing and share our final remarks, additional thoughts, and vision. As a bonus, we also include the personal views of a selection of experts on topics related to hybrid natural language processing. We asked them to comment on their vision, foreseeable barriers to achieving it, and ways to navigate towards it, including opportunities and challenges in promising research fields and areas of industrial application. Now it is up to you. Hopefully this book gave you the necessary tools to build powerful NLP systems. Use them!
Jose Manuel Gomez-Perez, Ronald Denaux, Andres Garcia-Silva
Backmatter
Metadata
Title
A Practical Guide to Hybrid Natural Language Processing
Authors
Jose Manuel Gomez-Perez
Ronald Denaux
Andres Garcia-Silva
Copyright Year
2020
Electronic ISBN
978-3-030-44830-1
Print ISBN
978-3-030-44829-5
DOI
https://doi.org/10.1007/978-3-030-44830-1
