skip to main content
10.1145/2488388.2488473acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Sorry, i don't speak SPARQL: translating SPARQL queries into natural language

Published:13 May 2013Publication History

ABSTRACT

Over the past years, Semantic Web and Linked Data technologies have reached the backend of a considerable number of applications. Consequently, large amounts of RDF data are constantly being made available across the planet. While experts can easily gather information from this wealth of data by using the W3C standard query language SPARQL, most lay users lack the expertise necessary to proficiently interact with these applications. Consequently, non-expert users usually have to rely on forms, query builders, question answering or keyword search tools to access RDF data. However, these tools have so far been unable to explicate the queries they generate to lay users, making it difficult for these users to i) assess the correctness of the query generated out of their input, and ii) to adapt their queries or iii) to choose in an informed manner between possible interpretations of their input. This paper addresses this drawback by presenting SPARQL2NL, a generic approach that allows verbalizing SPARQL queries, i.e., converting them into natural language. Our framework can be integrated into applications where lay users are required to understand SPARQL or to generate SPARQL queries in a direct (forms, query builders) or an indirect (keyword search, question answering) manner. We evaluate our approach on the DBpedia question set provided by QALD-2 within a survey setting with both SPARQL experts and lay users. The results of the 115 filled surveys show that SPARQL2NL can generate complete and easily understandable natural language descriptions. In addition, our results suggest that even SPARQL experts can process the natural language representation of SPARQL queries computed by our approach more efficiently than the corresponding SPARQL queries. Moreover, non-experts are enabled to reliably understand the content of SPARQL queries.

References

  1. G. Aguado, A. Bañón, John A. Bateman, S. Bernardos, M. Fernández, A. Gómez-Pérez, E. Nieto, A. Olalla, R. Plaza, and A. Sánchez. ONTOGENERATION: Reusing domain and linguistic ontologies for Spanish text generation. In Workshop on Applications of Ontologies and Problem Solving Methods, ECAI'98, 1998.Google ScholarGoogle Scholar
  2. Kalina Bontcheva and Yorick Wilks. Automatic report generation from ontologies: The miakt approach. In NLDB, pages 324--335, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  3. H. Dalianis and E.H. Hovy. Aggregation in natural language generation. In G. Adorni and M. Zock, editors, Trends in natural language generation: an artificial intelligence perspective, volume 1036 of Lecture Notes in Artificial Intelligence, pages 88--105. Springer, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Brian Davis, Ahmad Iqbal, Adam Funk, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, and Siegfried Handschuh. Roundtrip ontology authoring. In ISWC, pages 50--65, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. George Doddington. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In Proceedings of HLT, pages 138--145, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Basil Ell, Denny Vrandecic, and Elena Paslaru Bontas Simperl. Labels in the web of data. In Proceedings of ISWC, volume 7031, pages 162--176. Springer, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Basil Ell, Denny Vrandecic, and Elena Simperl. SPARTIQULATION - Verbalizing SPARQL queries. In Proceedings of ILD Workshop, ESWC2012, 2012.Google ScholarGoogle Scholar
  8. Gunther Fliedl, Christian Kop, and Jurgen Vohringer. Guideline based evaluation and verbalization of owl class and property labels. Data Knowl. Eng., 69(4), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Norbert E. Fuchs. First-order reasoning for attempto controlled english. In CNL, pages 73--94, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Dimitrios Galanis and Ion Androutsopoulos. Generating multilingual descriptions from linguistically annotated owl ontologies: the naturalowl system. In Proceedings of the Eleventh European Workshop on Natural Language Generation, ENLG '07, pages 143--146, Stroudsburg, PA, USA, 2007. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Daniel Gerber and Axel-Cyrille Ngonga Ngomo. Extracting multilingual natural-language patterns for rdf predicates. In EKAW, pages 87--96, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Daniel Gerber and Axel-Cyrille Ngonga Ngomo. Bootstrapping the linked data web. In 1st Workshop on Web Scale Knowledge Extraction @ ISWC 2011, 2011.Google ScholarGoogle Scholar
  13. Normunds Gruzitis, Gunta Nespore, and Baiba Saulite. Verbalizing ontologies in controlled baltic languages. In Inguna Skadina and Andrejs Vasiljevs, editors, Baltic HLT, volume 219 of Frontiers in Artificial Intelligence and Applications, pages 187--194. IOS Press, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Daniel Hewlett, Aditya Kalyanpur, Vladimir Kolovski, and Chris Halaschek-Wiener. Effective natural language paraphrasing of ontologies on the semantic web. In Proceedings of the End User Semantic Web Interaction Workshop (ISWC 2005), 2005.Google ScholarGoogle Scholar
  15. Yannis Ioannidis. From databases to natural language: The unusual direction. In E. Kapetanios, V. Sugumaran, and M. Spiliopoulou, editors, Natural Language and Information Systems, volume 5039 of LNCS, pages 12--16, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Kaarel Kaljurand and Norbert E. Fuchs. Verbalizing OWL in Attempto Controlled English. In Proceedings of Third International Workshop on OWL: Experiences and Directions, Innsbruck, Austria (6th-7th June 2007), volume 258, 2007.Google ScholarGoogle Scholar
  17. G. Koutrika, A. Simitsis, and Y.E. Ioannidis. Explaining structured queries in natural language. In Proceedings of the 26th International Conference on Data Engineering (ICDE), pages 333--344, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  18. Jens Lehmann and Lorenz Bühmann. Autosparql: Let users query your knowledge base. In Proceedings of ESWC 2011, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Jens Lehmann, Daniel Gerber, Mohamed Morsey, and Axel-Cyrille Ngonga Ngomo. Defacto - deep fact validation. In ISWC, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Chris Mellish and Xiantang Sun. The semantic web as a linguistic resource: opportunities for natural language generation. In Twenty-sixth SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  21. H. Piccinini, M. A. Casanova, A. L. Furtado, and B. P. Nunes. Verbalization of rdf triples with applications. In ISWC - Outrageous Ideas track, 2011.Google ScholarGoogle Scholar
  22. Aleksander Pohl. The polish interface for linked open data. In Proceedings of the ISWC 2010 Posters & Demonstrations Track, pages 165--168, 2011.Google ScholarGoogle Scholar
  23. Ana-Maria Popescu, Oren Etzioni, and Henry Kautz. Towards a theory of natural language interfaces to databases. In Proceedings of the 8th international conference on Intelligent user interfaces, IUI '03, pages 149--157, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Ehud Reiter and Robert Dale. Building natural language generation systems. Cambridge University Press, New York, NY, USA, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Saeedeh Shekarpour, Soren Auer, Axel-Cyrille Ngonga Ngomo, Daniel Gerber, Sebastian Hellmann, and Claus Stadler. Keyword-driven sparql query generation leveraging background knowledge. In ACM/IEEE WI, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Xiantang Sun and Chris Mellish. An experiment on "free generation" from single rdf triples. In Proceedings of the Eleventh European Workshop on Natural Language Generation, ENLG '07, pages 105--108, Stroudsburg, PA, USA, 2007. Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Allan Third, Sandra Williams, and Richard Power. Owl to english: a tool for generating organised easily-navigated hypertexts from ontologies. In ISWC Poster and Demo Track, 2011.Google ScholarGoogle Scholar
  28. Christina Unger, Lorenz Buhmann, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Daniel Gerber, and Philipp Cimiano. Template-based question answering over RDF data. In Proceedings of WWW, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Graham Wilcock. Talking OWLs: Towards an Ontology Verbalizer. In Human Language Technology for the Semantic Web and Web Services, Workshop at ISWC 2003, pages 109--112, 2003.Google ScholarGoogle Scholar
  30. Graham Wilcock and Kristiina Jokinen. Generating Responses and Explanations from RDF/XML and DAML+OIL, 2003.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    WWW '13: Proceedings of the 22nd international conference on World Wide Web
    May 2013
    1628 pages
    ISBN:9781450320351
    DOI:10.1145/2488388

    Copyright © 2013 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 13 May 2013

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    WWW '13 Paper Acceptance Rate125of831submissions,15%Overall Acceptance Rate1,899of8,196submissions,23%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader