Skip to main content
Top

2025 | OriginalPaper | Chapter

Advancing Robotic Perception with Perceived-Entity Linking

Authors : Mark Adamik, Romana Pernisch, Ilaria Tiddi, Stefan Schlobach

Published in: The Semantic Web – ISWC 2024

Publisher: Springer Nature Switzerland

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The capabilities of current robotic applications are significantly constrained by their limited ability to perceive and understand their surroundings. The Semantic Web aims to offer general, machine-readable knowledge about the world and could be a potential solution to address the information needs of robotic agents. We introduce the Perceived-Entity Linking (PEL) problem as the task of recognizing entities and linking the sensory data of an autonomous agent to a unique identifier in a target knowledge graph. We provide a formal definition of PEL, and propose a PEL baseline based on the YOLO object detection algorithm and a conventional entity linking method as an initial attempt to solve the task. The baseline is evaluated by linking the concepts contained in MS COCO and VisualGenome datasets to WikiData, DBpedia and YAGO as target knowledge graphs. This study makes a first step in allowing robotic agents to leverage the extensive knowledge contained in general-purpose knowledge graphs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The results of the experiment, as well as the implementation of the architecture can be openly accessed online https://​github.​com/​Dorteel/​pel.
 
Literature
1.
go back to reference Aguado, E., Sanz, R.: Using ontologies in autonomous robots engineering. Robot. Softw. Des. Eng. 71 (2021) Aguado, E., Sanz, R.: Using ontologies in autonomous robots engineering. Robot. Softw. Des. Eng. 71 (2021)
5.
go back to reference Beetz, M., Balint-Benczedi, F., Blodow, N., Nyga, D., Wiedemeyer, T., Marton, Z.: Robosherlock: unstructured information processing for robot perception. In: IEEE International Conference on Robotics and Automation. ICRA 2015, Seattle, WA, USA, May 26–30 2015, pp. 1549–1556. IEEE (2015). https://doi.org/10.1109/ICRA.2015.7139395 Beetz, M., Balint-Benczedi, F., Blodow, N., Nyga, D., Wiedemeyer, T., Marton, Z.: Robosherlock: unstructured information processing for robot perception. In: IEEE International Conference on Robotics and Automation. ICRA 2015, Seattle, WA, USA, May 26–30 2015, pp. 1549–1556. IEEE (2015). https://​doi.​org/​10.​1109/​ICRA.​2015.​7139395
6.
go back to reference Beetz, M., Beßler, D., Haidu, A., Pomarlan, M., Bozcuoglu, A.K., Bartels, G.: Know rob 2.0 - a 2nd generation knowledge processing framework for cognition-enabled robotic agents. In: 2018 IEEE International Conference on Robotics and Automation. ICRA 2018, Brisbane, Australia, 21–25 May 2018, pp. 512–519. IEEE (2018). https://doi.org/10.1109/ICRA.2018.8460964 Beetz, M., Beßler, D., Haidu, A., Pomarlan, M., Bozcuoglu, A.K., Bartels, G.: Know rob 2.0 - a 2nd generation knowledge processing framework for cognition-enabled robotic agents. In: 2018 IEEE International Conference on Robotics and Automation. ICRA 2018, Brisbane, Australia, 21–25 May 2018, pp. 512–519. IEEE (2018). https://​doi.​org/​10.​1109/​ICRA.​2018.​8460964
7.
go back to reference Berners-Lee, T., Hendler, J.A., Lassila, O.: The semantic web: a new form of web content that is meaningful to computers will unleash a revolution of new possibilities. In: Seneviratne, O., Hendler, J.A. (eds.) Linking the World’s Information - Essays on Tim Berners-Lee’s Invention of the World Wide Web, ACM Books, vol. 52, pp. 91–103. ACM (2023). https://doi.org/10.1145/3591366.3591376 Berners-Lee, T., Hendler, J.A., Lassila, O.: The semantic web: a new form of web content that is meaningful to computers will unleash a revolution of new possibilities. In: Seneviratne, O., Hendler, J.A. (eds.) Linking the World’s Information - Essays on Tim Berners-Lee’s Invention of the World Wide Web, ACM Books, vol. 52, pp. 91–103. ACM (2023). https://​doi.​org/​10.​1145/​3591366.​3591376
10.
go back to reference Brachman, R.J., Levesque, H.J.: Toward a new science of common sense. In: Thirty-Sixth AAAI Conference on Artificial Intelligence. AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelfth Symposium on Educational Advances in Artificial Intelligence. EAAI 2022 Virtual Event, 22 February–1 March 2022, pp. 12245–12249. AAAI Press (2022). https://doi.org/10.1609/AAAI.V36I11.21485 Brachman, R.J., Levesque, H.J.: Toward a new science of common sense. In: Thirty-Sixth AAAI Conference on Artificial Intelligence. AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelfth Symposium on Educational Advances in Artificial Intelligence. EAAI 2022 Virtual Event, 22 February–1 March 2022, pp. 12245–12249. AAAI Press (2022). https://​doi.​org/​10.​1609/​AAAI.​V36I11.​21485
12.
go back to reference Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Huang, C., Jurafsky, D. (eds.) COLING 2010, 23rd International Conference on Computational Linguistics, Proceedings of the Conference, 23–27 August 2010, Beijing, China, pp. 277–285. Tsinghua University Press (2010). https://aclanthology.org/C10-1032/ Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Huang, C., Jurafsky, D. (eds.) COLING 2010, 23rd International Conference on Computational Linguistics, Proceedings of the Conference, 23–27 August 2010, Beijing, China, pp. 277–285. Tsinghua University Press (2010). https://​aclanthology.​org/​C10-1032/​
14.
go back to reference Ehrlinger, L., Wöß, W.: Towards a definition of knowledge graphs. In: Martin, M., Cuquet, M., Folmer, E. (eds.) Joint Proceedings of the Posters and Demos Track of the 12th International Conference on Semantic Systems - SEMANTiCS2016 and the 1st International Workshop on Semantic Change and Evolving Semantics (SuCCESS’16) Co-located with the 12th International Conference on Semantic Systems (SEMANTiCS 2016), Leipzig, Germany, 12–15 September 2016. CEUR Workshop Proceedings, vol. 1695. CEUR-WS.org (2016). https://ceur-ws.org/Vol-1695/paper4.pdf Ehrlinger, L., Wöß, W.: Towards a definition of knowledge graphs. In: Martin, M., Cuquet, M., Folmer, E. (eds.) Joint Proceedings of the Posters and Demos Track of the 12th International Conference on Semantic Systems - SEMANTiCS2016 and the 1st International Workshop on Semantic Change and Evolving Semantics (SuCCESS’16) Co-located with the 12th International Conference on Semantic Systems (SEMANTiCS 2016), Leipzig, Germany, 12–15 September 2016. CEUR Workshop Proceedings, vol. 1695. CEUR-WS.org (2016). https://​ceur-ws.​org/​Vol-1695/​paper4.​pdf
15.
go back to reference Fischer, L., et al.: Which tool to use? Grounded reasoning in everyday environments with assistant robots. In: Steinbauer, G., Ferrein, A. (eds.) Proceedings of the 11th Cognitive Robotics Workshop 2018, co-located with 16th International Conference on Principles of Knowledge Representation and Reasoning, CogRob@KR 2018, Tempe, AZ, USA, 27 October 2018. CEUR Workshop Proceedings, vol. 2325, pp. 3–10. CEUR-WS.org (2018). https://ceur-ws.org/Vol-2325/paper-03.pdf Fischer, L., et al.: Which tool to use? Grounded reasoning in everyday environments with assistant robots. In: Steinbauer, G., Ferrein, A. (eds.) Proceedings of the 11th Cognitive Robotics Workshop 2018, co-located with 16th International Conference on Principles of Knowledge Representation and Reasoning, CogRob@KR 2018, Tempe, AZ, USA, 27 October 2018. CEUR Workshop Proceedings, vol. 2325, pp. 3–10. CEUR-WS.org (2018). https://​ceur-ws.​org/​Vol-2325/​paper-03.​pdf
16.
go back to reference Gan, J., Luo, J., Wang, H., Wang, S., He, W., Huang, Q.: Multimodal entity linking: a new dataset and a baseline. In: Shen, H.T., et al. (eds.) MM ’21: ACM Multimedia Conference, Virtual Event, China, 2–24 October 2021, pp. 993–1001. ACM (2021). https://doi.org/10.1145/3474085.3475400 Gan, J., Luo, J., Wang, H., Wang, S., He, W., Huang, Q.: Multimodal entity linking: a new dataset and a baseline. In: Shen, H.T., et al. (eds.) MM ’21: ACM Multimedia Conference, Virtual Event, China, 2–24 October 2021, pp. 993–1001. ACM (2021). https://​doi.​org/​10.​1145/​3474085.​3475400
20.
go back to reference Li, Y., Ouyang, W., Zhou, B., Wang, K., Wang, X.: Scene graph generation from objects, phrases and region captions. In: IEEE International Conference on Computer Vision. ICCV 2017, Venice, Italy, 22–29 October 2017, pp. 1270–1279. IEEE Computer Society (2017). https://doi.org/10.1109/ICCV.2017.142 Li, Y., Ouyang, W., Zhou, B., Wang, K., Wang, X.: Scene graph generation from objects, phrases and region captions. In: IEEE International Conference on Computer Vision. ICCV 2017, Venice, Italy, 22–29 October 2017, pp. 1270–1279. IEEE Computer Society (2017). https://​doi.​org/​10.​1109/​ICCV.​2017.​142
22.
go back to reference Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBPedia spotlight: shedding light on the web of documents. In: Ghidini, C., Ngomo, A.N., Lindstaedt, S.N., Pellegrini, T. (eds.) Proceedings the 7th International Conference on Semantic Systems. I-SEMANTICS 2011, Graz, Austria, 7–9 September 2011, pp. 1–8. ACM International Conference Proceeding Series. ACM (2011). https://doi.org/10.1145/2063518.2063519 Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBPedia spotlight: shedding light on the web of documents. In: Ghidini, C., Ngomo, A.N., Lindstaedt, S.N., Pellegrini, T. (eds.) Proceedings the 7th International Conference on Semantic Systems. I-SEMANTICS 2011, Graz, Austria, 7–9 September 2011, pp. 1–8. ACM International Conference Proceeding Series. ACM (2011). https://​doi.​org/​10.​1145/​2063518.​2063519
25.
go back to reference Rossetto, L., Baumgartner, M., Ashena, N., Ruosch, F., Pernischová, R., Bernstein, A.: Lifegraph: a knowledge graph for lifelogs. In: Gurrin, C., et al. (eds.) Proceedings of the Third ACM Workshop on Lifelog Search Challenge, LSC@ICMR 2020, Dublin, Ireland, 8–11 June 2020, pp. 13–17. ACM (2020). https://doi.org/10.1145/3379172.3391717 Rossetto, L., Baumgartner, M., Ashena, N., Ruosch, F., Pernischová, R., Bernstein, A.: Lifegraph: a knowledge graph for lifelogs. In: Gurrin, C., et al. (eds.) Proceedings of the Third ACM Workshop on Lifelog Search Challenge, LSC@ICMR 2020, Dublin, Ireland, 8–11 June 2020, pp. 13–17. ACM (2020). https://​doi.​org/​10.​1145/​3379172.​3391717
33.
go back to reference Wu, Z., Xu, Y., Yang, Y., Zhang, C., Zhu, X., Ji, Y.: Towards a semantic web of things: a hybrid semantic annotation, extraction, and reasoning framework for cyber-physical system. Sensors 17(2), 403 (2017). https://doi.org/10.3390/S17020403 Wu, Z., Xu, Y., Yang, Y., Zhang, C., Zhu, X., Ji, Y.: Towards a semantic web of things: a hybrid semantic annotation, extraction, and reasoning framework for cyber-physical system. Sensors 17(2), 403 (2017). https://​doi.​org/​10.​3390/​S17020403
35.
go back to reference Young, J., Basile, V., Kunze, L., Cabrio, E., Hawes, N.: Towards lifelong object learning by integrating situated robot perception and semantic web mining. In: Kaminka, G.A., et al. (eds.) ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August–2 September 2016, The Hague, The Netherlands - Including Prestigious Applications of Artificial Intelligence (PAIS 2016). Frontiers in Artificial Intelligence and Applications, vol. 285, pp. 1458–1466. IOS Press (2016). https://doi.org/10.3233/978-1-61499-672-9-1458 Young, J., Basile, V., Kunze, L., Cabrio, E., Hawes, N.: Towards lifelong object learning by integrating situated robot perception and semantic web mining. In: Kaminka, G.A., et al. (eds.) ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August–2 September 2016, The Hague, The Netherlands - Including Prestigious Applications of Artificial Intelligence (PAIS 2016). Frontiers in Artificial Intelligence and Applications, vol. 285, pp. 1458–1466. IOS Press (2016). https://​doi.​org/​10.​3233/​978-1-61499-672-9-1458
36.
go back to reference Young, J., Kunze, L., Basile, V., Cabrio, E., Hawes, N., Caputo, B.: Semantic web-mining and deep vision for lifelong object discovery. In: 2017 IEEE International Conference on Robotics and Automation. ICRA 2017, Singapore, Singapore, 29 May–3 June 2017, pp. 2774–2779. IEEE (2017). https://doi.org/10.1109/ICRA.2017.7989323 Young, J., Kunze, L., Basile, V., Cabrio, E., Hawes, N., Caputo, B.: Semantic web-mining and deep vision for lifelong object discovery. In: 2017 IEEE International Conference on Robotics and Automation. ICRA 2017, Singapore, Singapore, 29 May–3 June 2017, pp. 2774–2779. IEEE (2017). https://​doi.​org/​10.​1109/​ICRA.​2017.​7989323
Metadata
Title
Advancing Robotic Perception with Perceived-Entity Linking
Authors
Mark Adamik
Romana Pernisch
Ilaria Tiddi
Stefan Schlobach
Copyright Year
2025
DOI
https://doi.org/10.1007/978-3-031-77850-6_11

Premium Partner