Skip to main content
Erschienen in: Earth Science Informatics 2/2024

16.02.2024 | RESEARCH

Constraint information extraction for 3D geological modelling using a span-based joint entity and relation extraction model

verfasst von: Can Zhuang, Chunhua Liu, Henghua Zhu, Yuhong Ma, Guoping Shi, Zhizheng Liu, Bohan Liu

Erschienen in: Earth Science Informatics | Ausgabe 2/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Data sparsity has long been a problem in 3D geological modeling work. The geometric, topological, and attribute information of geological bodies in geological reports provide important constraint information during 3D geological modeling. However, manually extracting complex and diverse constraint knowledge from a large amount of textual data is a challenging and time-consuming task. The development of information extraction and text mining technology has made it possible to automatically extract textual constraint information. To this end, this study firstly summarized the textual description characteristics of geological body constraint information in geological reports, and used a span-based tagging scheme for data annotation; Secondly, a span-based joint entity and relation extraction framework was introduced to extract constraint information in geological 3D modeling, which improves the extraction capability of the geological modeling constraint information by obtaining deep semantic information of the characters through the BERT model, in addition, the model has the joint extraction capabilities of entity classification and relation classification on candidate entities; Finally, in the experiments study, a Chinese geological survey report was used as training data for evaluation, and we validated our method’s effectiveness through comparison of our results to those of different models. We further compared and analyzed the impact of different parameters and span representations on our model’s performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Brunsting S, De Sterck H, Dolman R et al (2016) GeoTextTagger: High-Precision Location Tagging of Textual Documents using a Natural Language Processing Approach Brunsting S, De Sterck H, Dolman R et al (2016) GeoTextTagger: High-Precision Location Tagging of Textual Documents using a Natural Language Processing Approach
Zurück zum Zitat Budi I, Bressan S (2003) Association rules mining for name entity recognition[C]// Proceedings of the International Conference on Web Information Systems Engineering Budi I, Bressan S (2003) Association rules mining for name entity recognition[C]// Proceedings of the International Conference on Web Information Systems Engineering
Zurück zum Zitat Cakaloglu T, Szegedy C, Xu X (2020) Text embeddings for retrieval from a large knowledge base[C]// Proceedings of the International Conference on Research Challenges in Information Science 338–351 Cakaloglu T, Szegedy C, Xu X (2020) Text embeddings for retrieval from a large knowledge base[C]// Proceedings of the International Conference on Research Challenges in Information Science 338–351
Zurück zum Zitat Chen Z, Guo C (2022) A pattern-first pipeline approach for entity and relation extraction. Neurocomputing 494:182–191CrossRef Chen Z, Guo C (2022) A pattern-first pipeline approach for entity and relation extraction. Neurocomputing 494:182–191CrossRef
Zurück zum Zitat Chen Q, Liu G, He Z et al (2020) Current situation and prospect of structure-attribute integrated 3D geological modeling technology for geological big data. Bull Geol Sci Technol 39(4):51–58 Chen Q, Liu G, He Z et al (2020) Current situation and prospect of structure-attribute integrated 3D geological modeling technology for geological big data. Bull Geol Sci Technol 39(4):51–58
Zurück zum Zitat Eberts M, Ulges A (2020) Span-Based Joint Entity and Relation Extraction with Transformer Pre-Training[C]// Proceedings of the European Conference on Artificial Intelligence Eberts M, Ulges A (2020) Span-Based Joint Entity and Relation Extraction with Transformer Pre-Training[C]// Proceedings of the European Conference on Artificial Intelligence
Zurück zum Zitat Enkhsaikhan M, Holden EJ, Duuring P et al (2021) Understanding Ore-Forming Conditions using Machine Reading of Text. Ore Geol Rev 135(2):104200CrossRef Enkhsaikhan M, Holden EJ, Duuring P et al (2021) Understanding Ore-Forming Conditions using Machine Reading of Text. Ore Geol Rev 135(2):104200CrossRef
Zurück zum Zitat Enkhsaikhan M, Liu W, Holden E-J, et al (2018) Towards Geological Knowledge Discovery Using Vector-Based Semantic Similarity[C]// Proceedings of the International Conference on Advanced Data Mining and Applications, Cham 224–237 Enkhsaikhan M, Liu W, Holden E-J, et al (2018) Towards Geological Knowledge Discovery Using Vector-Based Semantic Similarity[C]// Proceedings of the International Conference on Advanced Data Mining and Applications, Cham 224–237
Zurück zum Zitat Fan R, Wang L, Yan J et al (2019) Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards. Int J Geo-Inf 9(1):15CrossRef Fan R, Wang L, Yan J et al (2019) Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards. Int J Geo-Inf 9(1):15CrossRef
Zurück zum Zitat Garcia LF, Abel M, Perrin M et al (2020) The GeoCore ontology: A core ontology for general use in Geology. Comput Geosci 135:104387CrossRef Garcia LF, Abel M, Perrin M et al (2020) The GeoCore ontology: A core ontology for general use in Geology. Comput Geosci 135:104387CrossRef
Zurück zum Zitat Gil Y, Hill M, Horel J et al (2018) Intelligent systems for geosciences: An essential research agenda. Commun ACM 62(1):76–84CrossRef Gil Y, Hill M, Horel J et al (2018) Intelligent systems for geosciences: An essential research agenda. Commun ACM 62(1):76–84CrossRef
Zurück zum Zitat Goyal A, Gupta V, Kumar M (2018) Recent Named Entity Recognition and Classification techniques: A systematic review. Computer Science Review 29(AUG.): 21–43 Goyal A, Gupta V, Kumar M (2018) Recent Named Entity Recognition and Classification techniques: A systematic review. Computer Science Review 29(AUG.): 21–43
Zurück zum Zitat Gupta P, Roth B, Schütze H (2018) Joint Bootstrapping Machines for High Confidence Relation Extraction. arXiv e-prints Gupta P, Roth B, Schütze H (2018) Joint Bootstrapping Machines for High Confidence Relation Extraction. arXiv e-prints
Zurück zum Zitat Gusenbauer M (2019) Google Scholar to overshadow them all? Comparing the sizes of 12 academic search engines and bibliographic databases. Scientometrics 118(1):177–214CrossRef Gusenbauer M (2019) Google Scholar to overshadow them all? Comparing the sizes of 12 academic search engines and bibliographic databases. Scientometrics 118(1):177–214CrossRef
Zurück zum Zitat Hao M, Li M, Zhang J et al (2021) Research on 3D geological modeling method based on multiple constraints. Earth Sci Inf 14(1):291–297CrossRef Hao M, Li M, Zhang J et al (2021) Research on 3D geological modeling method based on multiple constraints. Earth Sci Inf 14(1):291–297CrossRef
Zurück zum Zitat Holden E-J, Liu W, Horrocks T et al (2019) GeoDocA – Fast analysis of geological content in mineral exploration reports: A text mining approach. Ore Geol Rev 111:102919CrossRef Holden E-J, Liu W, Horrocks T et al (2019) GeoDocA – Fast analysis of geological content in mineral exploration reports: A text mining approach. Ore Geol Rev 111:102919CrossRef
Zurück zum Zitat Hou Z, Zhu Y, Gao X et al (2015) A Chinese geological time scale ontology for geodata discovery[C]// Proceedings of the 2015 23rd International Conference on Geoinformatics 1–5 Hou Z, Zhu Y, Gao X et al (2015) A Chinese geological time scale ontology for geodata discovery[C]// Proceedings of the 2015 23rd International Conference on Geoinformatics 1–5
Zurück zum Zitat Huang L, Du Y, Chen G (2015) GeoSegmenter: A statistically learned Chinese word segmenter for the geoscience domain. Comput Geosci 76:11–17ADSCrossRef Huang L, Du Y, Chen G (2015) GeoSegmenter: A statistically learned Chinese word segmenter for the geoscience domain. Comput Geosci 76:11–17ADSCrossRef
Zurück zum Zitat Huang W, Mao Y, Yang L et al (2021) Local-to-global GCN with knowledge-aware representation for distantly supervised relation extraction. Knowl-Based Syst (Dec.25): 234 Huang W, Mao Y, Yang L et al (2021) Local-to-global GCN with knowledge-aware representation for distantly supervised relation extraction. Knowl-Based Syst (Dec.25): 234
Zurück zum Zitat Li W, Wu L, Xie Z et al (2019) Ontology-based question understanding with the constraint of Spatio-temporal geological knowledge. Earth Sci Inf 12(4):599–613CrossRef Li W, Wu L, Xie Z et al (2019) Ontology-based question understanding with the constraint of Spatio-temporal geological knowledge. Earth Sci Inf 12(4):599–613CrossRef
Zurück zum Zitat Li Z, Pan M, Han D et al (2016) Three-Dimensional Structural Modeling Technique. Earth Sci 41(12):2136–2146 Li Z, Pan M, Han D et al (2016) Three-Dimensional Structural Modeling Technique. Earth Sci 41(12):2136–2146
Zurück zum Zitat Liu C, Yang S (2022) Using text mining to establish knowledge graph from accident/incident reports in risk assessment. Expert Syst Appl 207:117991CrossRef Liu C, Yang S (2022) Using text mining to establish knowledge graph from accident/incident reports in risk assessment. Expert Syst Appl 207:117991CrossRef
Zurück zum Zitat Liu P, Guo Y, Wang F et al (2022) Chinese named entity recognition: The state of the art. Neurocomputing 473:37–53CrossRef Liu P, Guo Y, Wang F et al (2022) Chinese named entity recognition: The state of the art. Neurocomputing 473:37–53CrossRef
Zurück zum Zitat Liu W, Wen Y, Yu Z et al (2016) Large-Margin Softmax Loss for Convolutional Neural Networks. JMLRorg Liu W, Wen Y, Yu Z et al (2016) Large-Margin Softmax Loss for Convolutional Neural Networks. JMLRorg
Zurück zum Zitat Ma X (2022) Knowledge graph construction and application in geosciences: A review. Comput Geosci 161:105082CrossRef Ma X (2022) Knowledge graph construction and application in geosciences: A review. Comput Geosci 161:105082CrossRef
Zurück zum Zitat Ma Y, Xie Z, Li G et al (2022) Text visualization for geological hazard documents via text mining and natural language processing. Earth Sci Inf 15(1):439–454ADSCrossRef Ma Y, Xie Z, Li G et al (2022) Text visualization for geological hazard documents via text mining and natural language processing. Earth Sci Inf 15(1):439–454ADSCrossRef
Zurück zum Zitat Mai G, Janowicz K, Cai L, et al (2020) SE-KGE: A Location-Aware Knowledge Graph Embedding Model for Geographic Question Answering and Spatial Semantic Lifting Mai G, Janowicz K, Cai L, et al (2020) SE-KGE: A Location-Aware Knowledge Graph Embedding Model for Geographic Question Answering and Spatial Semantic Lifting
Zurück zum Zitat Olierook H, Scalzo R, Kohn D et al (2021) Bayesian geological and geophysical data fusion for the construction and uncertainty quantification of 3D geological models. Geosci Front 12(1):479–493CrossRef Olierook H, Scalzo R, Kohn D et al (2021) Bayesian geological and geophysical data fusion for the construction and uncertainty quantification of 3D geological models. Geosci Front 12(1):479–493CrossRef
Zurück zum Zitat Qin Z, Ye F (2018) Research on Reliability of Instance and Pattern in Semi-supervised Entity Relation Extraction[C]// Proceedings of the International Conference on Intelligent Computing, Communication and Devices Qin Z, Ye F (2018) Research on Reliability of Instance and Pattern in Semi-supervised Entity Relation Extraction[C]// Proceedings of the International Conference on Intelligent Computing, Communication and Devices
Zurück zum Zitat Qiu Q, Zhong X, Liang W (2018a) A cyclic self-learning Chinese word segmentation for the geoscience domain. Geomatica 72(1):16–26CrossRef Qiu Q, Zhong X, Liang W (2018a) A cyclic self-learning Chinese word segmentation for the geoscience domain. Geomatica 72(1):16–26CrossRef
Zurück zum Zitat Qiu Q, Xie Z, Wu L et al (2018b) DGeoSegmenter: A dictionary-based Chinese word segmenter for the geoscience domain. Comput Geosci 121:1–11ADSCrossRef Qiu Q, Xie Z, Wu L et al (2018b) DGeoSegmenter: A dictionary-based Chinese word segmenter for the geoscience domain. Comput Geosci 121:1–11ADSCrossRef
Zurück zum Zitat Qiu Q, Xie Z, Wu L et al (2019) GNER: A Generative Model for Geological Named Entity Recognition Without Labeled Data Using Deep Learning. Earth and Space Science 6(6):931–946ADSCrossRef Qiu Q, Xie Z, Wu L et al (2019) GNER: A Generative Model for Geological Named Entity Recognition Without Labeled Data Using Deep Learning. Earth and Space Science 6(6):931–946ADSCrossRef
Zurück zum Zitat Qiu Q, Xie Z, Wu L et al (2020) Automatic spatiotemporal and semantic information extraction from unstructured geoscience reports using text mining techniques. Earth Sci Inf 13(4):1393–1410CrossRef Qiu Q, Xie Z, Wu L et al (2020) Automatic spatiotemporal and semantic information extraction from unstructured geoscience reports using text mining techniques. Earth Sci Inf 13(4):1393–1410CrossRef
Zurück zum Zitat Qiu Q, Ma K, Zhu H et al (2022a) BERT-based Method and Significance of Constraint Information Extraction for 3D Geological Modelling. Northwest Geol 55(4):124–132 Qiu Q, Ma K, Zhu H et al (2022a) BERT-based Method and Significance of Constraint Information Extraction for 3D Geological Modelling. Northwest Geol 55(4):124–132
Zurück zum Zitat Qiu Q, Xie Z, Ma K et al (2022b) Spatially oriented convolutional neural network for spatial relation extraction from natural language texts. Trans GIS 26(2):839–866CrossRef Qiu Q, Xie Z, Ma K et al (2022b) Spatially oriented convolutional neural network for spatial relation extraction from natural language texts. Trans GIS 26(2):839–866CrossRef
Zurück zum Zitat Qiu Q, Ma K, Lv H et al (2023a) Construction and application of a knowledge graph for iron deposits using text mining analytics and a deep learning algorithm. Math Geosci 55(3):423–456 Qiu Q, Ma K, Lv H et al (2023a) Construction and application of a knowledge graph for iron deposits using text mining analytics and a deep learning algorithm. Math Geosci 55(3):423–456
Zurück zum Zitat Qiu Q, Wang B, Ma K et al (2023b) Geological profile-text information association model of mineral exploration reports for fast analysis of geological content. Ore Geol Rev 153:105278CrossRef Qiu Q, Wang B, Ma K et al (2023b) Geological profile-text information association model of mineral exploration reports for fast analysis of geological content. Ore Geol Rev 153:105278CrossRef
Zurück zum Zitat Sobhana N, Mitra P, Ghosh S (2010) Conditional random field based named entity recognition in geological text. Int J Comput Appl 1(3):143–147 Sobhana N, Mitra P, Ghosh S (2010) Conditional random field based named entity recognition in geological text. Int J Comput Appl 1(3):143–147
Zurück zum Zitat Sobhana NV, Ghosh SK, Mitra P (2012) Entity Relation Extraction from geological text using Conditional Random Fields and subsequence kernels[C]// Proceedings of the India Conference (INDICON), Annual IEEE 2013 Sobhana NV, Ghosh SK, Mitra P (2012) Entity Relation Extraction from geological text using Conditional Random Fields and subsequence kernels[C]// Proceedings of the India Conference (INDICON), Annual IEEE 2013
Zurück zum Zitat Sun Q, Zhang K, Lv L et al (2022) Joint extraction of entities and overlapping relations by improved graph convolutional networks. Appl Intell 52(5):5212–5224CrossRef Sun Q, Zhang K, Lv L et al (2022) Joint extraction of entities and overlapping relations by improved graph convolutional networks. Appl Intell 52(5):5212–5224CrossRef
Zurück zum Zitat Wan Q, Wei L, Chen X et al (2021) A region-based hypergraph network for joint entity-relation extraction. Knowl-Based Syst 10:107298CrossRef Wan Q, Wei L, Chen X et al (2021) A region-based hypergraph network for joint entity-relation extraction. Knowl-Based Syst 10:107298CrossRef
Zurück zum Zitat Wang B, Wu L, Xie Z et al (2022a) Understanding geological reports based on knowledge graphs using a deep learning approach. Comput Geosci 168:105229CrossRef Wang B, Wu L, Xie Z et al (2022a) Understanding geological reports based on knowledge graphs using a deep learning approach. Comput Geosci 168:105229CrossRef
Zurück zum Zitat Wang B, Ma K, Wu L et al (2022b) Visual analytics and information extraction of geological content for text-based mineral exploration reports. Ore Geol Rev 144:104818CrossRef Wang B, Ma K, Wu L et al (2022b) Visual analytics and information extraction of geological content for text-based mineral exploration reports. Ore Geol Rev 144:104818CrossRef
Zurück zum Zitat Wang L, Li Z, Zheng X (2021) Unsupervised Word Segmentation with Bi-directional Neural Language Model. ACM Transactions on Asian and Low-Resource Language Information Processing 22(1):1–16CrossRef Wang L, Li Z, Zheng X (2021) Unsupervised Word Segmentation with Bi-directional Neural Language Model. ACM Transactions on Asian and Low-Resource Language Information Processing 22(1):1–16CrossRef
Zurück zum Zitat Wang C, Li Y, Chen J (2023) Text mining and knowledge graph construction from geoscience literature legacy: A review. Geol Soc Am Spec 558:11–28 Wang C, Li Y, Chen J (2023) Text mining and knowledge graph construction from geoscience literature legacy: A review. Geol Soc Am Spec 558:11–28
Zurück zum Zitat Wang C, Xiaogang, et al (2018) Information extraction and knowledge graph construction from geoscience literature. Comput Geosci 112:112–120ADSCrossRef Wang C, Xiaogang, et al (2018) Information extraction and knowledge graph construction from geoscience literature. Comput Geosci 112:112–120ADSCrossRef
Zurück zum Zitat Wei D, Jiang B, Zhang J (2021) Research on content storage method for unstructured geological data. Northwest Geol 54(04):266–273 Wei D, Jiang B, Zhang J (2021) Research on content storage method for unstructured geological data. Northwest Geol 54(04):266–273
Zurück zum Zitat Yong PC, Nordholm, et al (2013) Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement. Speech Commun 55(2):358–376CrossRef Yong PC, Nordholm, et al (2013) Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement. Speech Commun 55(2):358–376CrossRef
Zurück zum Zitat Yue K, Xu F, Yu J (2017) Shallow and wide fractional max-pooling network for image classification. Neural Comput Appl 31:409–419CrossRef Yue K, Xu F, Yu J (2017) Shallow and wide fractional max-pooling network for image classification. Neural Comput Appl 31:409–419CrossRef
Zurück zum Zitat Zhan X, Lu C, Hu G. 3D structural modeling for seismic exploration based on knowledge graphs. Geophysics, 2022, 87(3): IM81-IM100 Zhan X, Lu C, Hu G. 3D structural modeling for seismic exploration based on knowledge graphs. Geophysics, 2022, 87(3): IM81-IM100
Zurück zum Zitat Zhang X, Zhang J, Tian Y et al (2020) Urban geological 3D modeling based on papery borehole log. ISPRS Int J Geo Inf 9(6):389CrossRef Zhang X, Zhang J, Tian Y et al (2020) Urban geological 3D modeling based on papery borehole log. ISPRS Int J Geo Inf 9(6):389CrossRef
Zurück zum Zitat Zhang C, Zhang X, Jiang W et al (2009) Rule-Based Extraction of Spatial Relations in Natural Language Text[C]// Proceedings of the International Conference on Computational Intelligence & Software Engineering Zhang C, Zhang X, Jiang W et al (2009) Rule-Based Extraction of Spatial Relations in Natural Language Text[C]// Proceedings of the International Conference on Computational Intelligence & Software Engineering
Zurück zum Zitat Zhong DY, Wang LG, Lin BI et al (2019) Implicit modeling of complex orebody with constraints of geological rules. Transa Nonferrous Metals Soc China 29(11):2392–2399CrossRef Zhong DY, Wang LG, Lin BI et al (2019) Implicit modeling of complex orebody with constraints of geological rules. Transa Nonferrous Metals Soc China 29(11):2392–2399CrossRef
Zurück zum Zitat Zhuang C, Li W, Xie Z et al (2021) A multi-granularity knowledge association model of geological text based on hypernetwork. Earth Sci Inf 14(1):227–246CrossRef Zhuang C, Li W, Xie Z et al (2021) A multi-granularity knowledge association model of geological text based on hypernetwork. Earth Sci Inf 14(1):227–246CrossRef
Zurück zum Zitat Zhuang C, Zhu H, Wang W et al (2023) Research on urban 3D geological modeling based on multi-modal data fusion: a case study in Jinan China. Earth Science Informatics 16(1):549–563ADSCrossRef Zhuang C, Zhu H, Wang W et al (2023) Research on urban 3D geological modeling based on multi-modal data fusion: a case study in Jinan China. Earth Science Informatics 16(1):549–563ADSCrossRef
Metadaten
Titel
Constraint information extraction for 3D geological modelling using a span-based joint entity and relation extraction model
verfasst von
Can Zhuang
Chunhua Liu
Henghua Zhu
Yuhong Ma
Guoping Shi
Zhizheng Liu
Bohan Liu
Publikationsdatum
16.02.2024
Verlag
Springer Berlin Heidelberg
Erschienen in
Earth Science Informatics / Ausgabe 2/2024
Print ISSN: 1865-0473
Elektronische ISSN: 1865-0481
DOI
https://doi.org/10.1007/s12145-024-01245-2

Weitere Artikel der Ausgabe 2/2024

Earth Science Informatics 2/2024 Zur Ausgabe

Premium Partner