Abstract
Interoperability is the most critical issue facing businesses that need to access information from multiple information systems. Our objective in this research is to develop a comprehensive framework and methodology to facilitate semantic interoperability among distributed and heterogeneous information systems. A comprehensive framework for managing various semantic conflicts is proposed. Our proposed framework provides a unified view of the underlying representational and reasoning formalism for the semantic mediation process. This framework is then used as a basis for automating the detection and resolution of semantic conflicts among heterogeneous information sources. We define several types of semantic mediators to achieve semantic interoperability. A domain-independent ontology is used to capture various semantic conflicts. A mediation-based query processing technique is developed to provide uniform and integrated access to the multiple heterogeneous databases. A usable prototype is implemented as a proof-of-concept for this work. Finally, the usefulness of our approach is evaluated using three cases in different application domains. Various heterogeneous datasets are used during the evaluation phase. The results of the evaluation suggest that correct identification and construction of both schema and ontology-schema mapping knowledge play very important roles in achieving interoperability at both the data and schema levels.
- Arens, Y. Knoblock, C. A., and Shen, W.-M. 1996. Query reformulation for dynamic information integration. J. Intell. Inf. Syst. 6, 2/3, 99--130. Google Scholar
- Batini, C. and Lenzerini, M. 1984. A methodology for data schema integration in the entity relationship model. IEEE Trans. Softw. Eng. SE-10, 8, 650--664.Google Scholar
- Bishop, M. 1996. Unix security: threats and solutions. Presentation given at SHARE 86.0 (Anaheim, CA, March 1996). Available online at http://seclab.cs.ucdavis.edu/projects/ vulnerabilities/scriv/1996-share86.pdf.Google Scholar
- Calmet, J., Debertin, D., Jekutsch, S., and Schu, J. 1996. An executable graphical representation of mediatory information systems. In Proceedings of the 12th International Conference on Data Engineering (New Orleans, LA, Feb. 26--March 1). S. Y. W. Su, Ed. 124--131. Google Scholar
- Collet, C., Huhns, M. N., and Shen, W.-M. 1991. Resource integration using a large knowledge base in Carnot. IEEE Comput. 24, 12, 55--62. Google Scholar
- Czejdo, B., Rusinkiewics, M., and Embley, D. W. 1987. An approach to schema integration and query formulation in federated database systems. In Proceedings of the 3rd International Conference on Data Engineering (Los Angeles, CA, Feb. 3--5). 477--484. Google Scholar
- DeMichiel, L. G. 1989. Resolving database incompatibility: An approach to performing relational operations over mismatched domains. IEEE Trans. Knowl. Data Eng. 1, 4, 485--493. Google Scholar
- Dowell, M. L., Stephens, L. M., and Bonnell, R. D. 1998. Using a domain-knowledge ontology as a semantic gateway among information resources. In Readings in Agents. M. N. Huhns and M. P. Singh, Eds. Morgan Kaufman. San Francisco, CA, 255--260. Google Scholar
- Fan, W., Lu, H., Madnick, S. E., and Cheung, D. 2001. Discovering and reconciling value conflicts for numerical data integration. Inform. Syst. 26, 8, 635--656. Google Scholar
- Garcia-Solaco, M., Saltor, F., and Castellanos, M. 1995. A structural based schema integration methodology. In Proceedings of the 11th International Conference on Data Engineering (Taipei, Taiwan, March 6--10). 505--512. Google Scholar
- Geller, J., Perl, Y., Neuhold, E., and Sheth, A. P. 1992. Structural schema integration with full and partial correspondence using the dual model. Inform. Syst. 17, 6, 443--464. Google Scholar
- Genesereth, M. R., Keller, A. M., and Duschka, O. M. 1997. Infomaster: An information integration system. In Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data (Tucson, AZ, May 13--15), J. M. Peckman Ed. ACM Press, New York, NY, 39--542. Google Scholar
- Goble, C. A., Stevens, R., Ng, G., Bechhofer, S., Paton, N. W., Baker, P. G., Peim, M., and Brass, A. 2001. Transparent access to multiple bioinformatics information sources. IBM Syst. J. 40, 2, 532--551. Google Scholar
- Goh, C. H., Bressan, S., Madnick, S. E., and Siegel, M. D. 1999. Context interchange: New features and formalisms for the intelligent integration of information. ACM Trans. Inform. Syst. 17, 3, 270--293. Google Scholar
- Goh, C. H., Madnick, S. E., and Siegel, M. D. 1994. Context interchange: overcoming the challenges of large-scale interoperable database systems in a dynamic environment. In Proceedings of the 3rd International Conference on Information and Knowledge Management (Gaithersburg, MD, Nov. 29-Dec. 2). 337--346. Google Scholar
- Hayne, S. and Ram, S. 1990. Multi-user view integration system (MUVIS): An expert system for view integration. In Proceedings of the 6th International Conference on Data Engineering (Los Angeles, CA, Feb. 5--9). 402--409. Google Scholar
- Huhns, M. N. and Singh, M. P. 1998. Managing heterogeneous transaction workflows with co-operating agents. In Agent Technology: Foundations, Applications, and Markets, N. R. Jennings and M. J. Wooldridge, Eds. Springer, Berlin, Germany, 219--239. Google Scholar
- Hwang, Y. 1999. Implementation of semantic conflict resolvers: A production rule-based approach. Master's thesis. Department of Management Information Systems, The University of Arizona, Tucson, AZ.Google Scholar
- Jaworski, J. and Perrone, P. J. 2000. Java Security Handbook. Sams Publishing, Indianapolis, IN. Google Scholar
- Jennings, N. R. and Wooldridge, M. J. 1998. Applications of intelligent agents. In Agent Technology: Foundations, Applications, and Markets, N. R. Jennings and M. J. Wooldridge, Eds. Springer, Berlin, Germany, 3--28. Google Scholar
- Kahng, J. and McLeod, D. 1998. Dynamic classificational ontologies: Mediation of information sharing in cooperative federated database systems. In Cooperative Information Systems: Trends and Directions, M. P. Papazoglou and G. Sohlageter, Eds. Academic Press, San Diego, CA, 179--203.Google Scholar
- Kashyap, V. and Sheth, A. P. 1996. Semantic and schematic similarities between database objects: A context-based approach. The VLDB J. 5, 4, 276--304. Google Scholar
- Kim, K. 1999. The development of interoperable mediation system architecture: An ontology and mediator-based approach, Master's thesis. Department of Management Information Systems, The University of Arizona, Tucson, AZ.Google Scholar
- Koetzle, L., Rutstein, C., Liddell, H., Buss, C., and Nakashima, T. 2001. Reducing integration's cost. Available online at http://www.forrester.com/ER/Research/Report/0,1338,11981,00.html.Google Scholar
- Krishnamurthy, R., Litwin, W., and Kent, W. 1991. Language features for interoperability of databases with schematic discrepancies. In Proceedings of the 1991 ACM SIGMOD International Conference on Management of Data (Denver, CO, May 29--31), J. Clifford and R. King, Eds. ACM Press, New York, NY, 40--49. Google Scholar
- Kuokka, D. and Harada, L. 1996. Integrating information via matchmaking. J. Intell. Inform. Syst. 6, 2/3, 261--279. Google Scholar
- Lakshmanan, L. V. S., Sadri, F., and Subramanian, I. N. 1997. Logic and algebraic languages for interoperability in multidatabase systems. J. Logic Programm. 33, 2, 101--149.Google Scholar
- Lee, D. and Hwang, Y. 2001. Extracting semantic metadata and its visualization. ACM Crossroads 7, 3, 19--27. Google Scholar
- Lenat, D. B., Guha, R. V., Pittman, K., Pratt, D., and Shepherd, M. 1990. Cyc: Toward programs with common sense. Commun. ACM 33, 8, 30--49. Google Scholar
- Ludäscher, B., Gupta, A., and Martone, M. E. 2001. Model-based mediation with domain maps. In Proceedings of the 17th International Conference Data Engineering (Heidelberg, Germany, April 2--6). 81--90. Google Scholar
- MacGregor, R. 1991. The evolving technology of classification-based knowledge representation systems. In Principles of Semantic Networks: Explorations in the Representation of Knowledge, J. F. Sowa, Ed. Morgan Kaufmann, San Mateo, CA, 385--400.Google Scholar
- Madnick, S. E. 1999. Metadata Jones and the tower of Babel: The challenge of large-scale semantic heterogeneity. In Proceedings of the 3rd IEEE Meta-Data Conference (April 6--7). Bethesda, MD.Google Scholar
- Mahalingam, K. and Huhns, M. N. 1997. An ontology tool for query formulation in an agent-based context. In Proceedings of the 2nd IFCIS international Conference on Cooperative Information Systems (Kiawah Island, SC, June 24--27). Google Scholar
- March, S. T., Hevner, A., and Ram, S. 2000. Research commentary: An agenda for information technology research in heterogeneous and distributed environment. Inform. Syst. Res. 11, 4, 327--341. Google Scholar
- March, S. T. and Smith, G. F. 1995. Design and natural science research on information technology. Decis. Supp. Syst. 15, 4, 251--266. Google Scholar
- McLeod, D. and Si, A. 1995. The design and experimental evaluation of an information discovery mechanism for networks of autonomous database systems. In Proceedings of the 11th IEEE International Conference Data Engineering (Taipei, Taiwan, March 6--10). 15--24. Google Scholar
- Navathe, S. B., Elmasri, R. and Larson, J. A. 1986. Integrating user views in database design. IEEE Comput. 19, 1, 50--62.Google Scholar
- Nyerges, T. L. 1989. Schema integration analysis for the development of GIS databases. Int. J. Geograph. Inform. Syst. 3, 2, 153--183.Google Scholar
- Ouksel, A. M. 1999. A framework for a scalable agent architecture of cooperating heterogeneous knowledge sources. In Intelligent Information Agents: Agent-Based Information Discovery and Management on the Internet, M. Klusch, Ed. Springer, Berlin, Germany, 100--124. Google Scholar
- Papakonstantinou, Y., Garcia-Molina, H., and Ullman, J. 1996. Medmaker: A mediation system based on declarative specifications. In Proceedings of the 12th IEEE International Conference on Data Engineering (New Orleans, LA, Feb. 26--March 1), S. Y. W. Su, Ed. 132--141. Google Scholar
- Park, J. 1999. Facilitating interoperability among heterogeneous geographic database systems: A theoretical framework, a prototype system, and evaluation, Ph.D. dissertation. Department of Management Information Systems, The University of Arizona, Tucson, AZ. Google Scholar
- Ram, S. and Park, J. 2004. Semantic conflict resolution ontology (SCROL): An ontology for detecting and resolving data- and schema-level semantic conflicts. IEEE Trans. Knowl. Data Eng. 16, 2, 189--202. Google Scholar
- Ram, S., Park, J., and Ball, G. 1999a. Semantic model support for geographic information systems. IEEE Comput. 32, 5, 74--81. Google Scholar
- Ram, S., Park, J., Kim, K., and Hwang, Y. 1999b. A comprehensive framework for classifying data- and schema-level semantic conflicts in geographic and non-geographic databases. In Proceedings of the 9th Workshop on Information Technologies and Systems (Charlotte, NC, Dec. 11--12), S. Sarkar and S. Narasimhan, Eds. 185--190.Google Scholar
- Ram, S., Park, J., and Lee, D. 1999c. Digital libraries for the next millennium: challenges and research directions. Inform. Syst. Frontier 1, 1, 75--94. Google Scholar
- Ram, S. and Ramesh, V. 1999. Schema integration: Past, current and future. In Management of Heterogeneous and Autonomous Database Systems, A. Elmagarmid, M. Rusinkeiwicz, and A. P. Sheth, Eds. Morgan Kaufmann, San Francisco, CA, 119--155. Google Scholar
- Reddy, M. P., Prasad, B. E., Reddy, P. G., and Gupta, A. 1994. A methodology for integration of heterogeneous databases. IEEE Trans. Knowl. Data Eng. 6, 6, 920--933. Google Scholar
- Saltor, F. and Rodriuez, E. 1997. On intelligent access to heterogeneous information. In Proceedings of the 4th International Workshop on Knowledge Representation Meets Databases (KRDB '97, Athens, Greece, Aug. 30). 15.1--15.7.Google Scholar
- Sciore, E., Siegel, M., and Rosenthal, A. 1994. Using semantic values to facilitate interoperability among heterogeneous information systems. ACM Trans. Database Syst. 19, 2, 254--290. Google Scholar
- Sheth, A. 1995. Data semantics: What, where and how? Tech. Rep. TR-CS-95-003. Department of Computer Science, The University of Georgia, Athens, GA.Google Scholar
- Sheth, A. P., Gala, S. K., and Navathe, S. B. 1993. On automatic reasoning for schema integration. Int. J. Intell. Coop. Inf. Syst. 2, 1, 23--50.Google Scholar
- Sheth, A. P. and Kashyap, V. 1992. So far (schematically), yet so near (semantically). In Proceedings of the IFIP WG2.6 Database Semantics Conference on Interoperable Database Systems (DS-5, Lorne, Victoria, Australia, Nov. 16--20), D. K. Hsiao, E. J. Neuhold, and R. Sacks-Davis, Eds., 283--312. Google Scholar
- Sheth, A. P. and Larson, J. A. 1990. Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Comput. Surv. 22, 3, 184--236. Google Scholar
- Sikora, R. and Shaw, M. J. 1998. A multi-agent framework for the coordination and integration of information systems. Management Sci. 44, 11, Part 2 of 2, S65--S78. Google Scholar
- Stonebraker, M. 1998. Are we working on the right problems? In Proceedings of the 1998 ACM SIGMOD International Conference Management of Data (Seattle, WA, June 1--4). ACM Press, New York, NY, 496. Google Scholar
- Storey, V. C., Chiang, R. H. L., Dey, D., Goldstein, R. C., and Sundaresan, S. 1997. Database design with common sense business reasoning and learning. ACM Trans. Database Syst. 22, 4, 471--512. Google Scholar
- Tejada, S., Knoblock, C. A., and Minton, S. 2001. Learning object identification rules for information integration. Inform. Syst. 26, 8, 607--633. Google Scholar
- Ullman, J. D. 1997. Information integration using logical views. In Proceedings of the International Conference on Database Theory (Delphi, Greece). 19--40. Google Scholar
- van der Vet, P. E. and Mars, N. J. I. 1998. Bottom-up construction of ontologies. IEEE Trans. Knowl. Data Eng. 10, 4, 513--526. Google Scholar
- Ventrone, V. and Heiler, S. 1991. Semantic heterogeneity as a result of domain evolution. SIGMOD Rec. 20, 4, 16--20. Google Scholar
- Wagner, G. 1998. Foundations of Knowledge Systems: With Applications to Databases and Agents. Kluwer Academic Publishers, Boston, MA. Google Scholar
- Whitten, A. and Tygar, J. D. 1998. Usability of security: A case study. Tech. rep. CMU-CS-98-155. School of Computer Science, Carnegie Mellon University, Pittsburgh, PA.Google Scholar
- Wiederhold, G. 1992. Mediators in the architecture of future information systems. IEEE Comput. 25, 3, 38--49. Google Scholar
- Wiederhold, G. and Genesereth, M. R. 1997. The conceptual basis for mediation services. IEEE Expert 12, 5, 38--47. Google Scholar
- Woods, W. A. 1975. What's in a link: Foundations for semantic networks. In Representation and Understanding: Studies in Cognitive Science, D. G. Bobrow and A. Collins, Eds. Academic Press, Inc., New York, NY, 35--82.Google Scholar
- Yin, R. K. 1994. Case Study Research: Design and Methods, 2nd ed. SAGE Publications, Thousand Oaks, CA.Google Scholar
- Yu, C., Sun, W., Dao, S., and Keirsey, D. 1991. Determining relationships among attributes for interoperability of multi-database systems. In Proceedings of the 1st International Workshop on Multidatabase Systems (Kyoto, Japan, April 7--9), Y. Kambayashi, M. Rusinkiewicz, and A. P. Sheth, Eds. 251--255.Google Scholar
- Yu, C. T. and Meng, W. 1998. Principles of Database Query Processing for Advanced Applications. Morgan Kaufmann, San Francisco, CA. Google Scholar
Index Terms
- Information systems interoperability: What lies beneath?
Recommendations
Semantic Conflict Resolution Ontology (SCROL): An Ontology for Detecting and Resolving Data and Schema-Level Semantic Conflicts
Abstract--Establishing semantic interoperability among heterogeneous information sources has been a critical issue in the database community for the past two decades. Despite the critical importance, current approaches to semantic interoperability of ...
Context interchange: new features and formalisms for the intelligent integration of information
The Context Interchange strategy presents a novel perspective for mediated data access in which semantic conflicts among heterogeneous systems are not identified a priori, but are detected and reconciled by a context mediator through comparison of ...
Common knowledge based access to disparate semantic spaces: the ontology switching approach
PAKM'06: Proceedings of the 6th international conference on Practical Aspects of Knowledge ManagementWe present the ontology switching system MyShelf which is implemented using technological standards of the semantic web. MyShelf solves the problem of semantic heterogeneity by integrating different cognitive views on one data collection. Homogenization ...
Comments