DOI: 10.1145/1772690.1772750
research-article

Redundancy detection in service-oriented systems

Published: 26 April 2010

ABSTRACT

This paper addresses the problem of identifying redundant data in large-scale service-oriented information systems. Specifically, the paper puts forward an automated method to pinpoint potentially redundant data attributes from a given collection of semantically-annotated Web service interfaces. The key idea is to construct a service network to represent all input and output dependencies between data attributes and operations captured in the service interfaces, and to apply centrality measures from network theory in order to quantify the degree to which an attribute belongs to a given subsystem. The proposed method was tested on a federated governmental information system consisting of 58 independently-maintained information systems providing altogether about 1000 service operations described in WSDL. The accuracy of the method is evaluated in terms of precision and recall.
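
The key idea above can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say which centrality measures the paper uses, so the sketch assumes a simple degree-based share as a stand-in. The subsystem, operation, and attribute names, the `threshold` parameter, and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical service interfaces: (subsystem, operation, inputs, outputs).
# All names below are invented for illustration.
INTERFACES = [
    ("population", "getPerson",   ["person_id"], ["name", "address"]),
    ("population", "findPerson",  ["name"],      ["person_id", "address"]),
    ("population", "movePerson",  ["person_id", "address"], []),
    ("tax",        "getTaxpayer", ["person_id"], ["name", "tax_debt"]),
    ("tax",        "fileReturn",  ["person_id", "income"], ["tax_debt"]),
    ("vehicle",    "getOwner",    ["plate_no"],  ["person_id", "address"]),
]


def build_service_network(interfaces):
    """Return attribute -> subsystem -> number of operations touching it.

    Each input or output dependency is one edge between an attribute node
    and an operation node; edges are grouped by the operation's subsystem.
    """
    degree = defaultdict(lambda: defaultdict(int))
    for subsystem, _operation, inputs, outputs in interfaces:
        for attribute in set(inputs) | set(outputs):
            degree[attribute][subsystem] += 1
    return degree


def belonging(degree):
    """Share of an attribute's edges that fall in each subsystem (0..1)."""
    scores = {}
    for attribute, per_subsystem in degree.items():
        total = sum(per_subsystem.values())
        scores[attribute] = {s: n / total for s, n in per_subsystem.items()}
    return scores


def redundancy_candidates(scores, threshold=0.5):
    """Flag (attribute, subsystem) pairs where the attribute appears in a
    subsystem other than the one it belongs to most strongly.

    The threshold is an assumed cut-off, not a value from the paper.
    """
    candidates = set()
    for attribute, per_subsystem in scores.items():
        home = max(per_subsystem, key=per_subsystem.get)
        if per_subsystem[home] < threshold:
            continue  # no clear home subsystem; leave undecided
        for subsystem in per_subsystem:
            if subsystem != home:
                candidates.add((attribute, subsystem))
    return candidates


def precision_recall(predicted, actual):
    """Standard precision and recall over sets of flagged items."""
    true_positives = len(predicted & actual)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall
```

Calling `redundancy_candidates(belonging(build_service_network(INTERFACES)))` on this toy data flags `address` in the `vehicle` subsystem and `name` in the `tax` subsystem, because both attributes are used mostly by `population` operations. It also flags the shared identifier `person_id`, which a real analysis would probably want to treat separately. Given a hand-labelled set of truly redundant pairs, `precision_recall` compares it against the flagged set in the usual way.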


      • Published in

        WWW '10: Proceedings of the 19th international conference on World wide web
        April 2010
        1407 pages
        ISBN:9781605587998
        DOI:10.1145/1772690

        Copyright © 2010 International World Wide Web Conference Committee (IW3C2)

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Acceptance Rates

        Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
