Skip to main content
Top
Published in: Business & Information Systems Engineering 5/2019

22-07-2019 | Research Paper

Discovering Data Quality Problems

The Case of Repurposed Data

Authors: Ruojing Zhang, Marta Indulska, Shazia Sadiq

Published in: Business & Information Systems Engineering | Issue 5/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Existing methodologies for identifying data quality problems are typically user-centric, where data quality requirements are first determined in a top-down manner following well-established design guidelines, organizational structures and data governance frameworks. In the current data landscape, however, users are often confronted with new, unexplored datasets that they may not have any ownership of, but that are perceived to have relevance and potential to create value for them. Such repurposed datasets can be found in government open data portals, data markets and several publicly available data repositories. In such scenarios, applying top-down data quality checking approaches is not feasible, as the consumers of the data have no control over its creation and governance. Hence, data consumers – data scientists and analysts – need to be empowered with data exploration capabilities that allow them to investigate and understand the quality of such datasets to facilitate well-informed decisions on their use. This research aims to develop such an approach for discovering data quality problems using generic exploratory methods that can be effectively applied in settings where data creation and use is separated. The approach, named LANG, is developed through a Design Science approach on the basis of semiotics theory and data quality dimensions. LANG is empirically validated in terms of soundness of the approach, its repeatability and generalizability.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Show more products
Appendix
Available only for authorised users
Footnotes
1
The researchers named the approach as ‘LANG’ – ‘Lang’ conveys the meaning of ‘becoming clear’ in the Chinese language, which fits with the aim of the approach, that is, to make clear the data quality requirements of a dataset.
 
2
The mapping is omitted due to length considerations but is available from the authors upon request.
 
3
The download period is between June and August 2016. We note that the datasets are frequently updated in the respective open data portals including change of meta-data, such as adding or removing columns as well as providing or removing other documentation related to the dataset. Hence, the current versions of the datasets may not have the same data quality problems as those identified in our study.
 
4
In this paper we have demonstrated the application of LANG with the help of relational database (MySQL). We present the overall approach in the body of the paper, and present the SQL instantiation of the method in Appendix A.
 
5
Some detail is abstracted in this figure for visual simplicity; in particular sequences between some of the individual checks, which may result in skipping certain checks/stages (as relevant on the basis of analysis results).
 
6
“The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modelling, data visualization, machine learning, and much more.” (jupyter.org).
 
Literature
go back to reference Abedjan Z, Golab L, Naumann F (2015) Profiling relational data: a survey. VLDB J Int J Very Large Data Bases 24(4):557–581CrossRef Abedjan Z, Golab L, Naumann F (2015) Profiling relational data: a survey. VLDB J Int J Very Large Data Bases 24(4):557–581CrossRef
go back to reference Almars A (2016) Automated data quality discovery tool. Master Thesis, The University of Queensland Almars A (2016) Automated data quality discovery tool. Master Thesis, The University of Queensland
go back to reference Batini C, Scannapieco M (2006) Data quality—concepts, methodologies and techniques. Springer, Heidelberg Batini C, Scannapieco M (2006) Data quality—concepts, methodologies and techniques. Springer, Heidelberg
go back to reference Batini C, Francalanci C, Cappiello C, Maurino A (2009) Methodologies for data quality assessment and improvement. ACM Comput Surv 41(3):1–52CrossRef Batini C, Francalanci C, Cappiello C, Maurino A (2009) Methodologies for data quality assessment and improvement. ACM Comput Surv 41(3):1–52CrossRef
go back to reference Bohannon P, Fan W, Geerts F, Jia X, Kementsietsidis A (2007) Conditional functional dependencies for data cleaning. In: IEEE 23rd international conference on data engineering, pp 746–755 Bohannon P, Fan W, Geerts F, Jia X, Kementsietsidis A (2007) Conditional functional dependencies for data cleaning. In: IEEE 23rd international conference on data engineering, pp 746–755
go back to reference Byrne B, Kling J, Mccarty D, Sauter G, Smith H, Worcester P (2008) The information perspective of SOA design, part 6: the value of applying the data quality analysis pattern in SOA. IBM Corporation Byrne B, Kling J, Mccarty D, Sauter G, Smith H, Worcester P (2008) The information perspective of SOA design, part 6: the value of applying the data quality analysis pattern in SOA. IBM Corporation
go back to reference Caballero I, Verbo E, Calero C, Piattini M (2007) A data quality measurement information model based on ISO/IEC 15939. In: Proceedings of the 12th international conference on information quality, pp 393–408 Caballero I, Verbo E, Calero C, Piattini M (2007) A data quality measurement information model based on ISO/IEC 15939. In: Proceedings of the 12th international conference on information quality, pp 393–408
go back to reference Caballero I, Verbo E, Calero C, Piattini M (2008) MMPRO: a methodology based on ISO/IEC 15939 to draw up data quality measurement processes. In: Proceedings of the 13th international conference on information quality, pp 326–340 Caballero I, Verbo E, Calero C, Piattini M (2008) MMPRO: a methodology based on ISO/IEC 15939 to draw up data quality measurement processes. In: Proceedings of the 13th international conference on information quality, pp 326–340
go back to reference Chakraborti S, Dey S (2019) Analysis of competitor intelligence in the era of big data. Bus Inf Syst Eng 61(3):345–355CrossRef Chakraborti S, Dey S (2019) Analysis of competitor intelligence in the era of big data. Bus Inf Syst Eng 61(3):345–355CrossRef
go back to reference Corsar D, Edwards P (2017) Challenges of open data qality: more than just license, format, and customer support. ACM J Data Inf Qual 9(1):3:1–3:4 Corsar D, Edwards P (2017) Challenges of open data qality: more than just license, format, and customer support. ACM J Data Inf Qual 9(1):3:1–3:4
go back to reference Dasu T, Johnson T (2003) Exploratory data mining and data cleaning. Wiley, New YorkCrossRef Dasu T, Johnson T (2003) Exploratory data mining and data cleaning. Wiley, New YorkCrossRef
go back to reference Ehling M, Körner T (2007) Handbook on data quality assessment methods and tools. European Commission, Eurostat Ehling M, Körner T (2007) Handbook on data quality assessment methods and tools. European Commission, Eurostat
go back to reference English LP (1999) Improving data warehouse and business information quality. Wiley English LP (1999) Improving data warehouse and business information quality. Wiley
go back to reference English LP (2009) Information quality applied. Best practices for improving business information, processes and systems. Wiley, New York English LP (2009) Information quality applied. Best practices for improving business information, processes and systems. Wiley, New York
go back to reference Eppler MJ (2001) The concept of information quality. Stud Commun Sci 1(2):167–182 Eppler MJ (2001) The concept of information quality. Stud Commun Sci 1(2):167–182
go back to reference Fan W, Geerts F (2012) Foundations of data quality management. Synth Lect Data Manag 4(5):1–217CrossRef Fan W, Geerts F (2012) Foundations of data quality management. Synth Lect Data Manag 4(5):1–217CrossRef
go back to reference Fisher T (2009) The data asset: how smart companies govern their data for business success. Wiley, New York Fisher T (2009) The data asset: how smart companies govern their data for business success. Wiley, New York
go back to reference Gatling GCBR, Champlin R, Stefani H, Weigel G (2007) Enterprise information management with SAP. Galileo, Boston Gatling GCBR, Champlin R, Stefani H, Weigel G (2007) Enterprise information management with SAP. Galileo, Boston
go back to reference Gregor S, Jones D (2007) The anatomy of a design theory. J Assoc Inf Syst 8(5):312–335 Gregor S, Jones D (2007) The anatomy of a design theory. J Assoc Inf Syst 8(5):312–335
go back to reference Hernández MA, Stolfo SJ (1998) Real-world data is dirty. Data cleansing and the merge/purge problem. Data Min Knowl Discov 2(1):9–37CrossRef Hernández MA, Stolfo SJ (1998) Real-world data is dirty. Data cleansing and the merge/purge problem. Data Min Knowl Discov 2(1):9–37CrossRef
go back to reference Hevner AR, March ST, Park J, Ram S (2004) Design science in information systems research. MIS Q 28(1):75–105CrossRef Hevner AR, March ST, Park J, Ram S (2004) Design science in information systems research. MIS Q 28(1):75–105CrossRef
go back to reference ISO (2011) ISO/TS 8000-1 data quality part 1: overview. ISO ISO (2011) ISO/TS 8000-1 data quality part 1: overview. ISO
go back to reference ISO (2012) ISO 8000-2 data quality-part 2-vocabulary. ISO ISO (2012) ISO 8000-2 data quality-part 2-vocabulary. ISO
go back to reference Jayawardene V, Sadiq S, Indulska M (2013a) An analysis of data quality dimensions. School of Information Technology and Electrical Engineering, The University of Queensland, ITEE Technical Report Jayawardene V, Sadiq S, Indulska M (2013a) An analysis of data quality dimensions. School of Information Technology and Electrical Engineering, The University of Queensland, ITEE Technical Report
go back to reference Jayawardene V, Sadiq S, Indulska M (2013b) The curse of dimensionality in data quality. In: 24th Australasian conference on information systems. RMIT University, pp 1–11 Jayawardene V, Sadiq S, Indulska M (2013b) The curse of dimensionality in data quality. In: 24th Australasian conference on information systems. RMIT University, pp 1–11
go back to reference Judah S, Friedman T (2015) Magic quadrant for data quality tools. Gartner Judah S, Friedman T (2015) Magic quadrant for data quality tools. Gartner
go back to reference Kenett RS, Shmueli G (2014) On information quality. J R Stat Soc Ser A 177(1):3–38CrossRef Kenett RS, Shmueli G (2014) On information quality. J R Stat Soc Ser A 177(1):3–38CrossRef
go back to reference Köhler H, Leck U, Link S (2013) Possible and certain SQL keys. Department of Computer Science, The University of Auckland Köhler H, Leck U, Link S (2013) Possible and certain SQL keys. Department of Computer Science, The University of Auckland
go back to reference Köhler H, Link S, Zhou X (2015) Possible and certain SQL keys. Proc VLDB Endow 8(11):1118–1129CrossRef Köhler H, Link S, Zhou X (2015) Possible and certain SQL keys. Proc VLDB Endow 8(11):1118–1129CrossRef
go back to reference Krogstie J (2002) A semiotic approach to quality in requirements specifications. In: Proceedings of the IFIP TC8/WG8 (1), pp 231–249 Krogstie J (2002) A semiotic approach to quality in requirements specifications. In: Proceedings of the IFIP TC8/WG8 (1), pp 231–249
go back to reference Krogstie J, Lindland OI, Sindre G (1995a) Defining quality aspects for conceptual models. In: Falkenberg ED, Hesse W, Olivé A (eds) Information system concepts. Springer, Boston, pp 216–231CrossRef Krogstie J, Lindland OI, Sindre G (1995a) Defining quality aspects for conceptual models. In: Falkenberg ED, Hesse W, Olivé A (eds) Information system concepts. Springer, Boston, pp 216–231CrossRef
go back to reference Krogstie J, Lindland OI, Sindre G (1995b) Towards a deeper understanding of quality in requirements engineering. In: International conference on advanced information systems engineering. Springer, Heidelberg, pp 82–95 Krogstie J, Lindland OI, Sindre G (1995b) Towards a deeper understanding of quality in requirements engineering. In: International conference on advanced information systems engineering. Springer, Heidelberg, pp 82–95
go back to reference Krueger R, Casey M (1994) Focus groups. A practical guide for applied research. Sage Publications, Thousand Oaks Krueger R, Casey M (1994) Focus groups. A practical guide for applied research. Sage Publications, Thousand Oaks
go back to reference Lee YW, Strong DM, Kahn BK, Wang RY (2002) AIMQ: a methodology for information quality assessment. Inf Manag 40(2):133–146CrossRef Lee YW, Strong DM, Kahn BK, Wang RY (2002) AIMQ: a methodology for information quality assessment. Inf Manag 40(2):133–146CrossRef
go back to reference Lindland OI, Sindre G, Solvberg A (1994) Understanding quality in conceptual modeling. IEEE Softw 11(2):42–49CrossRef Lindland OI, Sindre G, Solvberg A (1994) Understanding quality in conceptual modeling. IEEE Softw 11(2):42–49CrossRef
go back to reference Loshin D (2001) Enterprise knowledge management. The data quality approach. Morgan Kaufmann, Burlington Loshin D (2001) Enterprise knowledge management. The data quality approach. Morgan Kaufmann, Burlington
go back to reference Loshin D (2006) Monitoring data quality performance using data quality metrics. Informatica Corporation, Redwood City Loshin D (2006) Monitoring data quality performance using data quality metrics. Informatica Corporation, Redwood City
go back to reference Maydanchik A (2007) Data quality assessment. Technics Publications, New Jersey Maydanchik A (2007) Data quality assessment. Technics Publications, New Jersey
go back to reference McGilvray D (2008) Executing data quality projects: ten steps to quality data and trusted information. Morgan Kaufmann, Burlington McGilvray D (2008) Executing data quality projects: ten steps to quality data and trusted information. Morgan Kaufmann, Burlington
go back to reference Morgan DL (ed) (1993) Sage focus editions. Successful focus groups: advancing the state of the art, vol 156. Sage Publications, Thousand Oaks Morgan DL (ed) (1993) Sage focus editions. Successful focus groups: advancing the state of the art, vol 156. Sage Publications, Thousand Oaks
go back to reference Morris CW (1938) Foundations of the theory of signs. In: Langford CH (ed) International encyclopedia of unified science. University of Chicago Press, London Morris CW (1938) Foundations of the theory of signs. In: Langford CH (ed) International encyclopedia of unified science. University of Chicago Press, London
go back to reference Naumann F, Rolker C (2000) Assessment methods for information quality criteria. Humboldt-Universität zu Berlin, Informatik-Berichte, Berlin Naumann F, Rolker C (2000) Assessment methods for information quality criteria. Humboldt-Universität zu Berlin, Informatik-Berichte, Berlin
go back to reference OMB U (2002) Guidelines for ensuring and maximizing the quality, objectivity, utility, and integrity of information disseminated by federal agencies, part IX. Office of Management and Budget OMB U (2002) Guidelines for ensuring and maximizing the quality, objectivity, utility, and integrity of information disseminated by federal agencies, part IX. Office of Management and Budget
go back to reference Peffers K, Tuunanen T, Rothenberger MA, Chatterjee S (2007) A design science research methodology for information systems research. J Manag Inf Syst 24(3):45–77CrossRef Peffers K, Tuunanen T, Rothenberger MA, Chatterjee S (2007) A design science research methodology for information systems research. J Manag Inf Syst 24(3):45–77CrossRef
go back to reference Pierce CS (1931–1935) Collected papers. Harvard University Press, Cambridge Pierce CS (1931–1935) Collected papers. Harvard University Press, Cambridge
go back to reference Pipino L, Lee YW, Wang RY (2002) Data quality assessment. Commun ACM 45(4):211–218CrossRef Pipino L, Lee YW, Wang RY (2002) Data quality assessment. Commun ACM 45(4):211–218CrossRef
go back to reference Price R, Shanks G (2004) A semiotic information quality framework. In: Proceedings of the international conference on decision support systems, pp 658–672 Price R, Shanks G (2004) A semiotic information quality framework. In: Proceedings of the international conference on decision support systems, pp 658–672
go back to reference Price R, Shanks G (2005a) A semiotic information quality framework: development and comparative analysis. J Inf Technol 20(2):88–102CrossRef Price R, Shanks G (2005a) A semiotic information quality framework: development and comparative analysis. J Inf Technol 20(2):88–102CrossRef
go back to reference Price R. J, Shanks G (2005b) Empirical refinement of a semiotic information quality framework. In: Proceedings of the 38th annual Hawaii international conference on system sciences, Big Island, pp 216a Price R. J, Shanks G (2005b) Empirical refinement of a semiotic information quality framework. In: Proceedings of the 38th annual Hawaii international conference on system sciences, Big Island, pp 216a
go back to reference Raman V, Hellerstein JM (2001) Potter’s wheel: an interactive data cleaning system. In: Proceedings of the 27th VLDB conference, Rome, pp 381–390 Raman V, Hellerstein JM (2001) Potter’s wheel: an interactive data cleaning system. In: Proceedings of the 27th VLDB conference, Rome, pp 381–390
go back to reference Rosemann M, Vessey I (2008) Toward improving the relevance of information systems research to practice: the role of applicability checks. MIS Q 32(1):1–22CrossRef Rosemann M, Vessey I (2008) Toward improving the relevance of information systems research to practice: the role of applicability checks. MIS Q 32(1):1–22CrossRef
go back to reference Sadiq S, Indulska M (2017) Open data: quality over quantity. Int J Inf Manag 37(3):150–154CrossRef Sadiq S, Indulska M (2017) Open data: quality over quantity. Int J Inf Manag 37(3):150–154CrossRef
go back to reference Sadiq S, Yeganeh NK, Indulska M (2011) 20 years of data quality research: themes, trends and synergies. In: 22nd Australasian database conference, Perth, pp 153–162 Sadiq S, Yeganeh NK, Indulska M (2011) 20 years of data quality research: themes, trends and synergies. In: 22nd Australasian database conference, Perth, pp 153–162
go back to reference Scannapieco M, Virgillito A, Marchetti C, Mecella M, Baldoni R (2004) The Daquincis architecture: a platform for exchanging and improving data quality in cooperative information systems. Inf Syst 29(7):551–582CrossRef Scannapieco M, Virgillito A, Marchetti C, Mecella M, Baldoni R (2004) The Daquincis architecture: a platform for exchanging and improving data quality in cooperative information systems. Inf Syst 29(7):551–582CrossRef
go back to reference Selvage M, Saul J, Jain A (2017) Magic quadrant for data quality tools. Gartner Selvage M, Saul J, Jain A (2017) Magic quadrant for data quality tools. Gartner
go back to reference Shanks GG, Darke P (1998) Understanding data quality and data warehousing: a semiotic approach. IQ, pp 292–309 Shanks GG, Darke P (1998) Understanding data quality and data warehousing: a semiotic approach. IQ, pp 292–309
go back to reference Shanks G, Tansley E (2002) Data quality tagging and decision outcomes. An experimental study. IFIP Working Group, pp 399–410 Shanks G, Tansley E (2002) Data quality tagging and decision outcomes. An experimental study. IFIP Working Group, pp 399–410
go back to reference Sismanis Y, Brown P, Haas PJ, Reinwald B (2006) Gordian: efficient and scalable discovery of composite keys. In: Proceedings of the 32nd international conference on very large data bases, VLDB Endowment, pp 691–702 Sismanis Y, Brown P, Haas PJ, Reinwald B (2006) Gordian: efficient and scalable discovery of composite keys. In: Proceedings of the 32nd international conference on very large data bases, VLDB Endowment, pp 691–702
go back to reference Song S, Chen L (2011) Differential dependencies Reasoning and discovery. ACM Trans Database Syst 36(3):16CrossRef Song S, Chen L (2011) Differential dependencies Reasoning and discovery. ACM Trans Database Syst 36(3):16CrossRef
go back to reference Sonnenberg C, vom Brocke J (2012) Evaluations in the science of the artificial. Reconsidering the build-evaluate pattern in design science research. In: Peffers K, Rothenberger M, Kuechler B (eds) Design science research in information systems, vol 7286. Advances in theory and practice. DESRIST. Lecture notes in computer science. Springer, Heidelberg Sonnenberg C, vom Brocke J (2012) Evaluations in the science of the artificial. Reconsidering the build-evaluate pattern in design science research. In: Peffers K, Rothenberger M, Kuechler B (eds) Design science research in information systems, vol 7286. Advances in theory and practice. DESRIST. Lecture notes in computer science. Springer, Heidelberg
go back to reference Stamper RK (1992) Review of Andersen “Theory of Computer Semiotics”. Comput J 1 Stamper RK (1992) Review of Andersen “Theory of Computer Semiotics”. Comput J 1
go back to reference Stamper R (1993) A semiotic theory of information and information systems/applied semiotics. In: Invited Papers for the ICL/University of Newcastle Seminar on “Information”, September 6–10 Stamper R (1993) A semiotic theory of information and information systems/applied semiotics. In: Invited Papers for the ICL/University of Newcastle Seminar on “Information”, September 6–10 
go back to reference Storey V, Wang R (2001) Extending the ER model to represent data quality requirements. Kluwer, Dordrecht Storey V, Wang R (2001) Extending the ER model to represent data quality requirements. Kluwer, Dordrecht
go back to reference Sturm B, Sunyaev A (2019) Design principles for systematic search systems. Bus Inf Syst Eng 61(1):91–111CrossRef Sturm B, Sunyaev A (2019) Design principles for systematic search systems. Bus Inf Syst Eng 61(1):91–111CrossRef
go back to reference Stvilia B, Gasser L, Twidale MB, Smith LC (2007) A framework for information quality assessment. J Am Soc Inf Sci Technol 58(12):1720–1733CrossRef Stvilia B, Gasser L, Twidale MB, Smith LC (2007) A framework for information quality assessment. J Am Soc Inf Sci Technol 58(12):1720–1733CrossRef
go back to reference Tu SY, Wang Y-YR (1993) Modeling data quality and context through extension of the ER model. Total Data Quality Management Research Program, Sloan School of Management, Massachusetts Institute of Technology, Cambridge Tu SY, Wang Y-YR (1993) Modeling data quality and context through extension of the ER model. Total Data Quality Management Research Program, Sloan School of Management, Massachusetts Institute of Technology, Cambridge
go back to reference Venable J, Pries-Heje J, Baskerville R (2012) A comprehensive framework for evaluation in design science research. In: Peffers K, Rothenberger M, Kuechler B (eds) Design science research in information systems, vol 786. Advances in theory and practice. Springer, Heidelberg, pp 423–438 Venable J, Pries-Heje J, Baskerville R (2012) A comprehensive framework for evaluation in design science research. In: Peffers K, Rothenberger M, Kuechler B (eds) Design science research in information systems, vol 786. Advances in theory and practice. Springer, Heidelberg, pp 423–438
go back to reference Venable J, Pries-Heje J, Baskerville R (2016) FEDS: a framework for evaluation in design science research. Eur J Inf Syst 25(1):77–89CrossRef Venable J, Pries-Heje J, Baskerville R (2016) FEDS: a framework for evaluation in design science research. Eur J Inf Syst 25(1):77–89CrossRef
go back to reference Wand Y, Wang RY (1996) Anchoring data quality dimensions in ontological foundations. Commun ACM 39(11):86–95CrossRef Wand Y, Wang RY (1996) Anchoring data quality dimensions in ontological foundations. Commun ACM 39(11):86–95CrossRef
go back to reference Wang R (1998) A product perspective on total data quality management. Commun ACM 41(2):58–65CrossRef Wang R (1998) A product perspective on total data quality management. Commun ACM 41(2):58–65CrossRef
go back to reference Wang RY, Strong DM (1996) Beyond accuracy: what data quality means to data consumers. J Manag Inf Syst 12(4):5–33CrossRef Wang RY, Strong DM (1996) Beyond accuracy: what data quality means to data consumers. J Manag Inf Syst 12(4):5–33CrossRef
go back to reference Wang R, Ziad M, Lee Y (2001) Data quality. Kluwer, Dordrecht Wang R, Ziad M, Lee Y (2001) Data quality. Kluwer, Dordrecht
go back to reference Zhang R, Jayawardene V, Indulska M, Sadiq S, Zhou X (2014) A data driven approach for discovering data quality requirements. In: 35th international conference on information systems, Auckland Zhang R, Jayawardene V, Indulska M, Sadiq S, Zhou X (2014) A data driven approach for discovering data quality requirements. In: 35th international conference on information systems, Auckland
Metadata
Title
Discovering Data Quality Problems
The Case of Repurposed Data
Authors
Ruojing Zhang
Marta Indulska
Shazia Sadiq
Publication date
22-07-2019
Publisher
Springer Fachmedien Wiesbaden
Published in
Business & Information Systems Engineering / Issue 5/2019
Print ISSN: 2363-7005
Electronic ISSN: 1867-0202
DOI
https://doi.org/10.1007/s12599-019-00608-0

Other articles of this Issue 5/2019

Business & Information Systems Engineering 5/2019 Go to the issue

Premium Partner