skip to main content
article

A Survey of Data Marketplaces and Their Business Models

Published:21 November 2022Publication History
Skip Abstract Section

Abstract

Data is becoming an indispensable production factor for the modern economy, matching or exceeding in importance traditional factors such as land, infrastructure, labor and capital. As part of this, a wide range of applications in different sectors require huge amounts of information to feed machine learning models and algorithms responsible for critical roles in production chains and business processes. A variety of data trading entities including, but not limited to data marketplaces, have thus appeared in order to satisfy and match the offer with the demand for data. In this paper, we present the results and conclusions from a comprehensive survey covering 190 commercial data trading entities, the types of data that their trade, as well as their business models and the technologies that they rely upon. We also point to promising open research questions in the areas of data marketplace federation, pricing, and data ownership protection that could benefit the growing ecosystem of data trading entities that we have surveyed.

References

  1. A. Agarwal, M. Dahleh, and T. Sarkar. 2019. A Marketplace for Data: An Algorithmic Solution. In Proc. of ACM EC'19.Google ScholarGoogle Scholar
  2. R. Agrawal and J. Kiernan. 2002. Watermarking Relational Databases. In Proc. of the VLDB'02.Google ScholarGoogle Scholar
  3. B. Ahlgren, C. Dannewitz, C. Imbrenda, D. Kutscher, and B. Ohlman. 2012. A survey of information-centric networking. IEEE Comm. Magazine 50, 7 (2012).Google ScholarGoogle ScholarCross RefCross Ref
  4. H. Aly, J. Krumm, G. Ranade, and E. Horvitz. 2018. On the value of spatiotemporal information: principles and scenarios. In Proc. of ACM SIGSPATIAL'18.Google ScholarGoogle Scholar
  5. H. Aly, J. Krumm, G. Ranade, and E. Horvitz. 2019. To Buy or Not to Buy: Computing Value of Spatiotemporal Information. ACM Trans. Spatial Algorithms Syst. 5, 4, Article 22 (2019).Google ScholarGoogle Scholar
  6. I. Arrieta-Ibarra, L. Goff, D. Jiménez-Hernández, J. Lanier, and E. G.Weyl. 2018. Should We Treat Data as Labor? Moving beyond "Free". AEA Papers and Proceedings 108 (2018).Google ScholarGoogle ScholarCross RefCross Ref
  7. International Data Spaces Association. https: //internationaldataspaces.org/. Accessed: Apr'22.Google ScholarGoogle Scholar
  8. S. Andrés Azcoitia, C. Iordanou, and N. Laoutaris. 2021. What Is the Price of Data? A Measurement Study of Commercial Data Marketplaces. arXiv:2111.04427Google ScholarGoogle Scholar
  9. S. Andrés Azcoitia and N. Laoutaris. 2020. Try Before You Buy: A Practical Data Purchasing Algorithm for Real-World Data Marketplaces.Google ScholarGoogle Scholar
  10. S. Andrés Azcoitia, M. Paraschiv, and N. Laoutaris. 2020. Computing the Relative Value of Spatio-Temporal Data in Wholesale and Retail Data Marketplaces. arXiv:2002.11193Google ScholarGoogle Scholar
  11. C. Biancotti and P. Ciocca. 2019. Opening Internet Monopolies to Competition with Data Sharing Mandates. Peterson Institute for International Economics. Policy Brief.Google ScholarGoogle Scholar
  12. D. Brickley, M. Burgess, and N. Noy. 2019. Google Dataset Search: Building a Search Engine for Datasets in an Open Web Ecosystem. In Proc. WWW'19.Google ScholarGoogle Scholar
  13. J. P. Carrascal, C. Riederer, V. Erramilli, M. Cherubini, and R. de Oliveira. 2013. Your Browsing Behavior for a Big Mac: Economics of Personal Information Online. In Proc. WWW'13.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Castro, M. Coates, G. Liang, R. Nowak, and B. Yu. 2004. Network Tomography: Recent Developments. Stat. Sci. 19, 3 (2004).Google ScholarGoogle Scholar
  15. S. Chawla, S. Deep, P. Koutris, and Y. Teng. 2019. Revenue maximization for query pricing. In Proc. VLDB Endow. 13 (09 2019).Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. L. Chen, P. Koutris, and A. Kumar. 2019. Towards Model-Based Pricing for Machine Learning in a Data Marketplace. In Proc. of ACMSIGMOD'19.Google ScholarGoogle Scholar
  17. Z. Chen, Z.Wang, and C. Jia. Semantic-Integrated Software Watermarking with Tamper-Proofing. Multimedia Tools Appl. 77, 9 (2018).Google ScholarGoogle Scholar
  18. H. Chesbrough and R. Rosenbloom. 2002. The Role of the Business Model in Capturing Value from Innovation: Evidence from Xerox Corporation's Technology Spin-Off Companies. Industrial and Corporate Change 11 (2002).Google ScholarGoogle Scholar
  19. M. Dahleh. 2018. Why the Data Marketplaces of the Future Will Sell Insights, Not Data.Google ScholarGoogle Scholar
  20. DataRade.ai. 2022. Platforms. https://datarade.ai/platforms. Accessed: Apr'22.Google ScholarGoogle Scholar
  21. S. Delacroix and N. D. Lawrence. 2019. Bottom-up data Trusts: disturbing the 'one size fits all' approach to data governance. International Data Privacy Law 9, 4 (2019).Google ScholarGoogle Scholar
  22. D. Derler and D. Slamanig. 2018. Highly-Efficient Fully- Anonymous Dynamic Group Signatures. In Proc. of ASIACCS'18.Google ScholarGoogle Scholar
  23. G. J. Doërr and J.L. Dugelay. 2003. A guide tour of video watermarking. Signal Process. Image Comm. 18 (2003).Google ScholarGoogle Scholar
  24. EC and IDC. 2021. EU Data Landscape. https://datalandscape.eu/. Accessed: Apr'22.Google ScholarGoogle Scholar
  25. EU. 2016. General Data Protection Regulation.Google ScholarGoogle Scholar
  26. EU. 2020. Data Governance Act.Google ScholarGoogle Scholar
  27. EU. 2020. Press release: Commission proposes measures to boost data sharing and support European data spaces.Google ScholarGoogle Scholar
  28. R. C. Fernandez, P. Subramaniam, and M. J. Franklin. 2020. Data Market Platforms: Trading Data Assets to Solve Data Problems. In Proc. VLDB Endow. 13, 12 (2020).Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Gaia-X. https://gaia-x.eu/. Accessed: Apr'22.Google ScholarGoogle Scholar
  30. A. Ghorbani and J. Zou. 2019. Data Shapley: Equitable Valuation of Data for Machine Learning. In Proc. of ICML'19.Google ScholarGoogle Scholar
  31. A. Ghosh and A. Roth. 2011. Selling Privacy at Auction. In Proc. of ACM EC '11.Google ScholarGoogle Scholar
  32. A. Gionis, P. Indyk, and R. Motwani. 1999. Similarity Search in High Dimensions via Hashing. In Proc. of VLDB'99.Google ScholarGoogle Scholar
  33. A. V. Goldberg and J. D. Hartline. 2003. Competitiveness via Consensus. In Proc. of ACM-SIAM SODA'03.Google ScholarGoogle Scholar
  34. A. V. Goldberg, J. D. Hartline, and A. Wright. 2001. Competitive Auctions and Digital Goods. In Proc. of ACM-SIAM SODA'01.Google ScholarGoogle Scholar
  35. J. R. Heckman, E. Boehmer, E. H. Peters, M. Davaloo, and N. G Kurup. 2015. A Pricing Model for Data Markets. In Proc. of iConference.Google ScholarGoogle Scholar
  36. N. Henke, J. Bughin, M. Chui, J. Manyika, T. Saleh, and B. Wiseman. 2016. The age of analytics: Competing in a data-driven world. McKinsey Global Institute.Google ScholarGoogle Scholar
  37. C. Iordanou, N. Kourtellis, J. M. Carrascosa, C. Soriente, R. Cuevas, and N. Laoutaris. 2019. Beyond Content Analysis: Detecting Targeted Ads via Distributed Counting. In Proc. of ACM CoNEXT '19.Google ScholarGoogle Scholar
  38. C. Iordanou, C. Soriente, M. Sirivianos, and N. Laoutaris. 2017. Who is Fiddling with Prices? Building and Deploying a Watchdog Service for E-Commerce. In Proc. of ACM SIGCOMM'17.Google ScholarGoogle Scholar
  39. P. Koutris, P. Upadhyaya, M. Balazinska, B. Howe, and D. Suciu. 2012. QueryMarket demonstration: Pricing for online data markets. In Proc. VLDB Endow. 5 (08 2012).Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. V. Koutsos, D. Papadopoulos, D. Chatzopoulos, S. Tarkoma, and P. Hui. 2020. Agora: A Privacy-aware Data Marketplace. In IEEE Int. Conf. on Distributed Comp. Systems (ICDCS).Google ScholarGoogle Scholar
  41. J. Lanier. 2013. Who Owns the Future? Simon & Schuster.Google ScholarGoogle Scholar
  42. N. Laoutaris and C. Iordanou. 2021. What Do Information Centric Networks, Trusted Execution Environments, and Digital Watermarking Have to Do with Privacy, the Data Economy, and Their Future? SIGCOMM Computing Comm. Rev. (March 2021).Google ScholarGoogle Scholar
  43. C. Li, D. Y. Li, G. Miklau, and D. Suciu. 2015. A Theory of Pricing Private Data. ACM Transactions on Database Systems.Google ScholarGoogle Scholar
  44. X. Liang and S. Xiang. 2020. Robust reversible audio watermarking based on high-order difference statistics. Signal Processing 173.Google ScholarGoogle Scholar
  45. A. Löser, F. Stahl, A. Muschalle, and G. Vossen. 2012. Pricing Approaches for Data Markets. In Proc. of the BIRTE.Google ScholarGoogle Scholar
  46. W. Mao, Z. Zheng, and F. Wu. 2019. Pricing for Revenue Maximization in IoT Data Markets: An Information Design Perspective. In IEEE INFOCOM 2019.Google ScholarGoogle Scholar
  47. D. Moody and P. Walsh. 1999. Measuring the Value Of Information - An Asset Valuation Approach. In ECIS.Google ScholarGoogle Scholar
  48. State of California. 2018. California Consumer Privacy Act (CCPA).Google ScholarGoogle Scholar
  49. O. Ohrimenko, S. Tople, and S. Tschiatschek. 2019. Collaborative Machine Learning Markets with Data-Replication-Robust Payments. arXiv:1911.09052.Google ScholarGoogle Scholar
  50. L. Olejnik, M. Tran, and C. Castelluccia. 2014. Selling Off Privacy at Auction. In Proc. of NDSS'14.Google ScholarGoogle Scholar
  51. W. Org, J. Becker, K. Backhaus, H. Grob, B. Hellingrath, T. Hoeren, S. Klein, H. Kuchen, U. Müller-Funk, U. Thonemann, G. Vossen, F. Stahl, and F. Schomm. 2014. The Data Marketplace Survey Revisited. Westf. Wilhelms-Univ., ERCIS.Google ScholarGoogle Scholar
  52. A. Osterwalder. 2004. The business model ontology. A proposition in a design science approach.Google ScholarGoogle Scholar
  53. P. Papadopoulos, N. Kourtellis, P. Rodriguez, and N. Laoutaris. 2017. If You Are Not Paying for It, You Are the Product: How Much Do Advertisers Pay to Reach You? In Proc. ACM IMC'17.Google ScholarGoogle Scholar
  54. J. Pei. 2020. Data Pricing -- From Economics to Data Science. In Proc. of ACM SIGKDD'20.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. E. Posner and G. Weyl. 2018. Radical Markets. Uprooting Capitalism and Democracy for a Just Society. Princeton Univ. Press.Google ScholarGoogle Scholar
  56. Daria R. 2019. The Future of Data Marketplaces. Link. Accessed: Apr'22.Google ScholarGoogle Scholar
  57. D. Reinsel, J. Gantz, and J. Rydning. 2018. The Digitization of the World - From Edge to Core. Data Age 2025.Google ScholarGoogle Scholar
  58. B. Rozemberczki, L.Watson, P. Bayer, H. Yang, O. Kiss, S. Nilsson, and R. Sarkar. 2022. The Shapley Value in Machine Learning. arXiv:2202.05594Google ScholarGoogle Scholar
  59. M. Sabt, M. Achemlal, and A. Bouabdallah. 2015. Trusted Execution Environment: What It is, and What It is Not. In 2015 IEEE Trustcom/BigDataSE/ISPA, Vol. 1.Google ScholarGoogle Scholar
  60. F. Schomm, F. Stahl, and G. Vossen. 2013. Marketplaces for Data: An Initial Survey. SIGMOD Record 42, 1 (May 2013), 12 pagesGoogle ScholarGoogle ScholarDigital LibraryDigital Library
  61. C. Shapiro and H. R. Varian. 2000. Information Rules: A Strategic Guide to the Network Economy. Harvard Business School Press.Google ScholarGoogle Scholar
  62. M. Spiekermann. 2019. Data Marketplaces: Trends and Monetisation of Data Goods. Intereconomics.Google ScholarGoogle Scholar
  63. F. Stahl, F. Schomm, L. Vomfell, and G. Vossen. 2017. Marketplaces for Digital Data: Quo Vadis? Computer and Information Science 10 (2017).Google ScholarGoogle Scholar
  64. European Union. 2021. Consultation on the Data Act & amended rules on the legal protection of databases.Google ScholarGoogle Scholar
  65. C. Veliz. 2021. Privacy is Power: Why and How You Should Take Back Control of Your Data. Bantam Press.Google ScholarGoogle Scholar
  66. Q. Wang, R. Li, Q. Wang, and S. Chen. 2021. Non-Fungible Token (NFT): Overview, Evaluation, Opportunities and Challenges. arXiv:2105.07447.Google ScholarGoogle Scholar
  67. X. Xu, A. Hannun, and L. Van Der Maaten. 2022. Data Appraisal Without Data Sharing. In Proc. of Machine Learning Research.Google ScholarGoogle Scholar
  68. H. Yu and M. Zhang. 2017. Data pricing strategy based on data quality. Computers and Industrial Engineering 112 (2017).Google ScholarGoogle Scholar
  69. D. Zhang, H. Wang, D. Xiaoou, Y. Zhang, J. Li, and H. Gao. 2018. On the Fairness of Quality-based Data Markets. (2018). arXiv:1808.01624.Google ScholarGoogle Scholar
  70. K. Zhao, H. Lu, and J. Mei. 2014. Locality Preserving Hashing. In Proc. of the AAAI Conference 28, 1 (2014).Google ScholarGoogle Scholar
  71. K. R. Özyilmaz, M. Do?gan, and A. Yurdakul. 2018. IDMoB: IoT Data Marketplace on Blockchain. In Crypto Valley Conference on Blockchain Technology (CVCBT).Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader