Abstract
Data is becoming an indispensable production factor for the modern economy, matching or exceeding in importance traditional factors such as land, infrastructure, labor and capital. As part of this, a wide range of applications in different sectors require huge amounts of information to feed machine learning models and algorithms responsible for critical roles in production chains and business processes. A variety of data trading entities including, but not limited to data marketplaces, have thus appeared in order to satisfy and match the offer with the demand for data. In this paper, we present the results and conclusions from a comprehensive survey covering 190 commercial data trading entities, the types of data that their trade, as well as their business models and the technologies that they rely upon. We also point to promising open research questions in the areas of data marketplace federation, pricing, and data ownership protection that could benefit the growing ecosystem of data trading entities that we have surveyed.
- A. Agarwal, M. Dahleh, and T. Sarkar. 2019. A Marketplace for Data: An Algorithmic Solution. In Proc. of ACM EC'19.Google Scholar
- R. Agrawal and J. Kiernan. 2002. Watermarking Relational Databases. In Proc. of the VLDB'02.Google Scholar
- B. Ahlgren, C. Dannewitz, C. Imbrenda, D. Kutscher, and B. Ohlman. 2012. A survey of information-centric networking. IEEE Comm. Magazine 50, 7 (2012).Google ScholarCross Ref
- H. Aly, J. Krumm, G. Ranade, and E. Horvitz. 2018. On the value of spatiotemporal information: principles and scenarios. In Proc. of ACM SIGSPATIAL'18.Google Scholar
- H. Aly, J. Krumm, G. Ranade, and E. Horvitz. 2019. To Buy or Not to Buy: Computing Value of Spatiotemporal Information. ACM Trans. Spatial Algorithms Syst. 5, 4, Article 22 (2019).Google Scholar
- I. Arrieta-Ibarra, L. Goff, D. Jiménez-Hernández, J. Lanier, and E. G.Weyl. 2018. Should We Treat Data as Labor? Moving beyond "Free". AEA Papers and Proceedings 108 (2018).Google ScholarCross Ref
- International Data Spaces Association. https: //internationaldataspaces.org/. Accessed: Apr'22.Google Scholar
- S. Andrés Azcoitia, C. Iordanou, and N. Laoutaris. 2021. What Is the Price of Data? A Measurement Study of Commercial Data Marketplaces. arXiv:2111.04427Google Scholar
- S. Andrés Azcoitia and N. Laoutaris. 2020. Try Before You Buy: A Practical Data Purchasing Algorithm for Real-World Data Marketplaces.Google Scholar
- S. Andrés Azcoitia, M. Paraschiv, and N. Laoutaris. 2020. Computing the Relative Value of Spatio-Temporal Data in Wholesale and Retail Data Marketplaces. arXiv:2002.11193Google Scholar
- C. Biancotti and P. Ciocca. 2019. Opening Internet Monopolies to Competition with Data Sharing Mandates. Peterson Institute for International Economics. Policy Brief.Google Scholar
- D. Brickley, M. Burgess, and N. Noy. 2019. Google Dataset Search: Building a Search Engine for Datasets in an Open Web Ecosystem. In Proc. WWW'19.Google Scholar
- J. P. Carrascal, C. Riederer, V. Erramilli, M. Cherubini, and R. de Oliveira. 2013. Your Browsing Behavior for a Big Mac: Economics of Personal Information Online. In Proc. WWW'13.Google ScholarDigital Library
- R. Castro, M. Coates, G. Liang, R. Nowak, and B. Yu. 2004. Network Tomography: Recent Developments. Stat. Sci. 19, 3 (2004).Google Scholar
- S. Chawla, S. Deep, P. Koutris, and Y. Teng. 2019. Revenue maximization for query pricing. In Proc. VLDB Endow. 13 (09 2019).Google ScholarDigital Library
- L. Chen, P. Koutris, and A. Kumar. 2019. Towards Model-Based Pricing for Machine Learning in a Data Marketplace. In Proc. of ACMSIGMOD'19.Google Scholar
- Z. Chen, Z.Wang, and C. Jia. Semantic-Integrated Software Watermarking with Tamper-Proofing. Multimedia Tools Appl. 77, 9 (2018).Google Scholar
- H. Chesbrough and R. Rosenbloom. 2002. The Role of the Business Model in Capturing Value from Innovation: Evidence from Xerox Corporation's Technology Spin-Off Companies. Industrial and Corporate Change 11 (2002).Google Scholar
- M. Dahleh. 2018. Why the Data Marketplaces of the Future Will Sell Insights, Not Data.Google Scholar
- DataRade.ai. 2022. Platforms. https://datarade.ai/platforms. Accessed: Apr'22.Google Scholar
- S. Delacroix and N. D. Lawrence. 2019. Bottom-up data Trusts: disturbing the 'one size fits all' approach to data governance. International Data Privacy Law 9, 4 (2019).Google Scholar
- D. Derler and D. Slamanig. 2018. Highly-Efficient Fully- Anonymous Dynamic Group Signatures. In Proc. of ASIACCS'18.Google Scholar
- G. J. Doërr and J.L. Dugelay. 2003. A guide tour of video watermarking. Signal Process. Image Comm. 18 (2003).Google Scholar
- EC and IDC. 2021. EU Data Landscape. https://datalandscape.eu/. Accessed: Apr'22.Google Scholar
- EU. 2016. General Data Protection Regulation.Google Scholar
- EU. 2020. Data Governance Act.Google Scholar
- EU. 2020. Press release: Commission proposes measures to boost data sharing and support European data spaces.Google Scholar
- R. C. Fernandez, P. Subramaniam, and M. J. Franklin. 2020. Data Market Platforms: Trading Data Assets to Solve Data Problems. In Proc. VLDB Endow. 13, 12 (2020).Google ScholarDigital Library
- Gaia-X. https://gaia-x.eu/. Accessed: Apr'22.Google Scholar
- A. Ghorbani and J. Zou. 2019. Data Shapley: Equitable Valuation of Data for Machine Learning. In Proc. of ICML'19.Google Scholar
- A. Ghosh and A. Roth. 2011. Selling Privacy at Auction. In Proc. of ACM EC '11.Google Scholar
- A. Gionis, P. Indyk, and R. Motwani. 1999. Similarity Search in High Dimensions via Hashing. In Proc. of VLDB'99.Google Scholar
- A. V. Goldberg and J. D. Hartline. 2003. Competitiveness via Consensus. In Proc. of ACM-SIAM SODA'03.Google Scholar
- A. V. Goldberg, J. D. Hartline, and A. Wright. 2001. Competitive Auctions and Digital Goods. In Proc. of ACM-SIAM SODA'01.Google Scholar
- J. R. Heckman, E. Boehmer, E. H. Peters, M. Davaloo, and N. G Kurup. 2015. A Pricing Model for Data Markets. In Proc. of iConference.Google Scholar
- N. Henke, J. Bughin, M. Chui, J. Manyika, T. Saleh, and B. Wiseman. 2016. The age of analytics: Competing in a data-driven world. McKinsey Global Institute.Google Scholar
- C. Iordanou, N. Kourtellis, J. M. Carrascosa, C. Soriente, R. Cuevas, and N. Laoutaris. 2019. Beyond Content Analysis: Detecting Targeted Ads via Distributed Counting. In Proc. of ACM CoNEXT '19.Google Scholar
- C. Iordanou, C. Soriente, M. Sirivianos, and N. Laoutaris. 2017. Who is Fiddling with Prices? Building and Deploying a Watchdog Service for E-Commerce. In Proc. of ACM SIGCOMM'17.Google Scholar
- P. Koutris, P. Upadhyaya, M. Balazinska, B. Howe, and D. Suciu. 2012. QueryMarket demonstration: Pricing for online data markets. In Proc. VLDB Endow. 5 (08 2012).Google ScholarDigital Library
- V. Koutsos, D. Papadopoulos, D. Chatzopoulos, S. Tarkoma, and P. Hui. 2020. Agora: A Privacy-aware Data Marketplace. In IEEE Int. Conf. on Distributed Comp. Systems (ICDCS).Google Scholar
- J. Lanier. 2013. Who Owns the Future? Simon & Schuster.Google Scholar
- N. Laoutaris and C. Iordanou. 2021. What Do Information Centric Networks, Trusted Execution Environments, and Digital Watermarking Have to Do with Privacy, the Data Economy, and Their Future? SIGCOMM Computing Comm. Rev. (March 2021).Google Scholar
- C. Li, D. Y. Li, G. Miklau, and D. Suciu. 2015. A Theory of Pricing Private Data. ACM Transactions on Database Systems.Google Scholar
- X. Liang and S. Xiang. 2020. Robust reversible audio watermarking based on high-order difference statistics. Signal Processing 173.Google Scholar
- A. Löser, F. Stahl, A. Muschalle, and G. Vossen. 2012. Pricing Approaches for Data Markets. In Proc. of the BIRTE.Google Scholar
- W. Mao, Z. Zheng, and F. Wu. 2019. Pricing for Revenue Maximization in IoT Data Markets: An Information Design Perspective. In IEEE INFOCOM 2019.Google Scholar
- D. Moody and P. Walsh. 1999. Measuring the Value Of Information - An Asset Valuation Approach. In ECIS.Google Scholar
- State of California. 2018. California Consumer Privacy Act (CCPA).Google Scholar
- O. Ohrimenko, S. Tople, and S. Tschiatschek. 2019. Collaborative Machine Learning Markets with Data-Replication-Robust Payments. arXiv:1911.09052.Google Scholar
- L. Olejnik, M. Tran, and C. Castelluccia. 2014. Selling Off Privacy at Auction. In Proc. of NDSS'14.Google Scholar
- W. Org, J. Becker, K. Backhaus, H. Grob, B. Hellingrath, T. Hoeren, S. Klein, H. Kuchen, U. Müller-Funk, U. Thonemann, G. Vossen, F. Stahl, and F. Schomm. 2014. The Data Marketplace Survey Revisited. Westf. Wilhelms-Univ., ERCIS.Google Scholar
- A. Osterwalder. 2004. The business model ontology. A proposition in a design science approach.Google Scholar
- P. Papadopoulos, N. Kourtellis, P. Rodriguez, and N. Laoutaris. 2017. If You Are Not Paying for It, You Are the Product: How Much Do Advertisers Pay to Reach You? In Proc. ACM IMC'17.Google Scholar
- J. Pei. 2020. Data Pricing -- From Economics to Data Science. In Proc. of ACM SIGKDD'20.Google ScholarDigital Library
- E. Posner and G. Weyl. 2018. Radical Markets. Uprooting Capitalism and Democracy for a Just Society. Princeton Univ. Press.Google Scholar
- Daria R. 2019. The Future of Data Marketplaces. Link. Accessed: Apr'22.Google Scholar
- D. Reinsel, J. Gantz, and J. Rydning. 2018. The Digitization of the World - From Edge to Core. Data Age 2025.Google Scholar
- B. Rozemberczki, L.Watson, P. Bayer, H. Yang, O. Kiss, S. Nilsson, and R. Sarkar. 2022. The Shapley Value in Machine Learning. arXiv:2202.05594Google Scholar
- M. Sabt, M. Achemlal, and A. Bouabdallah. 2015. Trusted Execution Environment: What It is, and What It is Not. In 2015 IEEE Trustcom/BigDataSE/ISPA, Vol. 1.Google Scholar
- F. Schomm, F. Stahl, and G. Vossen. 2013. Marketplaces for Data: An Initial Survey. SIGMOD Record 42, 1 (May 2013), 12 pagesGoogle ScholarDigital Library
- C. Shapiro and H. R. Varian. 2000. Information Rules: A Strategic Guide to the Network Economy. Harvard Business School Press.Google Scholar
- M. Spiekermann. 2019. Data Marketplaces: Trends and Monetisation of Data Goods. Intereconomics.Google Scholar
- F. Stahl, F. Schomm, L. Vomfell, and G. Vossen. 2017. Marketplaces for Digital Data: Quo Vadis? Computer and Information Science 10 (2017).Google Scholar
- European Union. 2021. Consultation on the Data Act & amended rules on the legal protection of databases.Google Scholar
- C. Veliz. 2021. Privacy is Power: Why and How You Should Take Back Control of Your Data. Bantam Press.Google Scholar
- Q. Wang, R. Li, Q. Wang, and S. Chen. 2021. Non-Fungible Token (NFT): Overview, Evaluation, Opportunities and Challenges. arXiv:2105.07447.Google Scholar
- X. Xu, A. Hannun, and L. Van Der Maaten. 2022. Data Appraisal Without Data Sharing. In Proc. of Machine Learning Research.Google Scholar
- H. Yu and M. Zhang. 2017. Data pricing strategy based on data quality. Computers and Industrial Engineering 112 (2017).Google Scholar
- D. Zhang, H. Wang, D. Xiaoou, Y. Zhang, J. Li, and H. Gao. 2018. On the Fairness of Quality-based Data Markets. (2018). arXiv:1808.01624.Google Scholar
- K. Zhao, H. Lu, and J. Mei. 2014. Locality Preserving Hashing. In Proc. of the AAAI Conference 28, 1 (2014).Google Scholar
- K. R. Özyilmaz, M. Do?gan, and A. Yurdakul. 2018. IDMoB: IoT Data Marketplace on Blockchain. In Crypto Valley Conference on Blockchain Technology (CVCBT).Google Scholar
Recommendations
Marketplaces for data: an initial survey
Data is becoming more and more of a commodity, so that it is not surprising that data has reached the status of tradable goods. An increasing number of data providers is recognizing this and is consequently setting up platforms for selling, buying, or ...
A classification framework for data marketplaces
Trading data as a commodity has become increasingly popular in recent years, and data marketplaces have emerged as a new business model where data from a variety of sources can be collected, aggregated, processed, enriched, bought, and sold. They are ...
Comments