Published in: Distributed and Parallel Databases 1/2018

21-09-2017

AUDIT: approving and tracking updates with dependencies in collaborative databases

Authors: Khaleel Mershad, Qutaibah M. Malluhi, Mourad Ouzzani, Mingjie Tang, Michael Gribskov, Walid G. Aref 

Abstract

Collaborative databases, such as genome databases, often involve extensive curation activities in which collaborators need to interact in order to converge and agree on the content of the data. In a typical scenario, a member of the collaboration makes some updates, and these become visible to all collaborators for possible comments and modifications. At the same time, these updates are usually pending approval or rejection by the data custodian, based on the related discussion and the content of the data. Unfortunately, the approval and authorization of updates in current databases are based solely on the identity of the user, e.g., via the SQL GRANT and REVOKE commands. In this paper, we present a scalable cloud-based collaborative database system to support collaboration and data curation scenarios. Our system is based on an Update Pending Approval model. In a nutshell, when a collaborator updates a given data item, the item is marked as pending approval until the data custodian approves or rejects the update. Until then, any other collaborator can view and comment on the data while it awaits approval. We fully realized our system inside HBase, a cloud-based platform. We also conducted extensive experiments showing that the system scales well under different workloads.
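
The Update Pending Approval idea described in the abstract can be illustrated with a minimal in-memory sketch. This is not the authors' HBase implementation: the class names (CuratedStore, PendingUpdate, Status), the method names, and the choice to keep only one pending update per item are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional, Tuple


class Status(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class PendingUpdate:
    """One proposed change to a data item (hypothetical structure)."""
    author: str
    new_value: str
    status: Status = Status.PENDING
    comments: List[str] = field(default_factory=list)


class CuratedStore:
    """Toy key-value store in which updates await a custodian's decision."""

    def __init__(self, custodian: str) -> None:
        self.custodian = custodian
        self.approved: Dict[str, str] = {}            # last approved values
        self.pending: Dict[str, PendingUpdate] = {}   # at most one per key (simplification)

    def update(self, user: str, key: str, value: str) -> None:
        # Any collaborator may propose an update; it is only marked pending.
        self.pending[key] = PendingUpdate(author=user, new_value=value)

    def read(self, key: str) -> Tuple[Optional[str], bool]:
        # Collaborators see the pending value together with a pending flag.
        if key in self.pending:
            return self.pending[key].new_value, True
        return self.approved.get(key), False

    def comment(self, user: str, key: str, text: str) -> None:
        # Comments attach to the pending update, not to the approved value.
        self.pending[key].comments.append(f"{user}: {text}")

    def decide(self, user: str, key: str, approve: bool) -> PendingUpdate:
        # Only the custodian may approve or reject.
        if user != self.custodian:
            raise PermissionError("only the data custodian can decide")
        upd = self.pending.pop(key)
        if approve:
            self.approved[key] = upd.new_value
            upd.status = Status.APPROVED
        else:
            upd.status = Status.REJECTED  # the approved value stays unchanged
        return upd
```

In this sketch, a collaborator calls update, others call read and comment while the item is pending, and the custodian calls decide. A rejection simply discards the pending value, so readers fall back to the last approved one.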

Metadata
Title
AUDIT: approving and tracking updates with dependencies in collaborative databases
Authors
Khaleel Mershad
Qutaibah M. Malluhi
Mourad Ouzzani
Mingjie Tang
Michael Gribskov
Walid G. Aref 
Publication date
21-09-2017
Publisher
Springer US
Published in
Distributed and Parallel Databases / Issue 1/2018
Print ISSN: 0926-8782
Electronic ISSN: 1573-7578
DOI
https://doi.org/10.1007/s10619-017-7208-y
