skip to main content
research-article
Free Access

Managing scientific data

Authors Info & Claims
Published:01 June 2010Publication History
Skip Abstract Section

Abstract

Needed are generic, rather than one-off, DBMS solutions automating storage and analysis of data from scientific collaborations.

References

  1. Bruno, N. and Chaudhuri, S. Online autoadmin: Physical design tuning. In Proceedings of the ACM International Conference on Management of Data (Beijing, June 11--14). ACM Press, New York, 1067--1069. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Buneman, P., Chapman, A., and Cheney, J. Provenance management in curated databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Chicago, June 27--29). ACM Press, New York, 2006, 539--550. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Buneman, P., Khanna, S., Tajima, K., and Tan, W. Archiving scientific data. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Madison, WI, June 3--6). ACM Press, New York, 2002, 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chervenak, A.L., Schuler, R., Ripeanu, M., Amer, M.A., Bharathi, S., Foster, I., Iamnitchi, A., and Kesselman, C. The Globus Replica Location Service: Design and experience. IEEE Transactions on Parallel Distributed Systems 20, 9 (Sept. 2009), 1260--1272. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cohen, S., Hurley, P., Schulz, K.W., Barth, W.L., and Benton, B. Scientific formats for object-relational database systems: A study of suitability and performance. SIGMOD Records 35, 2 (June 2006), 10--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cudre-Mauroux, P., Kimura, H., Lim, K., Rogers, J., Simakov, R., Soroush, E., Velikhov, P., Wang, D.L., Balazinska, M., Becla, J., DeWitt, D., Heath, B., Maier, D., Madden, S., Patel, J., Stonebraker, M., and Zdonik, S. A demonstration of SciDB: A science-oriented DBMS. Proceedings of VLDB Endowment 2, 2 (Aug. 2009), 1534--1537. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Davidson, S.B. and Freire, J. Provenance and scientific workflows: Challenges and opportunities. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Vancouver, B.C., June 9--12). ACM Press, New York, 1345--1350. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Gray, J. and Thomson, D. Supporting Finite-Element Analysis with a Relational Database Backend, Parts i--iii. MSR-TR-2005-49, MSR-TR-2006-21, MSR-TR-2005-151. Microsoft Research, Redmond, WA, 2005.Google ScholarGoogle Scholar
  9. Hey, T., Tansley, S., and Tolle, K. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft, Redmond, WA, Oct. 2009.Google ScholarGoogle Scholar
  10. Ilyas, I.F., Markl, V., Haas, P., Brown, P., and Aboulnaga, A. CORDS: Automatic discovery of correlations and soft functional dependencies. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Paris, June 13--18). ACM Press, New York, 2004, 647--658. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kunszt, P.Z., Szalay, A.S., and Thakar, A.R. The hierarchical triangular mesh. In Proceedings of the MPA/ESO/MPE Workshop (Garching, Germany, July 31--Aug. 4). Springer, Berlin, 2000, 631--637.Google ScholarGoogle Scholar
  12. Liu, D.T., Franklin, M.J., Abdulla, G.M., Garlick, J., and Miller, M. Data-preservation in scientific workflow middleware. In Proceedings of the 18th International Conference on Scientific and Statistical Database Management (July 3--5). IEEE Computer Society, Washington, DC, 2006, 49--58. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Mishra, C. and Koudas, N. A lightweight online framework for query progress indicators. In Proceedings of the IEEE International Conference on Data Engineering (Istanbul, Apr. 15--20). IEEE Press, 1292--1296.Google ScholarGoogle Scholar
  14. Müller, A. and Sternberg, M. Structural annotation of the human genome. In Proceedings of the German Conference on Bioinformatics (Braunschweig, Germany, Oct. 7--10). German Research Center for Biotechnology, Braunschweig, 2001, 211--212.Google ScholarGoogle Scholar
  15. Ngu, A.H., Bowers, S., Haasch, N., McPhillips, T., and Critchlow, T. Flexible scientific workflow modeling using frames, templates, and dynamic embedding. In Proceedings of the 20th International Conference on Scientific and Statistical Database Management (Hong Kong, July 9--11). Springer-Verlag, Berlin, Heidelberg, 2008, 566--572. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Office of Data Management Challenge.Report from the DOE Office of Science Data Management Workshops, Mar.--May 2004; http://www.er.doe.gov/ascr/ProgramDocuments/Docs/Final-report-v26.pdfGoogle ScholarGoogle Scholar
  17. Olston, C., Reed, B., Srivastava, U., Kumar, R., and Tomkins, A. Pig Latin: A not-so-foreign language for data processing. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Vancouver, B.C., June 9--12). ACM Press, New York, 2008, 1099--1110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Papadomanolakis, S., Dash, D., and Ailamaki, A. Efficient use of the query optimizer for automated physical design. In Proceedings of the 33rd International Conference on Very Large Data Bases (Vienna, Austria, Sept. 23--27). VLDB Endowment, 2007, 1093--1104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Papadomanolakis, S., Ailamaki, A., Lopez, J.C., Tu, T., O'Hallaron, D.R., and Heber, G. Efficient query processing on unstructured tetrahedral meshes. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Chicago, June 27--29). ACM Press, New York, 2006, 551--562. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Pike, R., Dorward, S., Griesemer, R., and Quinlan, S. Interpreting the data: Parallel analysis with Sawzall. Scientific Programming 13, 4 (Oct. 2005), 277--298. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Shahabi, C., Jahangiri, M., and Banaei-Kashani, F. ProDA: An end-to-end wavelet-based OLAP system for massive datasets. Computer 41, 4 (Apr. 2008), 69--77. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Spaccapietra, S., Parent, C., Damiani, M.L., de Macedo, J. A., Porto, F., and Vangenot, C. A conceptual view on trajectories. Data Knowledge Engineering 65, 1 (Apr. 2008), 126--146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Stonebraker, M., Bear, C., Çetintemel, U., Cherniack, M., Ge, T., Hachem, N., Harizopoulos, S., Lifter, J., Rogers, J., and Zdonik, S.B. One size fits all? Part 2: Benchmarking studies. In Proceedings of the Conference on Innovative Data Systems Research (Asilomar, Jan. 7--10, 2007), 173--184.Google ScholarGoogle Scholar
  24. Szalay, A., Bell, G., VandenBerg, J., et al. GrayWulf: Scalable clustered architecture for data-intensive computing. In Proceedings of the Hawaii International Conference on System Sciences (Waikoloa, Jan. 5--8). IEEE Computer Society Press, 2009, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Managing scientific data

                    Recommendations

                    Comments

                    Login options

                    Check if you have access through your login credentials or your institution to get full access on this article.

                    Sign in

                    Full Access

                    • Published in

                      cover image Communications of the ACM
                      Communications of the ACM  Volume 53, Issue 6
                      June 2010
                      148 pages
                      ISSN:0001-0782
                      EISSN:1557-7317
                      DOI:10.1145/1743546
                      Issue’s Table of Contents

                      Copyright © 2010 ACM

                      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                      Publisher

                      Association for Computing Machinery

                      New York, NY, United States

                      Publication History

                      • Published: 1 June 2010

                      Permissions

                      Request permissions about this article.

                      Request Permissions

                      Check for updates

                      Qualifiers

                      • research-article
                      • Popular
                      • Refereed

                    PDF Format

                    View or Download as a PDF file.

                    PDF

                    eReader

                    View online with eReader.

                    eReader

                    HTML Format

                    View this article in HTML Format .

                    View HTML Format