skip to main content
10.1145/253260.253322acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article
Free Access

High-performance sorting on networks of workstations

Published:01 June 1997Publication History

ABSTRACT

We report the performance of NOW-Sort, a collection of sorting implementations on a Network of Workstations (NOW). We find that parallel sorting on a NOW is competitive to sorting on the large-scale SMPs that have traditionally held the performance records. On a 64-node cluster, we sort 6.0 GB in just under one minute, while a 32-node cluster finishes the Datamation benchmark in 2.41 seconds.

Our implementations can be applied to a variety of disk, memory, and processor configurations; we highlight salient issues for tuning each component of the system. We evaluate the use of commodity operating systems and hardware for parallel sorting. We find existing OS primitives for memory management and file access adequate. Due to aggregate communication and disk bandwidth requirements, the bottleneck of our system is the workstation I/O bus.

References

  1. 1.R. C. Agarwal. A Super Scalar Sort Algorithm for RISC Processors. In Proceedings of the 1996 ACM SIGMOD Conference, pages 240-246, June 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.T. E. Anderson, D. E. Culler, and D. A. Patterson. A Case for NOW (Networks of Workstations). IEEE Micro, Feb. 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.R. H. Arpaci, A. C. Dusseau, A. M. Vahdat, L. T. Liu, T. E. Anderson, and D. A. Patterson. The Interaction of Parallel and Sequential Workloads on a Network of Workstations. In Proceedings qf SIGMETRICS/Performance "95, pages 267-78, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.C. Baru, G. Fecteau, A. Goyal, H. Hsiao, A. Jhnigran, S. Padmanabhan, and W. Wilson. An Overview of DB2 Parallel Edition. in Proceedings of 1995 SIGMOD International Conference on Management of Data, San Jose, CA, May 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.A. Basu, V. Buch, W. Vogels, and T. yon Eicken. U-Net: A User-Level Network Interface for Parallel and Distributed Computing. In Proceedings of the l 5th ACM Symposium on Operating Systen~v Principles, Copper Mountain, Colorado, Dec. 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.B. Baugsto, J. Greipsland, and J. Kamerbeek. Sorting Large Data Files on POMA. In Proceedings ofCOMPAR-90 VAPPIV, pages 536-547, Sept. 1990. Springer Vedag Lecture Notes No. 357. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.M. Beck, D. Bitton, and W. K. Wilkinson. Sorting Large Files on a Backend Multiprocessor. Technical Report 86-741, Department of Computer Science, Comell University, Mar. 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.G. Blelloch, C. Leiserson, and B. Maggs. A Comparison of Sorting Algorithms for the Connection Machine CM-2. In Symposium on Parallel Algorithms and Architectures, July 199 I. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.M. A. Blumrich, K. Li, R. Alpert, C. Dubnicki, E. W. Felten, and J. Sandberg. Virtual Memory Mapped Network Interface for the SHRIMP Multicomputer. In Proceedings of the International Symposium on Computer Architecture, pages 142-153, Apr. 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.N. Boden, D. Cohen, R. E. Felderman, A. Kulawik, and C. Seitz. Myrinet: A Gigabit-per-second Local Area Network. IEEE Micro, Feb. 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.H. Boral, W. Alexander, L. Clay, G. Copeland, et al. Prototyping Bubba, a Highly Parallel Database System. IEEE Transactions on Knowledge and Data Engineering, 2(1):4--24, Mar. 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S. Lumetta, T. von Eicken, and K. Yelick. Parallel Programming in Split-C. In Supercomputing '93, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13.D. Culler, L. T. Liu, R. Martin, and C. Yoshikawa. LogP Performance Assessment of Fast Network Interfaces. IEEE Micro, Feb. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.D. DeWitt, S. Ghandeharizadeh, D. Schneider, A. Bdcker, et al. The Gamma Database Machine Project. IEEE Transactions on Knowledge and Data Engineering, 2( l ):44-62, Mar. 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.D. Dewitt, J. Naughton, and D. Schneider. Parallel Sorting on a Shared-Nothing Architecture using Probabilistic Splitting. In Proceedings of the International Conference on Parallel and Distributed Information Systmes, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.A.C. Dusseau, D.E. Culler, K. E. Schauser, and R. P. Martin. Fast Parallel Sorting Under LogP: Experience with the CM-5. IEEE Transactions on Parallel and Distributed Systems, 7(8):791-805, Aug. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.A. et. al. A Measure of Transaction Processing Power. Datamation, 31 (7): 112-118, 1985. Also in Readings in Database Systems, M.H. Stonebraker ed., Morgan Kaufmann, San Mateo, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.B. Gerber. Informix Online XPS. In Proceedings of 1995 SIG- MOD International Conference on Management o.f Data, San Jose, CA, May 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.D. P. Ghormley, D. Petrou, A. M. Vahdat, and T. E. Anderson. GLUnix: A Global Layer Unix for NOW. http://now.cs.berkeley.edu/Glunix/glunix.html.Google ScholarGoogle Scholar
  20. 20.G. Graefe. Volcano: An Extensible and Parallel Dataflow Query Processing System. Technical report, Oregon Graudate Center, June 1989.Google ScholarGoogle Scholar
  21. 21.G. Graefe. Parallel External Sorting in Volcano. Technical Report CU-CS-459, Computer Science, University of Colorado at Boulder, June 1990.Google ScholarGoogle Scholar
  22. 22.M. D. Hill, J. R. Larus, S. Reinhardt, and D. A. Wood. Cooperative-Shared Memory: Software and Hardware for Scalable Multiprocessors. ACM Transactions on Computer Systems, 11(4):300-318, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. 23.C. A. R. Hoare. Quicksort. Computer Journal, 5( 1): 10-15,1962.Google ScholarGoogle Scholar
  24. 24.S. Kleiman, J. Voll, J. Eykholt, A. Shivalingiah, D. Williams, M. Smith, S. Barton, and G. Skinner. Symmetric Multiprocessing in Solaris 2.0. In Proceedings of COMPCON Spring '92, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. 25.X. Li, G. Linoff, S. Smith, C. Stanfill, and K. Thearling. A Practical External Sort for Shared Disk MPPs. In Proceedings of SUPERCOMPUTING "93, pages 666-675, Nov. 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. 26.C. Nyberg, T. Barclay, Z. Cvetanovic, J. Gray, and D. Lomet. AlphaSort: A RISC Machine Sort. In Proceedings of 1994 ACM SIGMOD Conference, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. 27.B. Salzberg, A. Tsukerman, J. Gray, M. Stewart, S. Uren, and B. Vaughna. FastSort; A Distributed Single-Input Single-Output External Sort. SIGMOD Record, 19(2):94-I01,June 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. 28.M. Stonebraker. Operating System Support for Database Management. Communications of the ACM, 24(7):412-418, July 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. 29.M. Stonebraker. The Case for Shared Nothing. Database Engineering, 9(1 ), 1986.Google ScholarGoogle Scholar
  30. 30.A. Sweeney, D. Doucette, W. Hu, C. Anderson, M. Nishimoto, and G. Peck. Scalability in the XFS File System. in Proceedings of the USENIX 1996 Annual Technical Conference, Jan. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. 31.Tandem Performance Group. A Benchmark of NonStop SQL on Debit-Credit Transactions. In Proceedings of SIGMOD International Conference on Managament of Data, Chicago, IL, June 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. 32.Teradata Corporation. DBC/IO12 Data Base Computer System Manual, release 2.0 edition, Nov. 1985. Document Number c 10- 0001-02.Google ScholarGoogle Scholar
  33. 33.T. yon Eicken, D. E. Culler, S. C. Goldstein, and K. E. Schauser. Active Messages: a Mechanism for Integrated Communication and Computation. In Proceedings of the 19th Annual Symposium on Computer Architecture, Gold Coast, Australia, May 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. 34.H. Young and A. Swami. The Parameterized Round-Robin Partitioned Algorithm for Parallel External Sort. In Proceedings 9th international Parallel Processing Symposium, pages 213-219, Santa Barbara, CA, Apr. 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. 35.M. Zagha and G. Blelloch. Radix Sort for Vector Multiprocessors. In Supercomputing, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. 36.W. Zhang and P. Larson. A Memory-AdaptiveSort (MASORT) for Database Systems. In Proceedings of CASCON "96, Toronto, Nov. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. High-performance sorting on networks of workstations

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              SIGMOD '97: Proceedings of the 1997 ACM SIGMOD international conference on Management of data
              June 1997
              594 pages
              ISBN:0897919114
              DOI:10.1145/253260

              Copyright © 1997 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 June 1997

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              SIGMOD '97 Paper Acceptance Rate42of202submissions,21%Overall Acceptance Rate785of4,003submissions,20%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader