skip to main content
article
Free Access

An algorithm for multidimensional data clustering

Published:01 June 1988Publication History
Skip Abstract Section

Abstract

A new divisive algorithm for multidimensional data clustering is suggested. Based on the minimization of the sum-of-squared-errors, the proposed method produces much smaller quantization errors than the median-cut and mean-split algorithms. It is also observed that the solutions obtained from our algorithm are close to the local optimal ones derived by the k-means iterative procedure.

References

  1. 1 BENTLEY, J. L. Multidimensional divide-and-conquer. Commun. ACM 23, 4 (Apr. 1980), 214-229. Google ScholarGoogle Scholar
  2. 2 BENTLEY, J. L., AND FRIEMAN, J.H. Data structure for range searching. A CM Comput. Surv. 11, 4 (Dec. 1979), 397-409. Google ScholarGoogle Scholar
  3. 3 DUDA, R. O., AND HART, P.E. Pattern Classification and Scene Analysis. Wiley, New York, 1973.Google ScholarGoogle Scholar
  4. 4 FRIEMAN, J. H., BENTLEY, J. L., AND FINKEL, R.A. An algorithm for finding best matches in logarithmic expected time. ACM Trans. Math. So{tw. 3, 3 (Sept. 1977), 209-226. Google ScholarGoogle Scholar
  5. 5 HALL, E.L. Computer Image Processing and Recognition. Academic Press, New York, 1979.Google ScholarGoogle Scholar
  6. 6 HARTIGAN, J.A. Clustering Algorithms. Wiley, New York, 1975. Google ScholarGoogle Scholar
  7. 7 HECKBERT, P. Color image quantization for frame buffer display. ACM Trans. Comput. Gr. 16, 3 (July 1982), 297-307. Google ScholarGoogle Scholar
  8. 8 HYAFIL, L., AND RIVEST, R.L. Construction optimal binary decision trees is NP-complete. In./. Process. Lett. 5 (May 1976), 15-17.Google ScholarGoogle Scholar
  9. 9 MACQUEEN, J.B. Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkley Symposium on Mathematical Statistics and Probability I (1967), 281-297,Google ScholarGoogle Scholar
  10. 10 SELIM, S, Z., AND ISMAIL, M.A. K-means-type algorithms: A generalized convergence theorem and characterization of local optimality. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-6, 1 (1984), 81-87.Google ScholarGoogle Scholar
  11. 11 WAN, S. J., WONG, S, K. M., AND PRUSINKIEWICZ, P. Variance-based color image quantization for frame buffer display. Submitted for publication.Google ScholarGoogle Scholar
  12. 12 WON(~, S. K. M., WAN, S. J., AND PRUSINKIEWICZ, P. Monochrome image quantization. Submitted for publication.Google ScholarGoogle Scholar
  13. 13 Wu, X., AND WITTEN, }. H. A fast k-means type clustering algorithm. Dept. Computer Science, Univ. of Calgary, Canada, May 1985.Google ScholarGoogle Scholar

Index Terms

  1. An algorithm for multidimensional data clustering

    Recommendations

    Reviews

    Florian Petrescu

    The authors introduce a new and promising multivariate data clustering algorithm. They adopt a divisive strategy, that is, a procedure that partitions the input data space sequentially into a number of disjoint subregions. After reviewing several well-known clustering techniques, namely the median-cut, mean-split, and k-means algorithms, they present their method. The clustering algorithm has to make two important decisions at each step while partitioning the input data space: first, which hyperbox should be partitioned and, second, which hyperplane is appropriate to subdivide the hyperbox. Both decisions are based on minimizing the sum-of-squared-errors. Finally, the paper compares the performance of the algorithm and the above-mentioned clustering techniques on three collections of data of different dimensions from a color image database. The new clustering algorithm seems to perform better than the previously known methods.

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Mathematical Software
      ACM Transactions on Mathematical Software  Volume 14, Issue 2
      June 1988
      80 pages
      ISSN:0098-3500
      EISSN:1557-7295
      DOI:10.1145/45054
      Issue’s Table of Contents

      Copyright © 1988 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 June 1988
      Published in toms Volume 14, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader