Arithmetic coding for data compression

Authors:
Ian H. Witten

Dept. of Computer Science, The University of Calgary, 2500 University Drive NW, Calgary, Canada T2N 1N4

Dept. of Computer Science, The University of Calgary, 2500 University Drive NW, Calgary, Canada T2N 1N4
View Profile

,
Radford M. Neal

Dept. of Computer Science, The University of Calgary, 2500 University Drive NW, Calgary, Canada T2N 1N4

Dept. of Computer Science, The University of Calgary, 2500 University Drive NW, Calgary, Canada T2N 1N4
View Profile

,
John G. Cleary

Dept. of Computer Science, The University of Calgary, 2500 University Drive NW, Calgary, Canada T2N 1N4

Dept. of Computer Science, The University of Calgary, 2500 University Drive NW, Calgary, Canada T2N 1N4
View Profile

Authors Info & Claims

Communications of the ACM Volume 30 Issue 6June 1987pp 520–540https://doi.org/10.1145/214762.214771

Published:01 June 1987Publication History

Communications of the ACM

Abstract

The state of the art in data compression is arithmetic coding, not the better-known Huffman method. Arithmetic coding gives greater compression, is faster for adaptive models, and clearly separates the model from the channel encoding.

References

1 Abramson, N. Information Theory and Coding. McGraw-Hill, New York, 1963. This textbook contains the first reference to what was to become the method of arithmetic coding (pp. 61-62).Google Scholar
2 Bentley. J.L., Sleator, D.D., Tarjan, R.E., and Wei, V.K. A locally adaptive data compression scheme. Commun. ACM 29, 4 (Apr. 1986), 320-330. Shows how recency effects can be incorporated explicitly into a text compression system. Google ScholarDigital Library
3 Cleary, J.C., and Witten, I.H. A comparison of enumerative and adaptive codes. IEEE Trans. Inf. Theory IT-30,2 (Mar. 1984). 306-315. Demonstrates under quite general conditions that adaptive coding outperforms the method of calculating and transmitting an exact model of the message first.Google ScholarCross Ref
4 Clearv. I.G. and Witten. I.H. Data comoression using adaptive coding and'partial string matching. IEEE T>ans. Commu~ C&-32,4 (Apr. 1984). 396-402. Presents an adaptive modeling method that reduces a large sample of mixed-case English text to around 2.2 bits/character when arithmetically coded.Google Scholar
5 Cormack, G.V., and Horspool, R.N. Algorithms for adaptive Huffman codes. Ir$ Process. Lett. 18, 3 (Mar. 19841, 159-166. Describes how adaptive Huffman coding can be implemented efficiently. Google ScholarDigital Library
6 Cormack, G.V., and Horspool, R.N. Data compression using dynamic Markov modeling. Res. Rep., Computer Science Dept., Univ. of Walerloo, Ontario, Apr. 1985. Also to be published in Cornput. 1. Presents an adaptive state-modeling technique that, in conjunction with arithmetic coding, produces results competitive with those of {4}Google Scholar
7 Gallager, R.G. Variations on a theme by Huffman. IEEE Trans. tnf Theory IT-24, 6 (Nov. 19781, 668-674. Presents an adaptive Huffman coding algorithm, and derives new bounds on the redundancy of Huffman codes.Google Scholar
8 Held, G. Data Compression: Techniques and Applications. Wiley, New York, 1984. Explains a number of ad hoc techniques for compressing text. Google ScholarDigital Library
9 Hester, J.H., and Hirschberg, D.S. Self-organizing linear search. ACM Comput. SKI-V. 17, 3 (Sept. 1985), 295-311. A general analysis of the technique used in the present article to expedite access to an array of dynamically changing frequency counts. Google ScholarDigital Library
10 Huffman, D.A. A method for the construction of minimumredundancy codes. Proc. Inst. Electr. Radio Eng. 40, 9 (Sept. 1952), 1098-1101. The classic paper in which Huffman introduced his famous coding method.Google Scholar
11 Hunter, R., and Robinson, A.H. International digital facsimile coding standards. Proc. Inst. Electr. Electron. Eng. 68, 7 (July 1980), 854-867. Describes the use of Huffman coding to compress run lengths in black/white images.Google ScholarCross Ref
12 Kucera, H., and Francis, W.N. Compufational Analysis of Present-Day American English. Brown University Press, Providence, R.I. 1967. This large corpus of English is often used for computer experiments with text, including some of the evaluation reported in the present article.Google Scholar
13 Langdon, G.G. An introduction to arithmetic coding. IBMI. Res. Dev. 28, 2 (Mar. 1984), 135-149. Introduction to arithmetic coding from the point of view of hardware implementation.Google ScholarDigital Library
14 Langdon, G.G., and Rissanen, J. Compression of black-white images with arithmetic coding. 1EEE Trans. Commun. COM 29,6 (June 1981), 858.-867. Uses a modeling method specially tailored to black/white pictures, in conjunction with arithmetic coding, to achieve excellent compression results.Google Scholar
15 Pasco, R. Source coding algorithms for fast data compression. Ph.D. thesis, Dept. of Electrical Engineering, Stanford Univ., Stanford, Calif., 1976. An early exposition of the idea of arithmetic coding, but lacking the idea of incremental operation. Google ScholarDigital Library
16 Rissanen, J.J. Generalized Kraft inequality and arithmmatic coding. IBM 1. Res. Dev. 20 (May 1976), 198-203. Another early exposition of the idea of arithmetic coding.Google ScholarDigital Library
17 Rissanen. J.J. Arithmetic codings as number representations. Acta Polytech. Stand. Math. 31 (Dec. 1979), 44-51. Further develops arithmetic coding as a practical technique for data representation.Google Scholar
18 Rissanen, J., and Langdon, G.G. Arithmetic coding. IBM J. Res. Dev. 23, 2 (Mar. 1979). 149-162. Describes a broad class of arithmetic codes.Google ScholarDigital Library
19 Rissanen, J. and Langdon, G.G. Universal modeling and coding. IEEE Trans. Znf Theory IT-27, 1 (Jan. 1981), 12-23. Shows how data cornpression can be separated into modeling for prediction and coding with respect to a model.Google ScholarDigital Library
20 Rubin, F. Arithmetic stream coding using fixed precision registers. IEEE Trans. If. Theory IT-25, 6 (Nov. 1979), 672-675. One of the first papers to present all the essential elements of practical arithmetic coding, including fixed-point computation and incremental operation.Google ScholarDigital Library
21 Shannon, C.E. and Weaver, W. The Mathematical Theory of Communication. University of Illinois Press, Urbana, Ill., 1949. A classic book that develops communication theory from the ground up. Google ScholarDigital Library
22 Welch, T.A. A technique for high-performance data compression. Computer 17, 6 (June 1984), 8-19. A very fast coding technique based on the method of {23}, but whose compression performance is poor by the standards of {4} and {6} An improved implementation of this method is widely used in UNIX systems under the name compress.Google ScholarDigital Library
23 Ziv, J., and Lempel, A. Compression of individual sequences via variable-rate coding. IEEE Trans. InJ Theory IT-24, 5 (Sept. 1978), 530-536. Describes a method of text compression that works by replacing a substring with a pointer to an earlier occurrence of the same substring. Although it performs quite well, it does not provide a clear separation between modeling and coding.Google ScholarCross Ref

Index Terms

Arithmetic coding for data compression
1. Information systems
  1. Data management systems
    1. Data structures
      1. Data layout
        Data compression

Recommendations

Block arithmetic coding for source compression

We introduce “Block Arithmetic Coding” (BAC), a technique for entropy coding that combines many of the advantages of ordinary stream arithmetic coding with the simplicity of block codes. The code is variable length in to fixed out (V to F), ...
Read More
Connectivity compression for non-triangular meshes by context-based arithmetic coding
GRAPHITE '06: Proceedings of the 4th international conference on Computer graphics and interactive techniques in Australasia and Southeast Asia

In this article, we present an efficient algorithm for encoding the connectivity information of general polygon meshes. The algorithm is a single-resolution lossless compression method for meshes, mainly for non-triangular meshes. In comparison with the ...
Read More
Reversible arithmetic coding for quantum data compression

We study the problem of compressing a block of symbols (a block quantum state) emitted by a memoryless quantum Bernoulli source. We present a simple-to-implement quantum algorithm for projecting, with high probability, the block quantum state onto the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
Communications of the ACM Volume 30, Issue 6
June 1987
92 pages
ISSN:0001-0782
EISSN:1557-7317
DOI:10.1145/214762
Editor:
Peter J. Denning
Issue’s Table of Contents
Copyright © 1987 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 June 1987
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1,994
  Total Citations
  View Citations
- 11,762
  Total Downloads
- Downloads (Last 12 months)1,769
- Downloads (Last 6 weeks)528
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Arithmetic coding for data compression

Communications of the ACM

Abstract

References

Cited By

Index Terms

Recommendations

Block arithmetic coding for source compression

Connectivity compression for non-triangular meshes by context-based arithmetic coding

Reversible arithmetic coding for quantum data compression