2.5 Practical entropy coding techniques

Image coding Page 1 / 2

This module introduces practical entropy coding techniques, such as Huffman Coding, Run-length Coding (RLC) and Arithmetic Coding.

In the module of Use of Laplacian PDFs in Image Compression we have assumed that ideal entropy coding has been used in order to calculate the bitrates for the coded data. In practise we must use real codes and we shall now see how this affects the compression performance.

There are three main techniques for achieving entropy coding:

Huffman Coding - one of the simplest variable length coding schemes.
Run-length Coding (RLC) - very useful for binary data containing long runs of ones of zeros.
Arithmetic Coding - a relatively new variable length coding scheme that can combine the best features ofHuffman and run-length coding, and also adapt to data with non-stationary statistics.

We shall concentrate on the Huffman and RLC methods for simplicity. Interested readers may find out more aboutArithmetic Coding in chapters 12 and 13 of the JPEG Book.

First we consider the change in compression performance if simple Huffman Coding is used to code the subimages of the4-level Haar transform.

The calculation of entropy in this equation from our discussion of entropy assumed that each message with probability $p_{i}$ could be represented by a word of length $_{i} 2 logbase --> p_{i}$ bits. Huffman codes require the $_{i}$ to be integers and assume that the $p_{i}$ are adjusted to become:

\hat{p_{i}} 2 _{i}

where the

_{i}

are integers, chosen subject to the constraint that

i \hat{p_{i}} 1

(to guarantee that sufficient uniquely decodable code words are available) and such that the mean Huffman word length(Huffman entropy),

\hat{H} i p_{i} _{i}

, is minimised.

We can use the probability histograms which generated the entropy plots in figures of level 1 energies , level 2 energies , level 3 energies and level 4 energies to calculate the Huffman entropies $\hat{H}$ for each subimage and compare these with the true entropies to see the loss in performance caused by using realHuffman codes.

An algorithm for finding the optimum codesizes $_{i}$ is recommended in the JPEG specification [ the JPEG Book , Appendix A, Annex K.2, fig K.1]; and a Mathlab M-file toimplement it is given in M-file code .

Comparison of entropies (columns 1, 3, 5) and Huffman coded bit rates (columns 2, 4, 6) for the original (columns 1 and2) and transformed (columns 3 to 6) Lenna images. In columns 5 and 6, the zero amplitude state is run-length encoded toproduce many states with probabilities<0.5.

Numerical results used in the figure - entropies and bit rates of subimages for qstep=15
Column:	1	2	3	4	5	6	-
			0.0264	0.0265	0.0264	0.0266
			0.0220	0.0222	0.0221	0.0221	Level 4
			0.0186	0.0187	0.0185	0.0186
			0.0171	0.0172	0.0171	0.0173	-
			0.0706	0.0713	0.0701	0.0705
			0.0556	0.0561	0.0557	0.0560	Level 3
	3.7106	3.7676	0.0476	0.0482	0.0466	0.0471	-
			0.1872	0.1897	0.1785	0.1796
			0.1389	0.1413	0.1340	0.1353	Level 2
			0.1096	0.1170	0.1038	0.1048	-
			0.4269	0.4566	0.3739	0.3762
			0.2886	0.3634	0.2691	0.2702	Level 1
			0.2012	0.3143	0.1819	0.1828	-
Totals:	3.7106	3.7676	1.6103	1.8425	1.4977	1.5071

shows the results of applying this algorithm to the probability histograms and lists the same results numerically for ease of analysis. Columns 1 and 2 compare theideal entropy with the mean word length or bit rate from using a Huffman code (the Huffman entropy) for the case of theuntransformed image where the original pels are quantized with $Q_{step} 15$ . We see that the increase in bit rate from using the real code is: $3.7676 3.7106 1 1.5 %$ But when we do the same for the 4-level transformed subimages, we get columns 3 and 4. Here we see thatreal Huffman codes require an increase in bit rate of: $1.8425 1.6103 1 14.4 %$ Comparing the results for each subimage in columns 3 and 4, wesee that most of the increase in bit rate arises in the three level-1 subimages at the bottom of the columns. This is becauseeach of the probability histograms for these subimages (see figure ) contain one probability that is greater than 0.5. Huffman codes cannot allocate a word length ofless than 1 bit to a given event, and so they start to lose efficiency rapidly when $2 logbase --> p_{i}$ becomes less than 1, ie when $p_{i} 0.5$ .

<< Chapter < Page Page > Chapter >>

Read also:

Get Jobilize Job Search Mobile App in your pocket Now!

100% Free Mobile Applications
Receive real-time job alerts and never miss the right job again

Source: OpenStax, Image coding. OpenStax CNX. Jan 22, 2004 Download for free at http://cnx.org/content/col10206/1.3

Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Image coding' conversation and receive update notifications?

Ask

	10 AP Key Terms 10 Muscle Tissue Key Terms By OpenStax Start Key Terms
	Microeconomics By OpenStax Read Online Course
	11 Biology 11 Meiosis Sexual Reproduction MCQ By OpenStax Start Quiz
	NCE Ch 04 Social and Cultural Foundations By Anh Dao Start Quiz
	English Composition 2 Final Practice By Madison Christian Start Test
	36 Biology 36 Sensory Systems MCQ By OpenStax Start Quiz
	Cardiac Electrophysiology Basic 2 By Mistry Bhavesh Start Test
	22 Muscle and Pancreas Bio Path quiz By Brooke Delaney Start Exam
	22 AP 22 Respiratory System MCQ By OpenStax Start Quiz
©flickr: U.S.	Molecular Cellular Biology By Ann Schlosser Start Quiz