Entropy | Harder, Better, Faster, Stronger

Universal Coding (Part II)

03/12/2013

Last time, we presented universal codes and two simple code. This week, let’s revisit the topic and present Elias codes, which are much better universal codes than Chaitin’s and modified Chaitin codes.

We will use the idea of recursion introduced before, but we will push the idea further, getting codes with lengths $O(\lg n + \lg\lg n + \lg\lg\lg n + \cdots)$ , which are pretty much as good as it gets.

Read the rest of this entry »

1 Comment | algorithms, data compression, Mathematics | Tagged: 1948, Chaitin, Elias, Entropy, Shannon, universal code | Permalink
Posted by Steven Pigeon

Lossless Coding of CD Audio

17/01/2012

Once upon a time, I discussed how to pick bit-rate for MP3, while considering re-ripping all my CDs. But if I’m to re-rip everything, I might as well do it one last time and use lossless compression.

In this post, we’ll discuss the simple script I cooked up to do just that, and a bit on how Flac works, and how it compares to MP3.

Read the rest of this entry »

1 Comment | algorithms, Bash (Shell), data compression, embedded programming, Mathematics, programming | Tagged: CD, Entropy, Golomb Codes, GSM, Linear Prediction, music, psychoacoustic model, psychoacoustics, sound, Speech, Speech Codec | Permalink
Posted by Steven Pigeon

Fractional Bits (Part I)

01/11/2011

Some time ago, I discussed Huffman codes, how they were generated, and how you could (de)code information with it. I also said that they were optimal under various conditions, one of which (that I may or may not have mentioned) is that you have an integer number of bits.

Coding with an non-integer number of bits is counter-intuitive, but it is entirely possible to do so. There are in fact many ways to do so, but let’s start easy and ignore the frequency of occurrence of symbols for now.

Read the rest of this entry »

6 Comments | algorithms, bit twiddling, data compression | Tagged: binary encoding, Entropy, entropy coding, Huffman Codes, phase-in codes | Permalink
Posted by Steven Pigeon

Huffman Codes

17/05/2011

Some time ago, I presented a piece on compressing voxel worlds and I just realized that I discussed different types of variable length codes quite a few times, but that I never took the time to present you the basic method of Huffman coding!

The problem of finding an optimal variable length code is to find an uniquely decodable binary code (that is, a code using only 0 and 1 for which there exists only one way of decoding a message) that encodes each symbol in a number of bits that is proportional to its probability of occurrence. A first strategy was devised by Robert Fano, Huffman’s teacher at one point. The so-called Shannon-Fano code is a top-down approach to solving the problem but it was shown to be suboptimal. The code proposed by David Huffman solves the problem optimally by taking exactly the opposite approach, that is, building the code bottom-up.

Read the rest of this entry »

14 Comments | data compression | Tagged: bit, code, Entropy, Fano, Huffman, Huffman Codes, Prefix code, Shannon, Shannon-Fano Codes, Tree-Structured Code | Permalink
Posted by Steven Pigeon

Harder, Better, Faster, Stronger

Universal Coding (Part II)

Lossless Coding of CD Audio

Fractional Bits (Part I)

Huffman Codes

Le Tweet

Recent Posts

Categories

Archives

Blogroll

RSS