Changeset 3867 for docs

Ignore:
Timestamp:
Jun 12, 2014, 4:00:27 PM (5 years ago)
Message:

Max Unicode (hence UTF-32) value is 0x10FFFF

File:
1 edited

Legend:

Unmodified
 r3665 As we are comparing techniques in practice, we assume that $\Sigma$ is a standard input alphabet, such as ASCII ($\sigma = 128$), UTF-8 ($\sigma = 256$), UTF-16 ($\sigma = 65536$ ), or UTF-32 ($\sigma \approx 4.3 \times 10^9$). UTF-16 ($\sigma = 65536$ ), or UTF-32 ($\sigma = 1114112$). This assumption allows us to equate the number of bits in the encoding of a character (a parameter for the bitstream method) with $\log \sigma$.