Changeset 3867 for docs/Working/re


Timestamp: Jun 12, 2014, 4:00:27 PM
Author: cameron
Message: Max Unicode (hence UTF-32) value is 0x10FFFF

File: 1 edited

  • docs/Working/re/analysis.tex

    --- docs/Working/re/analysis.tex (r3665)
    +++ docs/Working/re/analysis.tex (r3867)
    @@ -35,5 +35,5 @@
     As we are comparing techniques in practice, we assume that $\Sigma$ is a
     standard input alphabet, such as ASCII ($\sigma = 128$), UTF-8 ($\sigma = 256$),
    -UTF-16 ($\sigma = 65536$ ), or UTF-32 ($\sigma \approx 4.3 \times 10^9$).
    +UTF-16 ($\sigma = 65536$ ), or UTF-32 ($\sigma = 1114112$).
     This assumption allows us to equate the number of bits in the encoding of a
     character (a parameter for the bitstream method) with $\log \sigma$.
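
The arithmetic behind the new UTF-32 figure can be checked with a short sketch. It is not part of the changeset; the helper names `alphabet_size` and `bits_needed` are hypothetical and chosen only for illustration. It assumes, as the commit message states, that the largest code point is 0x10FFFF, so the alphabet holds the values 0 through 0x10FFFF inclusive.

```python
import math

# Largest Unicode code point, as stated in the commit message.
MAX_CODE_POINT = 0x10FFFF


def alphabet_size(max_value: int) -> int:
    """Count the distinct values in the inclusive range 0..max_value."""
    return max_value + 1


def bits_needed(sigma: int) -> int:
    """Smallest number of bits that can distinguish sigma distinct values."""
    return (sigma - 1).bit_length()


# Alphabet sizes quoted in analysis.tex; the UTF-32 entry is derived, not hard-coded.
sigmas = {
    "ASCII": 128,
    "UTF-8": 256,
    "UTF-16": 65536,
    "UTF-32": alphabet_size(MAX_CODE_POINT),
}

for name, sigma in sigmas.items():
    print(f"{name}: sigma = {sigma}, "
          f"log2(sigma) = {math.log2(sigma):.2f}, "
          f"bits = {bits_needed(sigma)}")
```

For ASCII, UTF-8, and UTF-16 the value of $\sigma$ is an exact power of two, so $\log_2 \sigma$ is a whole number of bits. For UTF-32 the corrected $\sigma = 1114112 = 17 \times 65536$ is not a power of two, so $\log_2 \sigma$ falls slightly above 20, whereas the previous figure of roughly $4.3 \times 10^9$ corresponded to all $2^{32}$ possible 32-bit values.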