Version 1 (modified by lindanl, 11 years ago)

Character Set Architecture

The Parabix character set architecture is designed to provide high-performance native parsing for a wide variety of character sets. It combines the C++ template mechanism with an object hierarchy for character sets within families.

ASCII vs. EBCDIC

Parabix performs native parsing for character sets based on either ASCII or EBCDIC.

The enumerated type CodeUnit_Base takes either the value ASCII or the value EBCDIC. Used as a template parameter, CodeUnit_Base selects the character codes of the specified base set. These codes are provided by the Ord structure defined in ASCII_EBCDIC.h.
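The following is a minimal sketch of how an enumeration used as a template parameter can select character codes at compile time. It is not the actual content of ASCII_EBCDIC.h: the member names LAngle, RAngle and Ampersand and the helper starts_tag are hypothetical, chosen for illustration only. The EBCDIC values shown are the standard code points for these characters (as in code page 037).

```cpp
// Illustrative sketch only; the real Ord structure in ASCII_EBCDIC.h may differ.
enum CodeUnit_Base { ASCII, EBCDIC };

// Primary template is declared but not defined, so an unsupported
// base set is rejected at compile time.
template <CodeUnit_Base C> struct Ord;

// Hypothetical member names; values are the ASCII code points.
template <> struct Ord<ASCII> {
    static constexpr unsigned char LAngle    = 0x3C;  // '<'
    static constexpr unsigned char RAngle    = 0x3E;  // '>'
    static constexpr unsigned char Ampersand = 0x26;  // '&'
};

// Same hypothetical members; values are the EBCDIC code points.
template <> struct Ord<EBCDIC> {
    static constexpr unsigned char LAngle    = 0x4C;  // '<'
    static constexpr unsigned char RAngle    = 0x6E;  // '>'
    static constexpr unsigned char Ampersand = 0x50;  // '&'
};

// Hypothetical helper: the comparison constant is fixed at compile time
// by the template parameter, so no run-time table lookup is needed.
template <CodeUnit_Base C>
constexpr bool starts_tag(unsigned char code_unit) {
    return code_unit == Ord<C>::LAngle;
}

// Compile-time checks of the selection mechanism.
static_assert(starts_tag<ASCII>(0x3C), "ASCII '<' is 0x3C");
static_assert(starts_tag<EBCDIC>(0x4C), "EBCDIC '<' is 0x4C");
static_assert(!starts_tag<EBCDIC>(0x3C), "0x3C is not '<' in EBCDIC");
```

Under these assumptions, parsing code written once against Ord<C> can be instantiated for either base set, for example starts_tag<EBCDIC>(byte), with the character constants resolved by the compiler rather than at run time.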