Changes between Version 11 and Version 12 of IcGrepUnicodeLevel1


Ignore:
Timestamp:
Jan 8, 2015, 9:46:53 AM (4 years ago)
Author:
cameron
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • IcGrepUnicodeLevel1

    v11 v12  
    2929Following Perl syntactic conventions, negated forms of property expressions (matching all values not
    3030having the specified property) use the {{{\P}}} syntax.
     31
     32=== RL1.2a Compatibility Properties ===
     33
     34Except for {{{\X}}} for extended grapheme clusters,
     35icGrep implements the full set of compatibility properties in Annex C of TR#18, following
     36Unicode definitions in preference to Posix definitions.   Posix bracket expressions such
     37as {{{[:punct:]}}} may be used within character classes.   
    3138
    3239=== 1.2.1 General_Category ===
     
    7784== RL1.4 Simple Word Boundaries ==
    7885
    79 {{{\b}}}: description to be written
     86{{{\b}}}: icGrep implements zero-width word-boundary assertions using the Unicode definition of word characters.
    8087
    8188== RL1.5 Simple Loose Matches ==
    8289
    83 {{{(?i)}}} description to be written
     90icGrep implements case-insensitive matching using Unicode simple case-folding rules.
     91Case-insenstive matching is specified with the {{{-i}}} command-line
     92parameter or applied within individual parts of a regular expression using the
     93{{{(?i)}}} and {{{(?i:<regexp>)}}} notations.    Case-insensitive matching applies to
     94literally and numerically specified characters; icgrep does not apply case-insensitivity
     95rules to change the interpretation of property expressions.
    8496
    8597== RL1.6 Line Boundaries ==
    8698
    87 icGrep implements full Unicode line boundaries.   Additional description to follow.
     99icGrep implements full Unicode line boundaries.   
    88100
    89101== RL1.7 Supplementary Code Points ==