Changeset 1333


Ignore:
Timestamp:
Aug 21, 2011, 11:19:15 AM (8 years ago)
Author:
lindanl
Message:

Parabix parser structure

Location:
docs/HPCA2012
Files:
2 edited

Legend:

Unmodified
Added
Removed
  • docs/HPCA2012/03b-research.tex

    r1331 r1333  
    11\section{Parabix}
    22
    3 \subsection{Parabix Architecture}
     3\subsection{Parabix Structure}
    44\begin{figure}
    55\begin{center}
    66\includegraphics[width=0.5\textwidth]{plots/parabix_arch.pdf}
    77\end{center}
    8 \caption{Parabix2 Architecture}
     8\caption{Parabix2 Structure}
    99\label{parabix_arch}
    1010\end{figure}
    1111
    1212
    13 Figure \ref{parabix_arch} shows the overall architecture of the parabix for well-formedness checking.
    14 The input file is processed by 7 modules or 11 stages and the error position is reported at the end if there is any.
    15 The first stage, Read\_Data, loads a chunk of data from an input file to data\_buffer.
    16 The data is then transposed to eight parallel basis bitstreams (basis\_bits) in the Transposition stage.
    17 The eight bitstreams are used in Classification stage to generate all the XML lexical item streams (lex)
    18 as well as in U8\_Validation stage to validate UTF-8 characters.
    19 The lexical item streams and scope streams (scope) that are generated in Gen\_Scope stage
    20 are supplied to the parsing module, which consists three stages, Parse\_CtCDPI, Parse\_Ref and Parse\_tag.
    21 After parsing the comments, cdata, processing instructions, references and tags,
    22 information is gathered by Name\_Validation and Err\_Check stages,
    23 where name streams and error streams are calculated and passed to the final stage, Postprocessing.
    24 All the possible errors that cannot be detected by bitstreams are checked in this last stage and
    25 error type with line and column number will be reported.
     13Figure \ref{parabix_arch} shows the overall structure of the Parabix XML parser set up for
     14well-formedness checking.
     15The input file is processed using 11 functions organized into 7 modules. 
     16In the first module, the Read\_Data function loads data blocks from an input file to data\_buffer.
     17The data is then transposed to eight parallel basis bitstreams (basis\_bits) in the Transposition module.
     18The eight bitstreams are used in the Classification function to generate all the XML lexical item streams (lex)
     19as well as in the U8\_Validation module to validate UTF-8 characters.
     20The lexical item streams and scope streams (scope) that are generated in Gen\_Scope function
     21are supplied to the parsing module, which consists three functions, Parse\_CtCDPI, Parse\_Ref and Parse\_tag.
     22These functions deal with the parsing of
     23comments, CDATA sections, processing instructions, references and tags.   After this,
     24information is gathered by Name\_Validation and Err\_Check functions, producing
     25name check streams and error streams.  These are then passed to the final module for Postprocessing.
     26All the possible errors that cannot be conveniently detected by bitstreams are checked in this last module.
     27The final output reports any well-formedness error detected and its position within the input file.
    2628
    2729\subsection{Parallel Bit Stream Compilation}
Note: See TracChangeset for help on using the changeset viewer.