# Changeset 1088 for docs/PACT2011/00-abstract.tex

Ignore:
Timestamp:
Apr 8, 2011, 5:36:57 PM (8 years ago)
Message:

Minor edits.

File:
1 edited

### Legend:

Unmodified
 r1040 XML is a data format designed for documents as well as the representation of data structures. The simplicity and generality of representation of data structures.  The simplicity and generality of the rules make it widely used in web services and database systems. Traditional XML parsers have been built around the systems.  Traditional XML parsers have been built around the byte-at-a-time model, in which they process every character token in the file in a sequential fashion.  Unfortunately, the byte-at-time The first generation Parabix1 parser then uses bit-scan instructions over these streams to make multibyte moves in an otherwise sequential approach.   The second generation Parabix2 technology adds further approach.  The second generation Parabix2 technology adds further parallelism by replacing much of the sequential bit scanning with a parallel scanning approach based on bit-stream addition.    We evaluate Parabix1 and Parabix2 against two widely-used XML parsers, James Clark's Expat and Apache's Xerces-C addition.  We evaluate Parabix1 and Parabix2 against two widely-used XML parsers, James Clark's Expat and Apache's Xerces on three generations of x86 machines, including the new Intel \SB{}.    We show that Parabix2's speedup is 2$\times$--7$\times$ \SB{}.  We show that Parabix2's speedup is 2$\times$--7$\times$ over Expat and Xerces.  In stark contrast to the energy expenditures necessary to realize performance gains through multicore parallelism, we also show that our Parabix parsers deliver energy savings directly in proportion to performance gains.   We also assess the scalability advantages to performance gains.  We also assess the scalability advantages of SIMD processor improvements the different Intel machine generations, culminating with an evaluation of the 256-bit AVX technology in