Changeset 3617 for docs


Ignore:
Timestamp:
Jan 24, 2014, 11:07:08 AM (5 years ago)
Author:
cameron
Message:

Some cleanups

Location:
docs/Working/re
Files:
4 edited

Legend:

Unmodified
Added
Removed
  • docs/Working/re/avx2.tex

    r3515 r3617  
    1919\paragraph*{AVX2 Stream Addition}
    2020 \begin{figure*}[tbh]
    21 \begin{center}
     21
     22\begin{center} \small
    2223\begin{verbatim}
    2324void add_ci_co(bitblock_t x, bitblock_t y, carry_t carry_in, carry_t & carry_out, bitblock_t & sum) {
     
    3637  sum = simd256<64>::add(partial_sum, _mm256_cvtepu16_epi64(avx_select_lo128(convert(spread))));
    3738}
    38 
    3939\end{verbatim}
    40 
    4140\end{center}
    4241\caption{AVX2 256-bit Addition}
     
    6564tick label style={font=\tiny},
    6665enlarge x limits=0.15,
    67 enlarge y limits={0.15, upper},
     66%enlarge y limits={0.15, upper},
    6867ymin=0,
    6968legend style={at={(0.5,-0.15)},
     
    116115tick label style={font=\tiny},
    117116enlarge x limits=0.15,
    118 enlarge y limits={0.15, upper},
     117%enlarge y limits={0.15, upper},
    119118ymin=0,
    120119legend style={at={(0.5,-0.15)},
     
    140139instruction count was reflected in a considerable speed-up
    141140in the bitstreams implementation.  However, the speed-up was
    142 considerably less than expected.  As shown in \label{fig:AVXIPC}
     141considerably less than expected.  As shown in \ref{fig:AVXIPC}
    143142the AVX2 version has lost some of the superscalar efficiency
    144143of the SSE2 code.   This is a performance debugging issue
     
    155154tick label style={font=\tiny},
    156155enlarge x limits=0.15,
    157 enlarge y limits={0.15, upper},
     156%enlarge y limits={0.15, upper},
    158157ymin=0,
    159158legend style={at={(0.5,-0.15)},
  • docs/Working/re/re-main.tex

    r3574 r3617  
    1 %-----------------------------------------------------------------------------
    2 %
    3 %               Template for sigplanconf LaTeX Class
    4 %
    5 % Name:         sigplanconf-template.tex
    6 %
    7 % Purpose:      A template for sigplanconf.cls, which is a LaTeX 2e class
    8 %               file for SIGPLAN conference proceedings.
    9 %
    10 % Guide:        Refer to "Author's Guide to the ACM SIGPLAN Class,"
    11 %               sigplanconf-guide.pdf
    12 %
    13 % Author:       Paul C. Anagnostopoulos
    14 %               Windfall Software
    15 %               978 371-2316
    16 %               paul@windfall.com
    17 %
    18 % Created:      15 February 2005
    19 %
    20 %-----------------------------------------------------------------------------
    21 
    22 
    23 \documentclass[preprint]{sigplanconf}
    24 
    25 % The following \documentclass options may be useful:
    26 
    27 % preprint      Remove this option only once the paper is in final form.
    28 % 10pt          To set in 10-point type instead of 9-point.
    29 % 11pt          To set in 11-point type instead of 9-point.
    30 % authoryear    To obtain author/year citation style instead of numeric.
    31 
     1
     2%%%%%%%%%%%%%%%%%%%%%%% file typeinst.tex %%%%%%%%%%%%%%%%%%%%%%%%%
     3%
     4% This is the LaTeX source for the instructions to authors using
     5% the LaTeX document class 'llncs.cls' for contributions to
     6% the Lecture Notes in Computer Sciences series.
     7% http://www.springer.com/lncs       Springer Heidelberg 2006/05/04
     8%
     9% It may be used as a template for your own input - copy it
     10% to a new file with a new name and use it as the basis
     11% for your article.
     12%
     13% NB: the document class 'llncs' has its own and detailed documentation, see
     14% ftp://ftp.springer.de/data/pubftp/pub/tex/latex/llncs/latex2e/llncsdoc.pdf
     15%
     16%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
     17
     18
     19\documentclass[runningheads,a4paper]{llncs}
     20
     21\usepackage{amssymb}
    3222\usepackage{amsmath}
     23\setcounter{tocdepth}{3}
     24\usepackage{graphicx}
     25\usepackage{tikz}
    3326\usepackage{pgfplots}
    3427
     28\usepackage{url}
     29\urldef{\mails}\path|{cameron,ksherdy,lindanl,linmengl,bhull,shermer,ashriram}@cs.sfu.ca|   
     30\newcommand{\keywords}[1]{\par\addvspace\baselineskip
     31\noindent\keywordname\enspace\ignorespaces#1}
     32
    3533\begin{document}
    3634
    37 \special{papersize=8.5in,11in}
    38 \setlength{\pdfpageheight}{\paperheight}
    39 \setlength{\pdfpagewidth}{\paperwidth}
    40 
    41 \conferenceinfo{PPoPP 2014}{February 15-19, 2014, Orlando, Florida, United States}
    42 \copyrightyear{2013}
    43 \copyrightdata{978-1-nnnn-nnnn-n/yy/mm}
    44 \doi{nnnnnnn.nnnnnnn}
    45 
    46 % Uncomment one of the following two, if you are not going for the
    47 % traditional copyright transfer agreement.
    48 
    49 %\exclusivelicense                % ACM gets exclusive license to publish,
    50                                   % you retain copyright
    51 
    52 %\permissiontopublish             % ACM gets nonexclusive license to publish
    53                                   % (paid open-access papers,
    54                                   % short abstracts)
    55 
    56 \titlebanner{}        % These are ignored unless
    57 \preprintfooter{Bitwise Data Parallel Grep}   % 'preprint' option specified.
    58 
     35\mainmatter  % start of an individual contribution
     36
     37% first the title is needed
    5938\title{Bitwise Data Parallelism in Regular Expression Matching}
    60 %\subtitle{Subtitle Text, if any}
    61 
    62 \authorinfo{Anonymous Authors}{Institutions}{emails}
    63 %\authorinfo{Robert D. Cameron \and Kenneth S. Herdy \and Dan Lin \and Meng Lin \and Ben Hull \and Thomas S. Shermer \and Arrvindh Shriraman}
     39
     40% a short form should be given in case it is too long for the running head
     41\titlerunning{Bitwise Data Parallel Grep}
     42
     43% the name(s) of the author(s) follow(s) next
     44%
     45% NB: Chinese authors should write their first names(s) in front of
     46% their surnames. This ensures that the names appear correctly in
     47% the running heads and the author index.
     48%
     49\author{Robert D. Cameron \and Kenneth S. Herdy \and Dan Lin \and Meng Lin \and Ben Hull \and Thomas S. Shermer \and Arrvindh Shriraman}
    6450%          {Simon Fraser University}
    6551%           {\{cameron,ksherdy,lindanl,linmengl,bhull,shermer,ashriram\}@cs.sfu.ca}
    6652
     53%
     54\authorrunning{Cameron et al}
     55% (feature abused for this document to repeat the title also on left hand pages)
     56
     57% the affiliations are given next; don't give your e-mail address
     58% unless you accept that it will be published
     59\institute{Simon Fraser University\\
     60\mails\\
     61}
     62
     63%
     64% NB: a more complex sample for affiliations and the mapping to the
     65% corresponding authors can be found in the file "llncs.dem"
     66% (search for the string "\mainmatter" where a contribution starts).
     67% "llncs.dem" accompanies the document class "llncs.cls".
     68%
     69
     70\toctitle{Bitwise Data Parallelism in Regular Expression Matching}
     71\tocauthor{Cameron, et al}
    6772\maketitle
     73
     74
     75
    6876
    6977\begin{abstract}
    7078\input{abstract}
    7179\end{abstract}
    72 \category{Theory of computation}{Formal languages and automata theory}{Regular languages}
    73 \category{Computer systems organization}{Parallel architectures}{Single instruction, multiple data}
    74 
    75 % general terms are not compulsory anymore,
    76 % you may leave them out
    77 %\terms
    78 %term1, term2
    79 
    80 \keywords
    81 regular expression matching, grep, parallel bit stream technology
    8280
    8381\section{Introduction}
     
    113111is an inability to skip input characters.
    114112Simple string matching algorithms,
    115 such as the Boyer-Moore family of algorithms \cite{boyer1977fast, horspool1980practical} skip input characters
     113such as the Boyer-Moore family of algorithms \cite{boyer1977fast,horspool1980practical} skip input characters
    116114to achieve sublinear times in the average case.
    117115Backward Dawg Matching (BDM) string matching algorithms \cite{crochemore1994text}
     
    140138performs small-ruleset regular expression matching at a rate of
    1411392.88 Gbps per chip on Intel Xeon E5472 hardware \cite{pasetto2010}.
    142 Naghmouchi et al. \cite{scarpazza2011top, naghmouchi2010} demonstrated that the Aho-Corasick (AC)
     140Naghmouchi et al. \cite{scarpazza2011top,naghmouchi2010} demonstrated that the Aho-Corasick (AC)
    143141string matching algorithm \cite{aho1975} is well suited for parallel
    144142implementation on multi-core CPUs, GPUs and the Cell BE.
     
    618616
    619617\begin{figure*}[tbh]
    620 \begin{center}
     618\begin{center}\small
    621619\begin{verbatim}
    622620inline BitBlock adc(int idx, BitBlock a, BitBlock b, __local BitBlock *carry, _
     
    649647\end{verbatim}
    650648
    651    
    652 
    653649\end{center}
    654650\caption{OpenCL 4096-bit Addition}
     
    668664tick label style={font=\tiny},
    669665enlarge x limits=0.15,
    670 enlarge y limits={0.15, upper},
     666%enlarge y limits={0.15, upper},
    671667ymin=0,
    672668legend style={at={(0.5,-0.15)},
     
    705701%This is the text of the appendix, if you need one.
    706702
    707 \acks
    708703
    709704This research was supported by grants from the Natural Sciences and Engineering Research Council of Canada and
     
    712707% We recommend abbrvnat bibliography style.
    713708
    714 \bibliographystyle{abbrvnat}
     709\bibliographystyle{splncs}
    715710
    716711% The bibliography should be embedded for final submission.
  • docs/Working/re/sse2.tex

    r3510 r3617  
    6565tick label style={font=\tiny},
    6666enlarge x limits=0.15,
    67 enlarge y limits={0.15, upper},
     67%enlarge y limits={0.15, upper},
    6868ymin=0,
    6969legend style={at={(0.5,-0.15)},
     
    7272ybar,
    7373bar width=7pt,
    74 visualization depends on=y \as \rawy,
     74%visualization depends on=y \as \rawy,
    7575]
    7676\addplot
     
    109109tick label style={font=\tiny},
    110110enlarge x limits=0.15,
    111 enlarge y limits={0.15, upper},
     111%enlarge y limits={0.15, upper},
    112112ymin=0,
    113113legend style={at={(0.5,-0.15)},
     
    143143tick label style={font=\tiny},
    144144enlarge x limits=0.15,
    145 enlarge y limits={0.15, upper},
     145%enlarge y limits={0.15, upper},
    146146ymin=0,
    147147legend style={at={(0.5,-0.15)},
     
    173173tick label style={font=\tiny},
    174174enlarge x limits=0.15,
    175 enlarge y limits={0.15, upper},
     175%enlarge y limits={0.15, upper},
    176176ymin=0,
    177177legend style={at={(0.5,-0.15)},
Note: See TracChangeset for help on using the changeset viewer.