TY - GEN
T1 - Automatic generation of streaming datapaths for arbitrary fixed permutations
AU - Milder, Peter A.
AU - Hoe, James C.
AU - Püschel, Markus
PY - 2009
Y1 - 2009
N2 - This paper presents a technique to perform arbitrary fixed permutations on streaming data. We describe a parameterized architecture that takes as input n data points streamed at a rate of w per cycle, performs a permutation over all n points, and outputs the result in the same streaming format. We describe the system and its requirements mathematically and use this mathematical description to show that the datapaths resulting from our technique can sustain a full throughput of w words per cycle without stalling. Additionally, we provide an algorithm to configure the datapath for a given permutation and streaming width. Using this technique, we have constructed a full synthesis system that takes as input a permutation and a streaming width and outputs a register-transfer level Verilog description of the datapath. We present an evaluation of our generated designs over varying problem sizes and streaming widths, synthesized for a Xilinx Virtex-5 FPGA.
AB - This paper presents a technique to perform arbitrary fixed permutations on streaming data. We describe a parameterized architecture that takes as input n data points streamed at a rate of w per cycle, performs a permutation over all n points, and outputs the result in the same streaming format. We describe the system and its requirements mathematically and use this mathematical description to show that the datapaths resulting from our technique can sustain a full throughput of w words per cycle without stalling. Additionally, we provide an algorithm to configure the datapath for a given permutation and streaming width. Using this technique, we have constructed a full synthesis system that takes as input a permutation and a streaming width and outputs a register-transfer level Verilog description of the datapath. We present an evaluation of our generated designs over varying problem sizes and streaming widths, synthesized for a Xilinx Virtex-5 FPGA.
UR - https://www.scopus.com/pages/publications/70350059437
U2 - 10.1109/date.2009.5090831
DO - 10.1109/date.2009.5090831
M3 - Conference contribution
SN - 9783981080155
T3 - Proceedings -Design, Automation and Test in Europe, DATE
SP - 1118
EP - 1123
BT - Proceedings - 2009 Design, Automation and Test in Europe Conference and Exhibition, DATE '09
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2009 Design, Automation and Test in Europe Conference and Exhibition, DATE '09
Y2 - 20 April 2009 through 24 April 2009
ER -