TY - GEN
T1 - Hybrid binary-unary hardware accelerator
AU - Rasoul Faraji, S.
AU - Bazargan, Kia
PY - 2019/1/21
Y1 - 2019/1/21
N2 - Stochastic computing has been used in recent years to create designs with significantly smaller area by harnessing unary encoding of data. However, the low area advantage comes at an exponential price in latency, making the area × delay cost unattractive. In this paper, we present a novel method which uses a hybrid binary / unary representation to perform computations. We first divide the input range into a few sub-regions, perform unary computations on each sub-region individually, and finally pack the outputs of all sub-regions back to compact binary. Moreover, we propose a synthesis methodology and a regression model to predict an optimal or sub-optimal design in the design space. The proposed method is especially well-suited to FPGAs due to the abundant availability of routing and flip-flop resources. To the best of our knowledge, we are the first to show a scalable method based on the principles of stochastic computing that can beat conventional binary in terms of a real cost, i.e., area×delay. Our method outperforms the binary and fully unary methods on a number of functions and on a common edge detection algorithm. In terms of area × delay cost, our cost is on average only 2.51% and 10.2% of the binary for 8- and 10-bit resolutions, respectively. These numbers are 2-3 orders of magnitude better than the results of traditional stochastic methods. Our method is not competitive with the binary method for high-resolution oscillating functions such as sin(15x).
AB - Stochastic computing has been used in recent years to create designs with significantly smaller area by harnessing unary encoding of data. However, the low area advantage comes at an exponential price in latency, making the area × delay cost unattractive. In this paper, we present a novel method which uses a hybrid binary / unary representation to perform computations. We first divide the input range into a few sub-regions, perform unary computations on each sub-region individually, and finally pack the outputs of all sub-regions back to compact binary. Moreover, we propose a synthesis methodology and a regression model to predict an optimal or sub-optimal design in the design space. The proposed method is especially well-suited to FPGAs due to the abundant availability of routing and flip-flop resources. To the best of our knowledge, we are the first to show a scalable method based on the principles of stochastic computing that can beat conventional binary in terms of a real cost, i.e., area×delay. Our method outperforms the binary and fully unary methods on a number of functions and on a common edge detection algorithm. In terms of area × delay cost, our cost is on average only 2.51% and 10.2% of the binary for 8- and 10-bit resolutions, respectively. These numbers are 2-3 orders of magnitude better than the results of traditional stochastic methods. Our method is not competitive with the binary method for high-resolution oscillating functions such as sin(15x).
KW - Alternator Logic
KW - Hardware accelerators
KW - Hybrid computing system
KW - Scaling Network
KW - Unary computing system
UR - http://www.scopus.com/inward/record.url?scp=85061142373&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061142373&partnerID=8YFLogxK
U2 - 10.1145/3287624.3287706
DO - 10.1145/3287624.3287706
M3 - Conference contribution
AN - SCOPUS:85061142373
T3 - Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC
SP - 210
EP - 215
BT - ASP-DAC 2019 - 24th Asia and South Pacific Design Automation Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th Asia and South Pacific Design Automation Conference, ASPDAC 2019
Y2 - 21 January 2019 through 24 January 2019
ER -