Fast 2D Convolution Algorithms for Convolutional Neural Networks

Chao Cheng; Keshab K. Parhi

doi:10.1109/TCSI.2020.2964748

Fast 2D Convolution Algorithms for Convolutional Neural Networks

Chao Cheng, Keshab K. Parhi

Electrical and Computer Engineering

Research output: Contribution to journal › Article › peer-review

22 Scopus citations

Abstract

Convolutional Neural Networks (CNN) are widely used in different artificial intelligence (AI) applications. Major part of the computation of a CNN involves 2D convolution. In this paper, we propose novel fast convolution algorithms for both 1D and 2D to remove the redundant multiplication operations in convolution computations at the cost of controlled increase of addition operations. For example, when the 2D processing block size is 3\times 3 , our algorithm has multiplication saving factor as high as 3.24, compared to direct 2D convolution computation scheme. The proposed algorithm can also process input feature maps and generate output feature maps with the same flexible block sizes that are independent of convolution weight kernel size. The memory access efficiency is also largely improved by the proposed method. These structures can be applied to different CNN layers, such as convolution with stride > 1, pooling and deconvolution by exploring flexible feature map processing tile sizes. The proposed algorithm is suitable for both software and hardware implementation.

Original language	English (US)
Article number	8966288
Pages (from-to)	1678-1691
Number of pages	14
Journal	IEEE Transactions on Circuits and Systems I: Regular Papers
Volume	67
Issue number	5
DOIs	https://doi.org/10.1109/TCSI.2020.2964748
State	Published - May 2020

Bibliographical note

Funding Information:
Manuscript received November 5, 2019; revised December 20, 2019; accepted January 4, 2020. Date of publication January 22, 2020; date of current version May 1, 2020. The work of Keshab K. Parhi was supported by the National Science Foundation under Grant CCF-1814759. This article was recommended by Associate Editor G. Jovanovic Dolecek. (Corresponding author: Chao Cheng.) Chao Cheng is with the AI Computation Technologies Laboratory, Alibaba Damo Academy, Sunnyvale, CA 94085 USA (e-mail: chen0867@umn.edu).

Publisher Copyright:
© 2004-2012 IEEE.

Keywords

Convolutional neural network
Kronecker product
Winograd algorithm
deconvolution
fast convolution
parallel FIR filter

Access

10.1109/TCSI.2020.2964748

OpenUrl availability

Full text

Cite this

@article{8e11c958941040dfa10794f7eff7f4c2,

title = "Fast 2D Convolution Algorithms for Convolutional Neural Networks",

abstract = "Convolutional Neural Networks (CNN) are widely used in different artificial intelligence (AI) applications. Major part of the computation of a CNN involves 2D convolution. In this paper, we propose novel fast convolution algorithms for both 1D and 2D to remove the redundant multiplication operations in convolution computations at the cost of controlled increase of addition operations. For example, when the 2D processing block size is 3\times 3 , our algorithm has multiplication saving factor as high as 3.24, compared to direct 2D convolution computation scheme. The proposed algorithm can also process input feature maps and generate output feature maps with the same flexible block sizes that are independent of convolution weight kernel size. The memory access efficiency is also largely improved by the proposed method. These structures can be applied to different CNN layers, such as convolution with stride > 1, pooling and deconvolution by exploring flexible feature map processing tile sizes. The proposed algorithm is suitable for both software and hardware implementation.",

keywords = "Convolutional neural network, Kronecker product, Winograd algorithm, deconvolution, fast convolution, parallel FIR filter",

author = "Chao Cheng and Parhi, {Keshab K.}",

note = "Funding Information: Manuscript received November 5, 2019; revised December 20, 2019; accepted January 4, 2020. Date of publication January 22, 2020; date of current version May 1, 2020. The work of Keshab K. Parhi was supported by the National Science Foundation under Grant CCF-1814759. This article was recommended by Associate Editor G. Jovanovic Dolecek. (Corresponding author: Chao Cheng.) Chao Cheng is with the AI Computation Technologies Laboratory, Alibaba Damo Academy, Sunnyvale, CA 94085 USA (e-mail: chen0867@umn.edu). Publisher Copyright: {\textcopyright} 2004-2012 IEEE.",

year = "2020",

month = may,

doi = "10.1109/TCSI.2020.2964748",

language = "English (US)",

volume = "67",

pages = "1678--1691",

journal = "IEEE Transactions on Circuits and Systems I: Regular Papers",

issn = "1549-8328",

number = "5",

}

TY - JOUR

T1 - Fast 2D Convolution Algorithms for Convolutional Neural Networks

AU - Cheng, Chao

AU - Parhi, Keshab K.

N1 - Funding Information: Manuscript received November 5, 2019; revised December 20, 2019; accepted January 4, 2020. Date of publication January 22, 2020; date of current version May 1, 2020. The work of Keshab K. Parhi was supported by the National Science Foundation under Grant CCF-1814759. This article was recommended by Associate Editor G. Jovanovic Dolecek. (Corresponding author: Chao Cheng.) Chao Cheng is with the AI Computation Technologies Laboratory, Alibaba Damo Academy, Sunnyvale, CA 94085 USA (e-mail: chen0867@umn.edu). Publisher Copyright: © 2004-2012 IEEE.

PY - 2020/5

Y1 - 2020/5

N2 - Convolutional Neural Networks (CNN) are widely used in different artificial intelligence (AI) applications. Major part of the computation of a CNN involves 2D convolution. In this paper, we propose novel fast convolution algorithms for both 1D and 2D to remove the redundant multiplication operations in convolution computations at the cost of controlled increase of addition operations. For example, when the 2D processing block size is 3\times 3 , our algorithm has multiplication saving factor as high as 3.24, compared to direct 2D convolution computation scheme. The proposed algorithm can also process input feature maps and generate output feature maps with the same flexible block sizes that are independent of convolution weight kernel size. The memory access efficiency is also largely improved by the proposed method. These structures can be applied to different CNN layers, such as convolution with stride > 1, pooling and deconvolution by exploring flexible feature map processing tile sizes. The proposed algorithm is suitable for both software and hardware implementation.

AB - Convolutional Neural Networks (CNN) are widely used in different artificial intelligence (AI) applications. Major part of the computation of a CNN involves 2D convolution. In this paper, we propose novel fast convolution algorithms for both 1D and 2D to remove the redundant multiplication operations in convolution computations at the cost of controlled increase of addition operations. For example, when the 2D processing block size is 3\times 3 , our algorithm has multiplication saving factor as high as 3.24, compared to direct 2D convolution computation scheme. The proposed algorithm can also process input feature maps and generate output feature maps with the same flexible block sizes that are independent of convolution weight kernel size. The memory access efficiency is also largely improved by the proposed method. These structures can be applied to different CNN layers, such as convolution with stride > 1, pooling and deconvolution by exploring flexible feature map processing tile sizes. The proposed algorithm is suitable for both software and hardware implementation.

KW - Convolutional neural network

KW - Kronecker product

KW - Winograd algorithm

KW - deconvolution

KW - fast convolution

KW - parallel FIR filter

UR - http://www.scopus.com/inward/record.url?scp=85084407442&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85084407442&partnerID=8YFLogxK

U2 - 10.1109/TCSI.2020.2964748

DO - 10.1109/TCSI.2020.2964748

M3 - Article

AN - SCOPUS:85084407442

SN - 1549-8328

VL - 67

SP - 1678

EP - 1691

JO - IEEE Transactions on Circuits and Systems I: Regular Papers

JF - IEEE Transactions on Circuits and Systems I: Regular Papers

IS - 5

M1 - 8966288

ER -

Fast 2D Convolution Algorithms for Convolutional Neural Networks

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this