Welcome to the IKCEST

Microprocessors and Microsystems | Vol.60, Issue.0 | | Pages

Microprocessors and Microsystems

CaFPGA: An automatic generation model for CNN accelerator

Yong Dou   Jinwei Xu   Jingfei Jiang   Shijie Li   Zhiqiang Liu  
Abstract

Convolutional neural networks (CNNs) are gaining considerable popularity in numerous computer-vision applications. A convolutional architecture for fast feature embedding (Caffe) and other general frameworks has been proposed with the development of CNN. The field-programmable gate array (FPGA) as a classical platform is used to accelerate CNNs because CNNs are computationally complex tasks. However, the implementation of CNN on FPGA platforms is difficult. The present study focuses on exploring the performance-resource design space and proposes an automatic generation model to implement the CNN reconfigurable accelerator on the FPGA platform, which uses Caffe description text as its input file. A design-space exploration model is further proposed. This model includes a layer-folding pipeline structure to balance the bandwidth requirements of convolutional and fully connected layers with incremental exploration algorithms to exploit CNN parallelism. The AlexNet, VGG-S, and VGG-16 networks are implemented. The AlexNet accelerator can achieve 593.5 GOPS, and the VGG-16 accelerator can achieve 638.9 GOPS, which is equivalent or even exceeds that of the state-of-the-art CNN accelerator for VGG-16.

Original Text (This is the original text for your reference.)

CaFPGA: An automatic generation model for CNN accelerator

Convolutional neural networks (CNNs) are gaining considerable popularity in numerous computer-vision applications. A convolutional architecture for fast feature embedding (Caffe) and other general frameworks has been proposed with the development of CNN. The field-programmable gate array (FPGA) as a classical platform is used to accelerate CNNs because CNNs are computationally complex tasks. However, the implementation of CNN on FPGA platforms is difficult. The present study focuses on exploring the performance-resource design space and proposes an automatic generation model to implement the CNN reconfigurable accelerator on the FPGA platform, which uses Caffe description text as its input file. A design-space exploration model is further proposed. This model includes a layer-folding pipeline structure to balance the bandwidth requirements of convolutional and fully connected layers with incremental exploration algorithms to exploit CNN parallelism. The AlexNet, VGG-S, and VGG-16 networks are implemented. The AlexNet accelerator can achieve 593.5 GOPS, and the VGG-16 accelerator can achieve 638.9 GOPS, which is equivalent or even exceeds that of the state-of-the-art CNN accelerator for VGG-16.

+More

Cite this article
APA

APA

MLA

Chicago

Yong Dou,Jinwei Xu, Jingfei Jiang, Shijie Li, Zhiqiang Liu,.CaFPGA: An automatic generation model for CNN accelerator. 60 (0),.

Disclaimer: The translated content is provided by third-party translation service providers, and IKCEST shall not assume any responsibility for the accuracy and legality of the content.
Translate engine
Article's language
English
中文
Pусск
Français
Español
العربية
Português
Kikongo
Dutch
kiswahili
هَوُسَ
IsiZulu
Action
Recommended articles

Report

Select your report category*



Reason*



By pressing send, your feedback will be used to improve IKCEST. Your privacy will be protected.

Submit
Cancel