A Fast Convolution Algorithm and Accelerator for Convolutional Neural Networks

DC Field: Value
dc.contributor.advisor: 선우명훈
dc.contributor.author: 김태선
dc.date.accessioned: 2019-04-01T16:42:37Z
dc.date.available: 2019-04-01T16:42:37Z
dc.date.issued: 2019-02
dc.identifier.other: 28581
dc.identifier.uri: https://dspace.ajou.ac.kr/handle/2018.oak/15241
dc.description: Thesis (Doctoral)--Graduate School, Ajou University: Department of Electronic Engineering, 2019. 2
dc.description.tableofcontents:
I. Introduction
II. Overview of Convolutional Neural Networks
  A. Overall Architecture
  B. Convolution Layer
  C. Pooling Layer
  D. Convolutional Neural Networks
    1. LeNet
    2. AlexNet
    3. VGG-16
    4. GoogLeNet
III. Acceleration for Deep Neural Networks
  A. Quantization and Binarization
  B. Pruning and Sharing
  C. Low-Rank Factorization and Sparsity
IV. Two-Step MAC Operation for Convolutional Layer
V. Architecture for Two-Step MAC Operation
  D. Modified HCCA
  E. Reconstruction of the Output Pixel Ordering
  F. Overall Architecture
  G. PEG Architecture
  H. Temporary Feature Map
VI. Experimental Results
  I. Algorithms Performance
  J. Hardware Accelerator
VII. Conclusions
Bibliography
dc.language.iso: eng
dc.publisher: The Graduate School, Ajou University
dc.rights: Ajou University theses are protected by copyright.
dc.title: A Fast Convolution Algorithm and Accelerator for Convolutional Neural Networks
dc.type: Thesis
dc.contributor.affiliation: Graduate School, Ajou University
dc.contributor.department: Department of Electronic Engineering, Graduate School
dc.date.awarded: 2019. 2
dc.description.degree: Doctoral
dc.identifier.localId: 905150
dc.identifier.uci: I804:41038-000000028581
dc.identifier.url: http://dcoll.ajou.ac.kr:9080/dcollection/common/orgView/000000028581
dc.description.alternativeAbstract: Recent advances in computing power, made possible by the development of faster general-purpose graphics processing units (GPGPUs), have increased the complexity of convolutional neural network (CNN) models. However, because of the limited applicability of existing GPGPUs, CNN accelerators are becoming more important. Current accelerators focus on improvements in memory scheduling and architecture, so the number of multiplier-accumulator (MAC) operations is not reduced. In this study, a new convolution-layer operation algorithm is proposed that uses a coarse-to-fine method instead of a hardware or architectural approach. This algorithm is shown to reduce the MAC operations by 33%, while the Top-1 accuracy decreases by only 3% and the Top-5 accuracy by only 1%. Furthermore, the proposed hardware accelerator demonstrates higher performance, lower power consumption, and higher energy efficiency than other ASIC implementations except for [45]. Compared to the hardware accelerator of [45], the proposed accelerator demonstrates 1.7× higher performance, 65% less on-chip memory, and a 20% lower gate count. Compared to the accelerator of [22], the proposed accelerator has a larger gate count but demonstrates higher performance, lower power consumption, 1.7–1.8× better energy efficiency, and a smaller on-chip memory.
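The record does not include the algorithm's details, so the following is only a minimal sketch of one common coarse-to-fine (two-step) MAC scheme of the kind the abstract describes: a cheap low-precision first pass predicts which output pixels a subsequent ReLU would zero out, and the full-precision MACs run only for the remaining pixels. The function name, the quantization choice, and the sign-prediction criterion are all illustrative assumptions, not the thesis's actual method.

```python
import numpy as np

def two_step_conv2d(x, w, coarse_bits=4):
    """Hypothetical coarse-to-fine 2-D convolution (valid padding).

    Step 1 (coarse): convolve a low-precision copy of the input to
    cheaply predict the sign of each output pixel.
    Step 2 (fine): recompute at full precision only the pixels whose
    coarse result is positive; the others would be zeroed by ReLU
    anyway, so their full-precision MACs are skipped.
    Returns the post-ReLU feature map and the fraction of output
    pixels that needed the fine (full-precision) pass.
    """
    H, W = x.shape
    k = w.shape[0]
    oh, ow = H - k + 1, W - k + 1

    # Coarse operand: keep only the top `coarse_bits` bits of magnitude.
    scale = 2.0 ** (np.ceil(np.log2(np.abs(x).max() + 1e-12)) - coarse_bits)
    xq = np.round(x / scale) * scale

    out = np.zeros((oh, ow))
    fine = 0
    for i in range(oh):
        for j in range(ow):
            coarse = np.sum(xq[i:i + k, j:j + k] * w)
            if coarse > 0:  # predicted to survive the ReLU
                out[i, j] = max(np.sum(x[i:i + k, j:j + k] * w), 0.0)
                fine += 1
            # else: skip this pixel's full-precision MACs entirely
    return out, fine / (oh * ow)
```

Under this sketch, the MAC savings come from the skipped fine passes; the accuracy cost (the small Top-1/Top-5 drops reported above) would come from pixels whose coarse sign prediction is wrong.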
Appears in Collections:
Graduate School of Ajou University > Department of Electronic Engineering > 4. Theses(Ph.D)
Files in This Item:
There are no files associated with this item.

