
1-bit Quantization of Deep Learning Models for Edge AI
Jeonghoon Kim, Nepes

1

Table of Contents

1. Company and Speaker Profile (Introduction to Organization and Business)

2. Project Overview

3. Development Background (Backgrounds)

4. Technical Challenges (Project Goals and Challenges)

5. Solutions and Results with MathWorks Tools (How did we get there and leverage MathWorks)

6. Achievements and Outlook

7. Concluding Remarks

2

Speaker: Jeonghoon Kim

• M.S. in Electrical and Electronic Engineering, Korea University

• Deep Learning Engineer and Technical Research Personnel, Nepes Corp.

• Contact: jeonghoon.samuel@gmail.com

• Research interests: Neural Network Quantization, Compact Network Design, Robotics Perception, State Estimation, …

• Activities in 2019:

- Speaker, Google Developer Group Gwangju DevFest 2019

- Advisor, Online Lifelong Education Institute, Korea University of Technology and Education

- Judge, Samsung Open Source Conference (SOSCON) 2019

- MathWorks Advisory Board 2019

- Organizer, AI Robotics Community

- Study group leader, Neural Network Quantization & Compact Network Development

I find great fulfillment in researching, sharing, and growing together!

Today's talk: Binarized Neural Networks on MATLAB

3

[Company overview diagram: nepes Future Intelligence. AI CCTV System Research Projects, Real Applications & Core Value Research, AI Factory, Device Solution, Software/Hardware]

4

Project Overview: Developing a Binarized Neural Network Trainer in MATLAB

Hubara, Itay, et al. "Binarized neural networks." Advances in neural information processing systems. 2016.

Is MATLAB a closed development environment limited to the features it ships with?

How can this paper be implemented in MATLAB?

5

Development Background

Neural-network-based applications such as object detection and semantic segmentation deliver excellent performance.

6

Development Background

Customers demand real-time performance.

7

Development Background

[Diagram: customer workload (x4) running on a high-end GPU]

8

Development Background

Chen, Jiasi, and Xukan Ran. "Deep learning with edge computing: A review." Proceedings of the IEEE 107.8 (2019): 1655-1674.

Various inference environments for deep learning networks

9

Development Background

[Diagram: Neural Network → ? → Target Environment]

10

Development Background

[Diagram: Neural Network → Quantization → quantized Neural Network → Target Environment]

11

Technical Challenges

Quantization: 32-bit (full precision) → 1-bit (binarization)

Let's implement 1-bit quantization in MATLAB!

(MATLAB R2020a now supports 8-bit quantization in part. Try it today!)

https://nervanasystems.github.io/distiller/quantization.html

Quantization lowers the bit width of the weights and activations that make up a network, to lighten the model or accelerate its computation.
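As a rough sketch of the idea (an assumed min-max affine scheme for intuition only, not a MathWorks quantization API), a uniform k-bit quantizer for a weight tensor can be written as:

    % Illustrative uniform k-bit quantization of a weight tensor
    % (assumption: simple min-max affine quantization, not a toolbox function)
    W  = randn(3, 3, 16, 'single');      % example real-valued weights
    k  = 8;                              % target bit width
    lo = min(W(:));  hi = max(W(:));
    s  = (hi - lo) / (2^k - 1);          % quantization step size
    Wq = round((W - lo) / s);            % integer codes in [0, 2^k - 1]
    Wd = Wq * s + lo;                    % dequantized approximation of W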

12

Technical Challenges

DeepLabV3+ (baseline: ResNet18) architecture weight count

What if we apply 1-bit quantization to DeepLabV3+?

                                                     FP32            1-bit
    Total Weight Count                               82.388992 MB    2.574656 MB
    Maximum Activation (input size: 360 x 480 x 3)   13.1328 MB      0.1728 MB

An obvious result, of course: 32-bit and 1-bit differ by a factor of 32 in volume!
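The weight figures check out exactly; a quick MATLAB sanity check, with the parameter count inferred from the slide's FP32 number:

    % Weight volume: FP32 vs. 1-bit
    numWeights = 82388992 / 4            % 4 bytes per FP32 weight -> 20,597,248 parameters
    fp32MB   = numWeights * 4 / 1e6      % 82.388992 MB
    oneBitMB = numWeights / 8 / 1e6      % 2.574656 MB, exactly 32x smaller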

13

Technical Challenges

Checkpoints:

1. Binarization function

2. Backpropagation of BNNs (Algorithm 1)

3. Computing gradients and updates

Hubara, Itay, et al. "Binarized neural networks." Advances in neural information processing systems. 2016.

14

Technical Challenges: Implementing the BNN

Binarization

Hubara, Itay, et al. "Binarized neural networks." Advances in neural information processing systems. 2016.
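The paper's deterministic binarization is x^b = Sign(x). One MATLAB caveat: the built-in sign() returns 0 at x = 0, so BNN code typically maps zero to +1 explicitly:

    % Deterministic binarization from Hubara et al. (2016)
    % sign() would emit 0 at x = 0, so use a comparison instead
    binarize = @(x) single(x >= 0) * 2 - 1;   % outputs in {-1, +1}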

15

Technical Challenges: Implementing the BNN

Straight-through estimator?

Figure from Darabi, Sajad, et al. "BNN+: Improved binary network training." arXiv preprint arXiv:1812.11800 (2018).
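The straight-through estimator from Hubara et al. treats the non-differentiable sign as the identity in the backward pass, but only where the input has not saturated. With forward pass q = Sign(r), the gradient estimate is

    g_r = g_q · 1_{|r| ≤ 1}

that is, the incoming gradient g_q passes straight through wherever |r| ≤ 1 and is zeroed elsewhere.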

16

Technical Challenges: Implementing the BNN

Algorithm 1*

*Hubara, Itay, et al. "Binarized neural networks." Advances in neural information processing systems. 2016.

17

Technical Challenges: Implementing the BNN

Algorithm 1: Training a BNN

- Binary activation: signum activation layer (backward: using the STE)

- Binary weights: when using a weight w^r, quantize it with w^b = Sign(w^r)

Constrain each real-valued weight between -1 and 1 → clip the weights during training; the real-valued weights would otherwise grow very large without any impact on the binary weights (a minimal update sketch follows the citation below).

Anderson, Alexander G., and Cory P. Berg. "The high-dimensional geometry of binary neural networks." ICLR2018.
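A minimal sketch of one Algorithm-1-style update with plain SGD on the latent real-valued weights; gradW stands in for the gradient obtained by backpropagating through the binarized forward pass with the STE:

    % One illustrative BNN training step
    Wr    = randn(3, 3, 16, 'single');   % latent real-valued weights
    gradW = randn(size(Wr), 'single');   % stand-in for the STE backprop gradient
    lr    = 0.01;                        % learning rate
    Wb = single(Wr >= 0) * 2 - 1;        % binarized weights used in the forward pass
    Wr = Wr - lr * gradW;                % SGD update applied to the latent weights
    Wr = max(-1, min(1, Wr));            % clip to [-1, 1] so Wr cannot grow unbounded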

18

Solutions and Results with MathWorks Tools

Model training with the existing functions

https://kr.mathworks.com/help/deeplearning/ref/trainingoptions.html
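For reference, this is what the documented training setup looks like; the layer array and training data are placeholders:

    % Standard Deep Learning Toolbox training configuration
    opts = trainingOptions('sgdm', ...
        'MaxEpochs', 100, ...
        'MiniBatchSize', 128, ...
        'InitialLearnRate', 0.01, ...
        'Plots', 'training-progress');
    % net = trainNetwork(XTrain, YTrain, layers, opts);  % data and layers are placeholders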

19

Solutions and Results with MathWorks Tools

What does the internal structure look like?

https://kr.mathworks.com/help/deeplearning/ref/trainingoptions.html

View the library function source with Ctrl+D (Cmd+Shift+D on Mac)
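Equivalently, the edit command opens the same source file from the command line:

    % Opens the convolution layer constructor's source in the editor,
    % just like pressing Ctrl+D on the function name
    edit convolution2dLayer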

20

Solutions and Results with MathWorks Tools

A look inside the convolution layers!

convolution2dLayer -> Convolution2D -> Executionstrategy -> Convolution2DLayer

21


Solutions and Results with MathWorks Tools

A look inside the convolution layers!

Execution strategy

23

Solutions and Results with MathWorks Tools

Binarized Convolutional Layers

Save the file under a different name, then modify it!

This creates a binarized convolution function! (A stand-in sketch follows below.)
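The internal ExecutionStrategy classes are not a public API, so as a purely illustrative stand-in, here is the same idea expressed with the documented dlarray/dlconv functions: binarize the weights right before the convolution call.

    % Illustrative binarized convolution via the documented dlconv API
    % (the talk instead edits the copied internal execution strategy)
    X  = dlarray(randn(32, 32, 3, 1, 'single'), 'SSCB'); % spatial, spatial, channel, batch
    W  = randn(3, 3, 3, 16, 'single');                   % real-valued filters
    Wb = single(W >= 0) * 2 - 1;                         % 1-bit filters in {-1, +1}
    Z  = dlconv(X, Wb, 0, 'Padding', 'same');            % convolve with binarized weights, zero bias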

24

Solutions and Results with MathWorks Tools

Build the activation function you need with a Custom Layer!

https://kr.mathworks.com/help/deeplearning/ug/define-custom-deep-learning-layers.html

Sign Activation Layer → Binary Activation!
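A minimal sketch of such a layer under the custom-layer API of that era (predict plus a hand-written backward implementing the STE); the class name is illustrative:

    % signActivationLayer.m -- illustrative custom layer (name assumed)
    classdef signActivationLayer < nnet.layer.Layer
        methods
            function layer = signActivationLayer(name)
                layer.Name = name;
                layer.Description = "Sign activation with straight-through estimator";
            end
            function Z = predict(layer, X)
                Z = single(X >= 0) * 2 - 1;      % binarize activations to {-1, +1}
            end
            function dLdX = backward(layer, X, Z, dLdZ, memory)
                dLdX = dLdZ .* (abs(X) <= 1);    % STE: pass gradient only where |X| <= 1
            end
        end
    end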

25

Solutions and Results with MathWorks Tools

New Functions for BNN Training

BNN (VGG7 model)

Functions for the 1-bit quantization of a CNN
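One assumed way such a VGG7-style BNN could be assembled from these pieces (filter counts and depth are illustrative; signActivationLayer is the custom layer sketched earlier, and the convolutions are binarized inside the modified execution strategy):

    % Hypothetical VGG7-style BNN front end for CIFAR-10
    layers = [
        imageInputLayer([32 32 3])
        convolution2dLayer(3, 128, 'Padding', 'same')  % binarized internally
        batchNormalizationLayer
        signActivationLayer('sign1')                   % binary activation
        maxPooling2dLayer(2, 'Stride', 2)
        % ... further binarized conv blocks ...
        fullyConnectedLayer(10)
        softmaxLayer
        classificationLayer];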

26

Solutions and Results with MathWorks Tools

*Nakahara, Hiroki, Tomoya Fujii, and Shimpei Sato. "A fully connected layer elimination for a binarized convolutional neural network on an FPGA." 2017 27th International Conference on Field Programmable Logic and Applications (FPL). IEEE, 2017.

CIFAR-10 Training Results

Paper Reference*

Error Rate = 18.2%

27

Solutions and Results with MathWorks Tools

Latent Space Visualization (t-SNE)

Nakahara, Hiroki, Tomoya Fujii, and Shimpei Sato. "A fully connected layer elimination for a binarized convolutional neural network on an FPGA." 2017 27th International Conference on Field Programmable Logic and Applications (FPL). IEEE, 2017.

Raw BNN Features
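For reference, this kind of plot can be produced with the documented tsne function from Statistics and Machine Learning Toolbox; features and labels are placeholders for the network's penultimate-layer activations and the class labels:

    % 2-D embedding of the BNN features, colored by class
    Y = tsne(features);                  % features: N-by-D activation matrix (placeholder)
    gscatter(Y(:,1), Y(:,2), labels);    % labels: N-by-1 class labels (placeholder)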

28

Solutions and Results with MathWorks Tools

BNN Training Results

What the paper said vs. what I got

29

Achievements and Outlook

● BNN is a neural network quantization technique that quantizes a model's weights and activations to 1 bit.

● The gains from 1-bit quantization include:

1. Smaller model size

2. Lower memory consumption

3. Expected computational acceleration

4. Higher integration density in hardware implementations

● We showed how to implement the BNN paper in MATLAB and evaluated the result using existing MATLAB functions.

What worked well when implementing the paper in MATLAB…

● The internals of the per-layer functions are accessible, so their structure can be inspected.

● To implement a needed feature, you can go straight to the relevant function or class and build each module intuitively.

Pain points encountered along the way…

● Not every function is accessible.

● Comments and documentation are often sparse, and functions are frequently interwoven in complex ways.

30

Concluding Remarks

- Deep learning network models are ultimately deployed to a wide variety of environments.

- From the edge to the cloud, each environment imposes its own constraints.

- At deployment time, lightweighting and compression tailored to the target environment are essential.

- Neural network quantization and model compression techniques are therefore important.

- Today we introduced 1-bit (binarized) neural networks, one such quantization approach.

- Quantized network models deliver the greatest benefit at inference time when paired with an accelerator.

- Each environment offers its own forms of accelerators and compute units.

- Beyond the smaller model size, quantization paired with a matching compute unit brings computational acceleration, higher hardware integration density, lower memory consumption, and more.

- Examples: Google Coral, Xilinx FPGA, Arm's Ethos, Mali, and Arm NN, NVIDIA's TensorRT, etc.
