EMDL

Embedded and mobile deep learning research notes

Papers

Model

DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices [arXiv '17, Samsung]
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices [arXiv '17, Megvii]
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications [arXiv '17, Google]

System

DeepMon: Mobile GPU-based Deep Learning Framework for Continuous Vision Applications [MobiSys '17]
DeepEye: Resource Efficient Local Execution of Multiple Deep Vision Models using Wearable Commodity Hardware [MobiSys '17]
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU [EMDL '17]
DeepSense: A GPU-based deep convolutional neural network framework on commodity mobile devices [WearSys '16]
DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices [IPSN '16]
EIE: Efficient Inference Engine on Compressed Deep Neural Network [ISCA '16]
MCDNN: An Approximation-Based Execution Framework for Deep Stream Processing Under Resource Constraints [MobiSys '16]
DXTK: Enabling Resource-efficient Deep Learning on Mobile and Embedded Devices with the DeepX Toolkit [MobiCASE '16]
Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference on Wearables [SenSys '16]
An Early Resource Characterization of Deep Learning on Wearables, Smartphones and Internet-of-Things Devices [IoT-App '15]
CNNdroid: GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android [MM '16]

Quantization

Pruning

Learning both Weights and Connections for Efficient Neural Networks [NIPS '15]
Pruning Filters for Efficient ConvNets [ICLR '17]
Pruning Convolutional Neural Networks for Resource Efficient Inference [ICLR '17]
Soft Weight-Sharing for Neural Network Compression [ICLR '17]
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding [ICLR '16]
Dynamic Network Surgery for Efficient DNNs [NIPS '16]
Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning [CVPR '17]
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression [ICCV '17]

Approximation

Efficient and Accurate Approximations of Nonlinear Convolutional Networks [CVPR '15]
Accelerating Very Deep Convolutional Networks for Classification and Detection (extended version of the paper above)
Convolutional neural networks with low-rank regularization [arXiv '15]
Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation [NIPS '14]
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications [ICLR '16]

Libraries

General

Web

mil-tokyo/webdnn: Fastest DNN Execution Framework on Web Browser

Tutorials

General

NEON

NEON™ Programmer’s Guide

OpenCL

Courses

Deep learning systems, UW course schedule (focused on systems design, not learning)

KangolHsu/emdl
