Applied Deep Learning Resources

简介: Applied Deep Learning Resources A collection of research articles, blog posts, slides and code snippets about deep learning in applied settings.

 


Applied Deep Learning Resources

A collection of research articles, blog posts, slides and code snippets about deep learning in applied settings. Including trained models and simple methods that can be used out of the box. Mainly focusing on Convolutional Neural Networks (CNN) but Recurrent Neural Networks (RNN), deep Q-Networks (DQN) and other interesting architectures will also be listed.

CNN

Latest overview of the CNNs can be found from the paper "Deep learning for visual understanding: A review" [linkPDF]

Another decent overview in Nature by LeCun, Bengio and Hinton: "Deep learning" [linkPDF]

ImageNet

ImageNet is the most important image classification and localization competition. Other data sets with results can be found from here: "Discover the current state of the art in objects classification." [link].

imagenet-sample

Prediction error of the ImageNet competition has been decreasing rapidly over the last 5 years: imagenet-error

Main network architectures on ImageNet

AlexNet

Original paper: "ImageNet Classification with Deep Convolutional Neural Networks" [PDF]

Properties: 8 weight layers (5 convolutional and 2 fully connected), 60 million parameters, Rectified Linear Units (ReLUs), Local Response Normalization, Dropout

alexnet

VGG

Original paper: "Very Deep Convolutional Networks for Large-Scale Image Recognition" [arxiv]

Properties: 19 weight layers, 144m parameters, 3x3 convolution filters, L2 regularised, Dropout, No Local Response Normalization

GoogLeNet

Original paper: "Going deeper with convolutions" [arxiv]

Lates upgrade to the model achieves even better scores with models and import to Torch: "Rethinking the Inception Architecture for Computer Vision" [arxiv], "Torch port of Inception V3" [github]

Properties: 22 layers, 7m parameters, Inception modules, 1x1 conv layers, ReLUs, Dropout, Mid-level outputs

Inception modules:

googlenet

ResNet

Original paper: "Deep Residual Learning for Image Recognition" [arxiv]

Very nice slides: "Deep Residual Learning" [PDF]

Github: [github]

Properties: 152 layers, ReLUs, Batch Normalization (See "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift" [arxiv]), less hacks (no dropout), more stable (different number of layers work as well) and lower complexity than VGG.

Main building block of the network:

resnet

Features are also very good and transferable with (faster) R-CNNs (see below):

resnet-features

Other architectures

  • Deep Learning for 3D shapes: "3D ShapeNets: A Deep Representation for Volumetric Shapes" [PDF]

  • Code and a model for faces: "Free and open source face recognition with deep neural networks." [github]

  • Fast neural networks which can perform arbitrary filters for images: "Deep Edge-Aware Filters" [PDF]

  • Lot's of different models in Caffe's "Model Zoo" [github]

Feature learning and object detection

  • "CNN Features off-the-shelf: an Astounding Baseline for Recognition" [arxiv]

  • First paper about R-CNN: "Rich feature hierarchies for accurate object detection and semantic segmentation" [PDFslides]

  • "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks" [arxivgithubSlides]

fast-rcnn

  • "An Empirical Evaluation of Deep Learning on Highway Driving" [arxiv]

emp

  • "Object Detectors Emerge in Deep Scene CNNs" [arxiv]

emergent-localisation

  • Faster and better features: "Efficient Deep Feature Learning and Extraction via StochasticNets" [arxiv]

Other

  • Code and models for automatic captions of images: "Deep Visual-Semantic Alignments for Generating Image Descriptions"[web posterPDFgithub]

captions

  • Google Deep Dream or neural networks on LSD: "Inceptionism: Going Deeper into Neural Networks" [link,deepdreamer.io/]

Deep dreaming from noise:

deepdream

nnstyle

  • "Automatic Colorization" and it includes a pre-trained model [Link]

color

  • "Learning visual similarity for product design with convolutional neural networks" [PDF]

products

  • Using images and image descriptions to improve search results: "Images Don’t Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank" [arxiv]

  • "How Google Translate squeezes deep learning onto a phone" [post]

phone-nn

  • "What a Deep Neural Network thinks about your #selfie" [blog]

Top selfies according to the ConvNet:

topselfies

  • "Recommending music on Spotify with deep learning" [github]

  • "DeepStereo: Learning to Predict New Views from the World's Imagery" [arxiv]

deepstereo

  • Classifying street signs: "The power of Spatial Transformer Networks" [blog] with "Spatial Transformer Networks" [arxiv]

spatial-nn

  • "Pedestrian Detection with RCNN" [PDF]

DQN

  • Original paper: "Playing Atari with Deep Reinforcement Learning" [arxiv]

  • My popular science article about DQN: "Artificial General Intelligence that plays Atari video games: How did DeepMind do it?" [link]

  • DQN for RoboCup: "Deep Reinforcement Learning in Parameterized Action Space" [arxiv]

RNN

  • Original paper of the best RNN architecture: "Long short-term memory" [PDF]

  • Very good tutorial-like introduction to RNNs by Andrej Karpathy: "The Unreasonable Effectiveness of Recurrent Neural Networks" [link]

  • "Visualizing and Understanding Recurrent Networks" [arxiv]

  • "Composing Music With Recurrent Neural Networks" [blog]

Other promising or useful architectures

  • HTMs by Jeff Hawkins: "Continuous online sequence learning with an unsupervised neural network model"​ [arxiv]

  • Word2vec: "Efficient Estimation of Word Representations in Vector Space" [arxivGoogle code]

  • "Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency" [arxiv]

Framework benchmarks

  • "Comparative Study of Caffe, Neon, Theano and Torch for deep learning" [arxiv]

Their summary: From our experiments, we observe that Theano and Torch are the most easily extensible frameworks. We observe that Torch is best suited for any deep architecture on CPU, followed by Theano. It also achieves the best performance on the GPU for large convolutional and fully connected networks, followed closely by Neon. Theano achieves the best performance on GPU for training and deployment of LSTM networks. Finally Caffe is the easiest for evaluating the performance of standard deep architectures.

  • Very good qualitative analysis: zer0n/deepframeworks: [github]

  • Just performance comparison: soumith/convnet-benchmarks: [github]

  • "Deep Learning Libraries by Language" [link]

Other resources

Credits

Most of the snippets have come to my attention via internal mailing lists of Computational Neuroscience Lab at University of Tartu and London-based visual search company Dream It Get It. I am also reading a weekly newsletter by Data Elixir and checking research papers of the two main deep learning conferences: ICML and NIPS.

 
相关文章
|
11月前
|
JavaScript 前端开发 UED
Vue 项目中如何自定义实用的进度条组件
本文介绍了如何使用Vue.js创建一个灵活多样的自定义进度条组件。该组件可接受进度段数据数组作为输入,动态渲染进度段,支持动画效果和内容展示。当进度超出总长时,超出部分将以红色填充。文章详细描述了组件的设计目标、实现步骤(包括props定义、宽度计算、模板渲染、动画处理及超出部分的显示),并提供了使用示例。通过此组件,开发者可根据项目需求灵活展示进度情况,优化用户体验。资源地址:[https://pan.quark.cn/s/35324205c62b](https://pan.quark.cn/s/35324205c62b)。
532 0
|
并行计算 算法 安全
Bulletproof范围证明之优化
【11月更文挑战第9天】Bulletproof 是一种高效的零知识证明技术,广泛应用于加密货币等领域,用于证明交易金额在合法范围内而不泄露具体数值。本文介绍了 Bulletproof 的基本概念及其在算法、计算资源利用和协议交互等方面的优化方向,探讨了不同应用场景下的优化需求及面临的安全性和兼容性挑战。
387 4
|
移动开发 JavaScript API
HarmonyOS Next 简单上手元服务开发
本文介绍了 HarmonyOS Next 中元服务的开发流程与关键特性。元服务是一种轻量级应用程序形态,支持免安装、秒开直达,适用于听音乐、打车等场景,大幅提升服务获取效率。文章详细讲解了元服务的开发旅程,包括在 AGC 平台上新建项目、修改名称与图标、新增卡片等内容,并提供了代码示例,如 AtomicServiceTabs 的 tab 切换和标题设置、AtomicServiceNavigation 的路由管理等。此外,还探讨了 AtomicServiceWeb 的使用方法,涵盖鸿蒙页面与 h5 页面的数据传递及方法调用。
1206 20
HarmonyOS Next 简单上手元服务开发
|
网络协议 Linux 网络架构
【网络技术】什么是CIDR
【网络技术】什么是CIDR
1524 0
|
11月前
|
存储 JSON 前端开发
菜鸟之路Day39一一登录
本文介绍了登录功能的实现及其相关技术细节,包括会话管理、令牌认证和异常处理等内容。作者通过 Java 实现了一个基于用户名和密码的登录接口,调用服务层和数据库层完成用户验证。同时,文章深入探讨了三种会话跟踪技术:Cookie、Session 和 JWT 令牌。 在 JWT 部分,详细讲解了其生成与校验流程,实现了登录成功后返回 JWT 令牌的功能。此外,文章还介绍了过滤器(Filter)和拦截器(Interceptor)的概念及应用,演示了如何利用它们实现登录校验。 最后,为解决前后端交互中异常响应不统一的问题,定义了一个全局异常处理器 将系统异常以统一的 JSON 格式返回给前端。
308 0
|
监控 Linux Shell
Linux系统中如何查看磁盘情况
【7月更文挑战第11天】在Linux中,`df -h`展示文件系统磁盘使用概览;`du -sh /path`估计特定目录的磁盘占用;`fdisk -l`列出磁盘分区详情;`lsblk`以树状展示块设备信息。这些命令帮助监控磁盘状态,适时管理空间。
1249 0
|
人工智能 自然语言处理 机器人
Midjourney|文心一格prompt教程[Text Prompt(上篇)]:品牌log、App、徽章、插画、头像场景生成,各种风格选择:科技风、运动风
Midjourney|文心一格prompt教程[Text Prompt(上篇)]:品牌log、App、徽章、插画、头像场景生成,各种风格选择:科技风、运动风
Midjourney|文心一格prompt教程[Text Prompt(上篇)]:品牌log、App、徽章、插画、头像场景生成,各种风格选择:科技风、运动风
|
应用服务中间件 Linux Shell
树莓派上安装Nginx竟如此简单,只需两行代码...
树莓派上安装Nginx竟如此简单,只需两行代码...
|
存储 弹性计算 运维
Codeup使用评测
Codeup使用评测
996 0
Codeup使用评测