Paper Reading: Perceptual Generative Adversarial Networks for Small Object Detection

简介: Perceptual Generative Adversarial Networks for Small Object Detection2017-07-11  19:47:46   CVPR 2017   This paper use GAN to handle the issue of s...

Perceptual Generative Adversarial Networks for Small Object Detection

2017-07-11  19:47:46   CVPR 2017

 

  This paper use GAN to handle the issue of small object detection which is a very hard problem in general object detection. As shown in the following figures, small object and large objects usually shown different representations from the feature level.   

  Thus, it is possbile to use Percetual GAN to super-resolution of feature maps of small objects to obtain better detection performance.  

  It consists of two subnetworks, i.e., a generator network and a perceptual discriminator network. Specifically, the generator is a deep residual based feature generative model which transforms the original poor features of small objects to highly discriminative ones by introducing fine-grained details from lower-level layers, achieving “super-resolution” on the intermediate representations. 

  

  Different from normal GAN, this network also introduce a new perceptual loss tailored from the detection purpose. That is to say, the discriminator not only need to deal with the adversarial loss, but also need to justify the detection accuray benefiting from the generated super-resolved features with a perceptual loss.  

 

  The proposed contributions: 

  (1) We are the first to successfully apply GAN-alike models to solve the challenging small-scale object detection problems. 

   (2) We introduce a new conditional generator model that learns the additive residual representation between large and small objects, instead of generating the complete representations as before. 

   (3) We introduce a new perceptual discriminator that provides more comprehensive supervision beneficial for detections, instead of barely differentiating fake and real. 

   (4) Successful applications on traffic sign detection and pedestrian detection have been achieved with the state-of-the-art performance.  

  Figure 2. Training procedure of object detection network based on the Perceptual GAN.

  

  As shown in Figure 2, the generator network aims to generate super-resoved representation for the small object. 

  The discriminator includes two branches, i.e. 

    1. the adversarial branch  for differentiating between the generated superresolved representation. 

    2. the perception branch for justifying the detection accurcy benefiting from the generation representation. 


  ==>> Dicriminative Network Architecture: 

  The D network need to justify the dection accuracy benefiting from the generated super-resovled feature

  Given the adversarial loss $L_{dis_a}$ and the perceptual loss $L_{dis_p}$ , a final loss function Ldis can be produced as weighted sum of both individual loss components. Given weighting parameters w1 and w2, we define Ldis = w1 × Ldis_a + w2 × Ldis_p to encourage the generator network to generate super-resolved representation with high detection accuracy. Here we set both w1 and w2 to be one. 

 

    

 


  


 

  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

  

相关文章
|
1天前
|
算法 BI 计算机视觉
[Initial Image Segmentation Generator]论文实现:Efficient Graph-Based Image Segmentation
[Initial Image Segmentation Generator]论文实现:Efficient Graph-Based Image Segmentation
9 1
|
9月前
|
机器学习/深度学习 自然语言处理 算法
TASLP21-Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations
事件抽取是自然语言处理的一项基本任务。找到事件论元(如事件参与者)的角色对于事件抽取至关重要。
62 0
|
9月前
|
机器学习/深度学习 人工智能 自然语言处理
OneIE:A Joint Neural Model for Information Extraction with Global Features论文解读
大多数现有的用于信息抽取(IE)的联合神经网络模型使用局部任务特定的分类器来预测单个实例(例如,触发词,关系)的标签,而不管它们之间的交互。
111 0
|
9月前
|
机器学习/深度学习 编解码 数据可视化
Speech Emotion Recognition With Local-Global aware Deep Representation Learning论文解读
语音情感识别(SER)通过从语音信号中推断人的情绪和情感状态,在改善人与机器之间的交互方面发挥着至关重要的作用。尽管最近的工作主要集中于从手工制作的特征中挖掘时空信息,但我们探索如何从动态时间尺度中建模语音情绪的时间模式。
85 0
|
9月前
|
自然语言处理 算法 知识图谱
DEGREE: A Data-Efficient Generation-Based Event Extraction Model论文解读
事件抽取需要专家进行高质量的人工标注,这通常很昂贵。因此,学习一个仅用少数标记示例就能训练的数据高效事件抽取模型已成为一个至关重要的挑战。
75 0
|
12月前
|
机器学习/深度学习 存储 传感器
Automated defect inspection system for metal surfaces based on deep learning and data augmentation
简述:卷积变分自动编码器(CVAE)生成特定的图像,再使用基于深度CNN的缺陷分类算法进行分类。在生成足够的数据来训练基于深度学习的分类模型之后,使用生成的数据来训练分类模型。
100 0
|
12月前
|
机器学习/深度学习 算法 数据挖掘
A Generative Adversarial Network-based Deep Learning Method for Low-quality Defect ImageReconstructi
本文提出了一种基于生成对抗网络 (GAN) 的 DL 方法,用于低质量缺陷图像识别。 GAN用于重建低质量缺陷图像,并建立VGG16网络识别重建图像。
102 0
|
12月前
|
机器学习/深度学习 移动开发 算法
DISCOBOX: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision
定位和识别物体的能力是人类视觉的核心。这促使视觉社区研究对象检测 [1] 作为一项基本的视觉识别任务。在检测之上进一步引入实例分割 [2] 以预测前景对象掩码,从而实现像素级精度的定位。
60 0
|
机器学习/深度学习 算法 数据挖掘
【多标签文本分类】Improved Neural Network-based Multi-label Classification with Better Initialization ……
【多标签文本分类】Improved Neural Network-based Multi-label Classification with Better Initialization ……
【多标签文本分类】Improved Neural Network-based Multi-label Classification with Better Initialization ……
《Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition》电子版地址
Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition
73 0
《Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition》电子版地址