Paper Reading: Perceptual Generative Adversarial Networks for Small Object Detection

简介: Perceptual Generative Adversarial Networks for Small Object Detection2017-07-11  19:47:46   CVPR 2017   This paper use GAN to handle the issue of s...

Perceptual Generative Adversarial Networks for Small Object Detection

2017-07-11  19:47:46   CVPR 2017

 

  This paper use GAN to handle the issue of small object detection which is a very hard problem in general object detection. As shown in the following figures, small object and large objects usually shown different representations from the feature level.   

  Thus, it is possbile to use Percetual GAN to super-resolution of feature maps of small objects to obtain better detection performance.  

  It consists of two subnetworks, i.e., a generator network and a perceptual discriminator network. Specifically, the generator is a deep residual based feature generative model which transforms the original poor features of small objects to highly discriminative ones by introducing fine-grained details from lower-level layers, achieving “super-resolution” on the intermediate representations. 

  

  Different from normal GAN, this network also introduce a new perceptual loss tailored from the detection purpose. That is to say, the discriminator not only need to deal with the adversarial loss, but also need to justify the detection accuray benefiting from the generated super-resolved features with a perceptual loss.  

 

  The proposed contributions: 

  (1) We are the first to successfully apply GAN-alike models to solve the challenging small-scale object detection problems. 

   (2) We introduce a new conditional generator model that learns the additive residual representation between large and small objects, instead of generating the complete representations as before. 

   (3) We introduce a new perceptual discriminator that provides more comprehensive supervision beneficial for detections, instead of barely differentiating fake and real. 

   (4) Successful applications on traffic sign detection and pedestrian detection have been achieved with the state-of-the-art performance.  

  Figure 2. Training procedure of object detection network based on the Perceptual GAN.

  

  As shown in Figure 2, the generator network aims to generate super-resoved representation for the small object. 

  The discriminator includes two branches, i.e. 

    1. the adversarial branch  for differentiating between the generated superresolved representation. 

    2. the perception branch for justifying the detection accurcy benefiting from the generation representation. 


  ==>> Dicriminative Network Architecture: 

  The D network need to justify the dection accuracy benefiting from the generated super-resovled feature

  Given the adversarial loss $L_{dis_a}$ and the perceptual loss $L_{dis_p}$ , a final loss function Ldis can be produced as weighted sum of both individual loss components. Given weighting parameters w1 and w2, we define Ldis = w1 × Ldis_a + w2 × Ldis_p to encourage the generator network to generate super-resolved representation with high detection accuracy. Here we set both w1 and w2 to be one. 

 

    

 


  


 

  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

  

相关文章
|
编解码 人工智能 缓存
自学记录鸿蒙API 13:实现多目标识别Object Detection
多目标识别技术广泛应用于动物识别、智能相册分类和工业检测等领域。本文通过学习HarmonyOS的Object Detection API(API 13),详细介绍了如何实现一个多目标识别应用,涵盖从项目初始化、核心功能实现到用户界面设计的全过程。重点探讨了目标类别识别、边界框生成、高精度置信度等关键功能,并分享了性能优化与功能扩展的经验。最后,作者总结了学习心得,并展望了未来结合语音助手等创新应用的可能性。如果你对多目标识别感兴趣,不妨从基础功能开始,逐步实现自己的创意。
464 60
|
机器学习/深度学习 人工智能 并行计算
【YOLOv5】LabVIEW+YOLOv5快速实现实时物体识别(Object Detection)含源码
在LabVIEW中调用YOLOv5快速实现实时物体识别,感受丝滑般物体识别
907 0
|
机器学习/深度学习 算法 安全
基于深度学习的目标检测的介绍(Introduction to object detection with deep learning)
基于深度学习的目标检测的介绍(Introduction to object detection with deep learning)
569 0
|
UED 开发工具 iOS开发
Uno Platform大揭秘:如何在你的跨平台应用中,巧妙融入第三方库与服务,一键解锁无限可能,让应用功能飙升,用户体验爆棚!
【8月更文挑战第31天】Uno Platform 让开发者能用同一代码库打造 Windows、iOS、Android、macOS 甚至 Web 的多彩应用。本文介绍如何在 Uno Platform 中集成第三方库和服务,如 Mapbox 或 Google Maps 的 .NET SDK,以增强应用功能并提升用户体验。通过 NuGet 安装所需库,并在 XAML 页面中添加相应控件,即可实现地图等功能。尽管 Uno 平台减少了平台差异,但仍需关注版本兼容性和性能问题,确保应用在多平台上表现一致。掌握正确方法,让跨平台应用更出色。
356 0
|
数据采集 API TensorFlow
简化目标检测流程:深入探讨TensorFlow Object Detection API的高效性与易用性及其与传统方法的比较分析
【8月更文挑战第31天】TensorFlow Object Detection API 是一项强大的工具,集成多种先进算法,支持 SSD、Faster R-CNN 等模型架构,并提供预训练模型,简化目标检测的开发流程。用户只需准备数据集并按要求处理,选择预训练模型进行微调训练即可实现目标检测功能。与传统方法相比,该 API 极大地减少了工作量,提供了从数据预处理到结果评估的一站式解决方案,降低了目标检测的技术门槛,使初学者也能快速搭建高性能系统。未来,我们期待看到更多基于此 API 的创新应用。
257 0
|
XML TensorFlow API
TensorFlow Object Detection API 超详细教程和踩坑过程
TensorFlow Object Detection API 超详细教程和踩坑过程
630 1
|
传感器 自动驾驶 算法
论文阅读:RRPN:RADAR REGION PROPOSAL NETWORK FOR OBJECT DETECTION IN AUTONOMOUS VEHICLES
论文阅读:RRPN:RADAR REGION PROPOSAL NETWORK FOR OBJECT DETECTION IN AUTONOMOUS VEHICLES
557 1
论文阅读:RRPN:RADAR REGION PROPOSAL NETWORK FOR OBJECT DETECTION IN AUTONOMOUS VEHICLES
|
机器学习/深度学习 XML 人工智能
基于Tensorflow2.x Object Detection API构建自定义物体检测器
基于Tensorflow2.x Object Detection API构建自定义物体检测器的保姆级教程,详细地描述了代码框架结构、数据集的标准方法,标注文件的数据处理、模型流水线的配置、模型的训练、评估、推理全流程。
530 1
|
编解码 自动驾驶 测试技术
【论文速递】ECCV2022 - PETR: Position Embedding Transformation for Multi-View 3D Object Detection
【论文速递】ECCV2022 - PETR: Position Embedding Transformation for Multi-View 3D Object Detection
664 0
【论文速递】ECCV2022 - PETR: Position Embedding Transformation for Multi-View 3D Object Detection
|
数据可视化 数据挖掘 测试技术
【计算机视觉】Open-Vocabulary Object Detection 论文工作总结
Open-Vocabulary Object Detection (OVD)可以翻译为**“面向开放词汇下的目标检测”,**该任务和 zero-shot object detection 非常类似,核心思想都是在可见类(base class)的数据上进行训练,然后完成对不可见类(unseen/ target)数据的识别和检测,除了核心思想类似外,很多论文其实对二者也没有进行很好的区分。
1538 0

热门文章

最新文章