转自:.html#t-cnn
Method | VOC2007 | VOC2010 | VOC2012 | ILSVRC 2013 | MSCOCO 2015 | Speed |
---|---|---|---|---|---|---|
OverFeat | 24.3% | |||||
R-CNN (AlexNet) | 58.5% | 53.7% | 53.3% | 31.4% | ||
R-CNN (VGG16) | 66.0% | |||||
SPP_net(ZF-5) | 54.2%(1-model), 60.9%(2-model) | 31.84%(1-model), 35.11%(6-model) | ||||
DeepID-Net | 64.1% | 50.3% | ||||
NoC | 73.3% | 68.8% | ||||
Fast-RCNN (VGG16) | 70.0% | 68.8% | 68.4% | 19.7%(@[0.5-0.95]), 35.9%(@0.5) | ||
MR-CNN | 78.2% | 73.9% | ||||
Faster-RCNN (VGG16) | 78.8% | 75.9% | 21.9%(@[0.5-0.95]), 42.7%(@0.5) | 198ms | ||
Faster-RCNN (ResNet-101) | 85.6% | 83.8% | 37.4%(@[0.5-0.95]), 59.0%(@0.5) | |||
SSD300 (VGG16) | 77.2% | 75.8% | 25.1%(@[0.5-0.95]), 43.1%(@0.5) | 46 fps | ||
SSD512 (VGG16) | 79.8% | 78.5% | 28.8%(@[0.5-0.95]), 48.5%(@0.5) | 19 fps | ||
ION | 79.2% | 76.4% | ||||
CRAFT | 75.7% | 71.3% | 48.5% | |||
OHEM | 78.9% | 76.3% | 25.5%(@[0.5-0.95]), 45.9%(@0.5) | |||
R-FCN (ResNet-50) | 77.4% | 0.12sec(K40), 0.09sec(TitianX) | ||||
R-FCN (ResNet-101) | 79.5% | 0.17sec(K40), 0.12sec(TitianX) | ||||
R-FCN (ResNet-101),multi sc train | 83.6% | 82.0% | 31.5%(@[0.5-0.95]), 53.2%(@0.5) | |||
PVANet 9.0 | 89.8% | 84.2% | 750ms(CPU), 46ms(TitianX) |
Detection Results: VOC2012
Deep Neural Networks for Object Detection
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
Rich feature hierarchies for accurate object detection and semantic segmentation
Scalable Object Detection using Deep Neural Networks
Scalable, High-Quality Object Detection
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection
Object Detectors Emerge in Deep Scene CNNs
segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection
Object Detection Networks on Convolutional Feature Maps
Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction
Fast R-CNN
DeepBox: Learning Objectness with Convolutional Networks
Object detection via a multi-region & semantic segmentation-aware CNN model
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Faster R-CNN in MXNet with distributed implementation and data parallelization
Contextual Priming and Feedback for Faster R-CNN
An Implementation of Faster RCNN with Study for Region Sampling
You Only Look Once: Unified, Real-Time Object Detection
darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++
Start Training YOLO with Our Own Data
R-CNN minus R
AttentionNet: Aggregating Weak Directions for Accurate Object Detection
DenseBox: Unifying Landmark Localization with End to End Object Detection
SSD: Single Shot MultiBox Detector
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Adaptive Object Detection Using Adjacency and Zoom Prediction
G-CNN: an Iterative Grid Based Object Detector
Factors in Finetuning Deep Model for object detection
Factors in Finetuning Deep Model for Object Detection with Long-tail Distribution
We don’t need no bounding-boxes: Training object class detectors using only human verification
HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection
A MultiPath Network for Object Detection
CRAFT Objects from Images
Training Region-based Object Detectors with Online Hard Example Mining
Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection
Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers
R-FCN: Object Detection via Region-based Fully Convolutional Networks
Weakly supervised object detection using pseudo-strong labels
Recycle deep features for better object detection
A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
Multi-stage Object Detection with Group Recursive Learning
Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection
PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection
PVANet: Lightweight Deep Neural Networks for Real-time Object Detection
Gated Bi-directional CNN for Object Detection
Crafting GBD-Net for Object Detection
StuffNet: Using ‘Stuff’ to Improve Object Detection
Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene
Hierarchical Object Detection with Deep Reinforcement Learning
Learning to detect and localize many objects from few examples
Speed/accuracy trade-offs for modern convolutional object detectors
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
Feature Pyramid Networks for Object Detection
Action-Driven Object Detection with Top-Down Visual Attentions
Beyond Skip Connections: Top-Down Modulation for Object Detection
YOLO9000: Better, Faster, Stronger
Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2
DSSD : Deconvolutional Single Shot Detector
Wide-Residual-Inception Networks for Real-time Object Detection
Attentional Network for Visual Object Detection
Learning Chained Deep Features and Classifiers for Cascade in Object Detection
DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling
.10295
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection
Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
Spatial Memory for Context Reasoning in Object Detection
Improving Object Detection With One Line of Code
Accurate Single Stage Detector Using Recurrent Rolling Convolution
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection
.05775
Learning Object Class Detectors from Weakly Annotated Video
Analysing domain shift factors between videos and images for object detection
Video Object Recognition
Deep Learning for Saliency Prediction in Natural Video
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos
Object Detection from Video Tubelets with Convolutional Neural Networks
Object Detection in Videos with Tubelets and Multi-context Cues
Context Matters: Refining Object Detection in Video with Recurrent Neural Networks
CNN Based Object Detection in Large Video Images
Object Detection in Videos with Tubelet Proposal Networks
Flow-Guided Feature Aggregation for Video Object Detection
Video Object Detection using Faster R-CNN
Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks
Learning Rich Features from RGB-D Images for Object Detection and Segmentation
Differential Geometry Boosts Convolutional Neural Networks for Object Detection
A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation
.03347
This task involves predicting the salient regions of an image given by human eye fixations.
Best Deep Saliency Detection Models (CVPR 2016 & 2015)
.html
Large-scale optimization of hierarchical features for saliency prediction in natural images
Predicting Eye Fixations using Convolutional Neural Networks
Saliency Detection by Multi-Context Deep Learning
DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection
SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection
Shallow and Deep Convolutional Networks for Saliency Prediction
Recurrent Attentional Networks for Saliency Detection
Two-Stream Convolutional Networks for Dynamic Saliency Prediction
Unconstrained Salient Object Detection
Unconstrained Salient Object Detection via Proposal Subset Optimization
DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection
Salient Object Subitizing
Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection
Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection
A Deep Multi-Level Network for Saliency Prediction
Visual Saliency Detection Based on Multiscale Deep CNN Features
A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection
Deeply supervised salient object detection with short connections
Weakly Supervised Top-down Salient Object Detection
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
Visual Saliency Prediction Using a Mixture of Deep Neural Networks
A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network
Saliency Detection by Forward and Backward Cues in Deep-CNNs
.00152
Supervised Adversarial Networks for Image Saliency Detection
.07242
Deep Learning For Video Saliency Detection
Visual Relationship Detection with Language Priors
ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection
Visual Translation Embedding Network for Visual Relation Detection
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
Detecting Visual Relationships with Deep Relational Networks
Multi-view Face Detection Using Deep Convolutional Neural Networks
From Facial Parts Responses to Face Detection: A Deep Learning Approach
Compact Convolutional Neural Network Cascade for Face Detection
Face Detection with End-to-End Integration of a ConvNet and a 3D Model
CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection
Finding Tiny Faces
Towards a Deep Learning Framework for Unconstrained Face Detection
Supervised Transformer Network for Efficient Face Detection
UnitBox: An Advanced Object Detection Network
Bootstrapping Face Detection with Hard Negative Examples
Grid Loss: Detecting Occluded Faces
A Multi-Scale Cascade Fully Convolutional Network Face Detector
Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks
Face Detection using Deep Learning: An Improved Faster RCNN Approach
Faceness-Net: Face Detection through Deep Facial Part Responses
Multi-Path Region-Based Convolutional Neural Network for Accurate Detection of Unconstrained “Hard Faces”
End-To-End Face Detection and Recognition
.10818
Deep Convolutional Network Cascade for Facial Point Detection
Facial Landmark Detection by Deep Multi-task Learning
A Recurrent Encoder-Decoder Network for Sequential Face Alignment
Detecting facial landmarks in the video based on a hybrid framework
Deep Constrained Local Models for Facial Landmark Detection
Effective face landmark localization via single deep network
A Convolution Tree with Deconvolution Branches: Exploiting Geometric Relationships for Single Shot Keypoint Detection
.01880
End-to-end people detection in crowded scenes
Detecting People in Artwork with CNNs
Deep Multi-camera People Detection
Context-aware CNNs for person head detection
Pedestrian Detection aided by Deep Learning Semantic Tasks
Deep Learning Strong Parts for Pedestrian Detection
Deep convolutional neural networks for pedestrian detection
Scale-aware Fast R-CNN for Pedestrian Detection
New algorithm improves speed and accuracy of pedestrian detection
Pushing the Limits of Deep CNNs for Pedestrian Detection
A Real-Time Deep Learning Pedestrian Detector for Robot Navigation
A Real-Time Pedestrian Detector using Deep Learning for Human-Aware Navigation
Is Faster R-CNN Doing Well for Pedestrian Detection?
Reduced Memory Region Based Deep Convolutional Neural Network Detection
Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection
Multispectral Deep Neural Networks for Pedestrian Detection
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters
DAVE: A Unified Framework for Fast Vehicle Detection and Annotation
Evolving Boxes for fast Vehicle Detection
Traffic-Sign Detection and Classification in the Wild
Holistically-Nested Edge Detection
Unsupervised Learning of Edges
Pushing the Boundaries of Boundary Detection using Deep Learning
Convolutional Oriented Boundaries
Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks
Richer Convolutional Features for Edge Detection
Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs
DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images
SRN: Side-output Residual Network for Object Symmetry Detection in the Wild
Deep Fruit Detection in Orchards
Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards
Objects as context for part detection
.09529
Deep Deformation Network for Object Landmark Localization
Fashion Landmark Detection in the Wild
Deep Learning for Fast and Accurate Fashion Item Detection
OSMDeepOD - OSM and Deep Learning based Object Detection from Aerial Imagery (formerly known as “OSM-Crosswalk-Detection”)
Selfie Detection by Synergy-Constraint Based Convolutional Neural Network
Associative Embedding:End-to-End Learning for Joint Detection and Grouping
Deep Cuboid Detection: Beyond 2D Bounding Boxes
Automatic Model Based Dataset Generation for Fast and Accurate Crop and Weeds Detection
Deep Learning Logo Detection with Data Expansion by Synthesising Context
Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks
Automatic Handgun Detection Alarm in Videos Using Deep Learning
DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
Scale-aware Pixel-wise Object Proposal Networks
Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
Learning to Segment Object Proposals via Recursive Neural Networks
Learning Detection with Diverse Proposals
ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond
Improving Small Object Proposals for Company Logo Detection
Beyond Bounding Boxes: Precise Localization of Objects in Images
Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning
Weakly Supervised Object Localization Using Size Estimates
Active Object Localization with Deep Reinforcement Learning
Localizing objects using referring expressions
LocNet: Improving Localization Accuracy for Object Detection
Learning Deep Features for Discriminative Localization
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
Convolutional Feature Maps: Elements of efficient (and accurate) CNN-based object detection
Towards Good Practices for Recognition & Detection
TensorBox: a simple framework for training neural networks to detect objects in images
Object detection in torch: Implementation of some object detection frameworks in torch
Using DIGITS to train an Object Detection network
FCN-MultiBox Detector
KittiBox: A car detection model implemented in Tensorflow.
BeaverDam: Video annotation tool for deep learning training labels
Convolutional Neural Networks for Object Detection
/
Introducing automatic object detection to visual search (Pinterest)
Deep Learning for Object Detection with DIGITS
Analyzing The Papers Behind Facebook’s Computer Vision Approach
How to Train a Deep-Learned Object Detection Model in the Microsoft Cognitive Toolkit
Object Detection in Satellite Imagery, a Low Overhead Approach
You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks
Faster R-CNN Pedestrian and Car Detection
Small U-Net for vehicle detection
Region of interest pooling explained
本文发布于:2024-02-01 12:45:41,感谢您对本站的认可!
本文链接:https://www.4u4v.net/it/170676274336688.html
版权声明:本站内容均来自互联网,仅供演示用,请勿用于商业和其他非法用途。如果侵犯了您的权益请与我们联系,我们将在24小时内删除。
留言与评论(共有 0 条评论) |