Citation: Shan, D.; Xu, Y.; Zhang, P.; Wang, X.; He, D.; Zhang, C.; Zhou, M.; Yu, G. DPSSD: Dual-Path Single-Shot Detector. Sensors 2022, 22, 4616. https://doi.org/10.3390/s22124616

Academic Editor: Marcin Kowalski

Received: 9 May 2022

Accepted: 13 June 2022

Published: 18 June 2022

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

sensors

Article

DPSSD: Dual-Path Single-Shot Detector

Dongri Shan *, Yalu Xu, Peng Zhang, Xiaofang Wang, Dongmei He, Chenglong Zhang, Maohui Zhou and Guoqi Yu

School of Mechanical Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250300, China; 1043119034@stu.qlu.edu.cn (Y.X.); 1043119003@stu.qlu.edu.cn (C.Z.); 1043119049@stu.qlu.edu.cn (M.Z.); 1043119037@stu.qlu.edu.cn (G.Y.)

School of Information and Automation Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250300, China; zp@qlu.edu.cn (P.Z.); wxf2012@stu.xjtu.edu.cn (X.W.); hedm@sdas.org (D.H.)

* Correspondence: shandongri@qlu.edu.cn; Tel.: +86-138-6406-5008

Abstract: Object detection is one of the most important and challenging branches of computer vision. It is widely used in everyday applications such as security surveillance and autonomous driving. To extract richer feature information for object detection and to better handle objects of varying scale, we propose a novel dual-path multi-scale object detection paradigm and, based on it, design a single-stage general object detection algorithm called the Dual-Path Single-Shot Detector (DPSSD). The dual path, i.e., the residual path and the concatenation path, ensures that shallow features can be more easily utilized to improve detection accuracy. Our improved dual-path network is more adaptable to multi-scale object detection tasks, and we combine it with the feature fusion module to generate a multi-scale feature learning paradigm called the “Dual-Path Feature Pyramid”. We trained the models on the PASCAL VOC and COCO datasets with 320-pixel and 512-pixel inputs, respectively, and performed inference experiments to validate the structures in the neural network. The experimental results show that our algorithm has an advantage over anchor-based single-stage object detection algorithms and achieves an advanced level of average accuracy. Researchers can replicate the results reported in this paper.

Keywords: convolutional neural networks; object detection; single-stage; multi-scale

1. Introduction

After the success of deep convolutional neural networks (DCNNs) [ ] in the field of image classification, object detection algorithms also adopted deep-learning technology and have achieved significant progress [ ]. These deep-learning-based algorithms perform much better than traditional algorithms because manually designed features are replaced with feature representations computed by convolutional neural networks. However, multi-scale feature learning remains a critical problem for deep-learning-based detection algorithms. To address this problem and improve the detection performance of anchor-based single-stage multi-scale detectors, we conducted a relevant literature search and experiments.

In general, objects appear in complex environments and vary greatly in scale; for example, in applications such as pedestrian detection, face detection and autonomous driving, the algorithm has to be robust to changes in the scale of the object [ ]. It is critical to learn robust and discriminative features to obtain good detection performance. There are four main paradigms for addressing the multi-scale feature learning problem: the image pyramid, the prediction pyramid, integrated features and the feature pyramid (Figure 1). SNIP [ ] uses the image pyramid to solve the multi-scale problem, where each layer is responsible for a certain range of scales (Figure 1a). In this approach, the same sample must be resized to different scales and fed to the network repeatedly during training, which results in many redundant calculations. By fusing the shallow features rich

Sensors 2022, 22, 4616. https://doi.org/10.3390/s22124616 https://www.mdpi.com/journal/sensors
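The redundancy of the image-pyramid paradigm described above can be illustrated with a small sketch. It is not SNIP's implementation or the authors' code: the function names, the scale factors and the valid size range are hypothetical values chosen for illustration, and the cost model simply assumes that backbone computation grows linearly with the number of input pixels.

```python
# Illustrative sketch only. The scales and the valid size range are made-up
# values, not the settings used by SNIP or by this paper.


def pyramid_shapes(height, width, scales):
    """Return the (height, width) of the image after resizing by each scale."""
    return [(round(height * s), round(width * s)) for s in scales]


def relative_cost(height, width, scales):
    """Cost of running the whole pyramid relative to one pass at scale 1.0.

    Assumption: convolutional cost is proportional to the pixel count.
    """
    base = height * width
    total = sum(h * w for h, w in pyramid_shapes(height, width, scales))
    return total / base


def responsible_levels(object_size, scales, valid_range):
    """Indices of pyramid levels whose resized object size lies in valid_range.

    This mimics the idea that each level only handles a certain range of
    object scales.
    """
    low, high = valid_range
    return [i for i, s in enumerate(scales) if low <= object_size * s <= high]


if __name__ == "__main__":
    scales = [0.5, 1.0, 2.0]
    print(pyramid_shapes(320, 320, scales))
    print(relative_cost(320, 320, scales))
    print(responsible_levels(40, scales, (32, 128)))
```

Under these assumptions, a three-level pyramid with scales 0.5, 1.0 and 2.0 costs 5.25 times as much as a single pass over the original 320 × 320 image, even though an object 40 pixels wide is handled by only two of the three levels.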
