Article
One Spatio-Temporal Sharpening Attention Mechanism for Light-Weight YOLO Models Based on Sharpening Spatial Attention
Mengfan Xue 1, Minghao Chen 1,2, Dongliang Peng 1,*, Yunfei Guo 1 and Huajie Chen 1
Citation: Xue, M.; Chen, M.; Peng, D.; Guo, Y.; Chen, H. One Spatio-Temporal Sharpening Attention Mechanism for Light-Weight YOLO Models Based on Sharpening Spatial Attention. Sensors 2021, 21, 7949. https://doi.org/10.3390/s21237949
Academic Editor: Stefanos Kollias
Received: 12 October 2021
Accepted: 23 November 2021
Published: 28 November 2021
1 School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China; xuemf@hdu.edu.cn (M.X.); 192060268@hdu.edu.cn (M.C.); gyf@hdu.edu.cn (Y.G.); chj247@hdu.edu.cn (H.C.)
2 HDU-ITMO Joint Institute, Hangzhou Dianzi University, Hangzhou 310018, China
* Correspondence: dlpeng@hdu.edu.cn
Abstract: Attention mechanisms have shown great potential for improving the performance of deep convolutional neural networks (CNNs). However, many existing methods are dedicated to developing channel or spatial attention modules with large numbers of parameters, and such complex attention modules inevitably affect the performance of CNNs. In our experiments embedding the Convolutional Block Attention Module (CBAM) in the light-weight model YOLOv5s, CBAM slows the model down, increases its complexity, and reduces its average precision, whereas the Squeeze-and-Excitation (SE) module, used within CBAM, has a positive effect on the model. To replace the spatial attention module in CBAM and offer a suitable arrangement of channel and spatial attention modules, this paper proposes a Spatio-Temporal Sharpening Attention Mechanism (SSAM), which sequentially infers intermediate attention maps through a channel attention module and a Sharpening Spatial Attention (SSA) module. By introducing a sharpening filter into the spatial attention module, we obtain an SSA module with low complexity. To find a scheme that combines our SSA module with the SE module or the Efficient Channel Attention (ECA) module and yields the largest improvement in models such as YOLOv5s and YOLOv3-tiny, we perform various replacement experiments and arrive at a best scheme: embed channel attention modules in the backbone and neck of the model and integrate SSAM into the YOLO head. We verify the positive effect of our SSAM on two general object detection datasets, VOC2012 and MS COCO2017: the former is used to obtain a suitable scheme, and the latter to demonstrate the versatility of our method in complex scenes. Experimental results on both datasets show clear improvements in average precision and detection performance, which demonstrates the usefulness of our SSAM in light-weight YOLO models. Furthermore, visualization results show that our SSAM enhances localization ability.
Keywords: attention mechanism; object detection; YOLO; light-weight model; sharpening filter
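For intuition, the sketch below illustrates the overall idea in PyTorch: a channel attention module (an SE-style module is used here) followed by a spatial attention module that sharpens the pooled feature maps with a fixed filter before computing the spatial weights. The class names, the 3x3 sharpening kernel, and the layer settings are illustrative assumptions, not the authors' exact SSAM implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SEChannelAttention(nn.Module):
    """SE-style channel attention: global average pool + bottleneck MLP."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction, bias=False),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels, bias=False),
            nn.Sigmoid(),
        )

    def forward(self, x):
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w

class SharpeningSpatialAttention(nn.Module):
    """Spatial attention whose pooled maps are sharpened by a fixed filter."""
    def __init__(self):
        super().__init__()
        # Fixed 3x3 sharpening kernel -- an assumption for illustration;
        # the filter actually used in the paper may differ.
        kernel = torch.tensor([[0., -1., 0.],
                               [-1.,  5., -1.],
                               [0., -1., 0.]]).view(1, 1, 3, 3)
        self.register_buffer("kernel", kernel)
        self.conv = nn.Conv2d(2, 1, kernel_size=7, padding=3, bias=False)

    def forward(self, x):
        # Channel-wise average and max pooling, as in CBAM's spatial branch.
        avg_map = torch.mean(x, dim=1, keepdim=True)
        max_map, _ = torch.max(x, dim=1, keepdim=True)
        pooled = torch.cat([avg_map, max_map], dim=1)
        # Sharpen each pooled map to emphasize edges before computing weights.
        sharpened = F.conv2d(pooled, self.kernel.expand(2, 1, 3, 3),
                             padding=1, groups=2)
        return x * torch.sigmoid(self.conv(sharpened))

class SSAMSketch(nn.Module):
    """Channel attention followed by sharpening spatial attention."""
    def __init__(self, channels):
        super().__init__()
        self.channel_att = SEChannelAttention(channels)
        self.spatial_att = SharpeningSpatialAttention()

    def forward(self, x):
        return self.spatial_att(self.channel_att(x))
```

In the arrangement the abstract describes, a channel attention module alone (SE or ECA) would be placed in the backbone and neck of the detector, while a combined module along the lines of the sketch above would be integrated into the YOLO head.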
1. Introduction
Convolutional neural networks have achieved great progress in the field of visual object detection and tracking owing to their rich and expressive representations. Researchers typically study innovations in network depth, width, and structure [1–3]. In addition, the most important indicators for evaluating an object detector are accuracy and speed. Broadly, neural-network-based visual object detectors can be divided into one-stage detectors [4–10] and two-stage detectors [11,12]. The most representative two-stage object detector is the R-CNN [13] series, which generally extracts image features with a feature extraction network, feeds the feature maps into a region proposal network to generate regions of interest as a first prediction, and then performs classification and regression as a second prediction. In contrast, a one-stage detector performs the object detection task in a single prediction step, combining classification and localization. Compared with the two-stage detector, the one-stage detector gains a substantial speed increase at