SalfMix一种基于显著性图的单图像数据增强技术-2021年

ID：37213

阅读量：0

大小：2.67 MB

页数：15页

时间：2023-03-03

金币：10

上传者：战必胜

sensors

Article

SalfMix: A Novel Single Image-Based Data Augmentation

Technique Using a Saliency Map

Jaehyeop Choi , Chaehyeon Lee , Donggyu Lee and Heechul Jung *



 

Citation: Choi, J.; Lee, C.; Lee, D.;

Jung, H. SalfMix: A Novel Single

Image-Based Data Augmentation

Technique Using a Saliency Map.

Sensors 2021, 21, 8444. https://

doi.org/10.3390/s21248444

Academic Editor: Nunzio Cennamo

Received: 19 November 2021

Accepted: 14 December 2021

Published: 17 December 2021

Publisher’s Note: MDPI stays neutral

with regard to jurisdictional claims in

published maps and institutional afﬁl-

iations.

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

Department of Artiﬁcial Intelligence, Kyungpook National University, Daegu 41566, Korea;

jaebb95@knu.ac.kr (J.C.); 123456ccdd@knu.ac.kr (C.L.); dglee@knu.ac.kr (D.L.)

* Correspondence: heechul@knu.ac.kr; Tel.: +82-53-950-4558

Abstract:

Modern data augmentation strategies such as Cutout, Mixup, and CutMix, have achieved

good performance in image recognition tasks. Particularly, the data augmentation approaches,

such as Mixup and CutMix, that mix two images to generate a mixed training image, could generalize

convolutional neural networks better than single image-based data augmentation approaches such

as Cutout. We focus on the fact that the mixed image can improve generalization ability, and we

wondered if it would be effective to apply it to a single image. Consequently, we propose a new

data augmentation method to produce a self-mixed image based on a saliency map, called SalfMix.

Furthermore, we combined SalfMix with state-of-the-art two images-based approaches, such as

Mixup, SaliencyMix, and CutMix, to increase the performance, called HybridMix. The proposed

SalfMix achieved better accuracies than Cutout, and HybridMix achieved state-of-the-art perfor-

mance on three classiﬁcation datasets: CIFAR-10, CIFAR-100, and TinyImageNet-200. Furthermore,

HybridMix achieved the best accuracy in object detection tasks on the VOC dataset, in terms of mean

average precision.

Keywords:

deep learning; data augmentation; convolutional neural network (CNN); image classification

1. Introduction

Deep learning has achieved remarkable performances in various computer vision

tasks such as image classiﬁcation [

–

], segmentation [

], detection [

–

], and image

quality assessment [

]. Generally, deep neural networks (DNNs) require large training

data to achieve high performance. Data augmentation techniques can increase the limited

size of training data and are important elements in the training process of DNNs to

improve their generalization performances. Data augmentation techniques have been

used to train AlexNet [

], and geometric data augmentation approaches have been used

to reduce Top-5 error rates of ImageNet classiﬁcation tasks, such as ﬂip, rotation, crop,

and translation [

]. In 2014, VGG neural networks were proposed, and the scale

jittering data augmentation technique was introduced by [

]. The Cutout method, which is

a representative data augmentation approach, performs regional dropout, where pixel

values of a randomly selected region of an input image are removed [

]. Regional dropout

approaches have shown better recognition rates than previous geometric transformation

strategies [

]. These data augmentation approaches are performed on a single image,

as shown in Figure 1.

In the recent data augmentation studies, two training images are selected and mixed

during network training, and mixed images are used for training a convolutional neural

network (CNN), such as Mixup [

] and CutMix [

]. These techniques further improve

generalization performance than traditional single image-based approaches. Most recent

research works such as SaliencyMix [

], PuzzleMix [

], ResizeMix [

], and SnapMix [

]

focus on the mixing of two images for data augmentation. Especially, when CutMix mixes

images, random patches are cut and pasted on other images; however, saliency-guided

approaches have recently been proposed and achieve better performances than the original

Sensors 2021, 21, 8444. https://doi.org/10.3390/s21248444 https://www.mdpi.com/journal/sensors

资源描述：

当前文档最多预览五页，下载文档查看全文

侵权申诉



1 1 2 3 4 5 / 15



此文档下载收益归作者所有

当前文档最多预览五页，下载文档查看全文

版权提示

温馨提示：
1. 部分包含数学公式或PPT动画的文件，查看预览时可能会显示错乱或异常，文件下载后无此问题，请放心下载。
2. 本文档由用户上传，版权归属用户，天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容，确认文档内容符合您的需求后进行下载，若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误，付费完成后未能成功下载的用户请联系客服处理。

大家都在看

近期热门

SalfMix一种基于显著性图的单图像数据增强技术-2021年

最近更新

大家都在看

相关文章

相关标签