基于深度强化学习的交互式对象分割自动内点定位-2021年

ID：37267

阅读量：0

大小：3.11 MB

页数：14页

时间：2023-03-03

金币：10

上传者：战必胜

sensors

Article

Automatic Inside Point Localization with Deep Reinforcement

Learning for Interactive Object Segmentation

Guoqing Li

1,2

, Guoping Zhang

1,2

and Chanchan Qin

3,4,



 

Citation: Li, G.; Zhang, G.; Qin, C.

Automatic Inside Point Localization

with Deep Reinforcement Learning

for Interactive Object Segmentation.

Sensors 2021, 21, 6100. https://

doi.org/10.3390/s21186100

Academic Editor: Nunzio Cennamo

Received: 1 August 2021

Accepted: 9 September 2021

Published: 11 September 2021

Publisher’s Note: MDPI stays neutral

with regard to jurisdictional claims in

published maps and institutional afﬁl-

iations.

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

College of Physical Science and Technology, Central China Normal University, NO. 152 Luoyu Road,

Wuhan 430079, China; liguoqing@mails.ccnu.edu.cn (G.L.); gpzhang@mail.ccnu.edu.cn (G.Z.)

Key Laboratory of Quark and Lepton Physics (MOE) and College of Physics Science and Technology,

Central China Normal University, NO. 152 Luoyu Road, Wuhan 430079, China

School of Big Data and Computer Science, Guizhou Normal University, The University Town,

Guian New Area, Guiyang 550025, China

Center for RFID and WSN Engineering, Department of Education, Guizhou Normal University,

The University Town, Guian New Area, Guiyang 550025, China

* Correspondence: 201407141@gznu.edu.cn

Abstract:

In the task of interactive image segmentation, the Inside-Outside Guidance (IOG) algo-

rithm has demonstrated superior segmentation performance leveraging Inside-Outside Guidance

information. Nevertheless, we observe that the inconsistent input between training and testing

when selecting the inside point will result in signiﬁcant performance degradation. In this paper, a

deep reinforcement learning framework, named Inside Point Localization Network (IPL-Net), is

proposed to infer the suitable position for the inside point to help the IOG algorithm. Concretely,

when a user ﬁrst clicks two outside points at the symmetrical corner locations of the target object, our

proposed system automatically generates the sequence of movement to localize the inside point. We

then perform the IOG interactive segmentation method for precisely segmenting the target object

of interest. The inside point localization problem is difﬁcult to deﬁne as a supervised learning

framework because it is expensive to collect image and their corresponding inside points. Therefore,

we formulate this problem as Markov Decision Process (MDP) and then optimize it with Dueling

Double Deep Q-Network (D3QN). We train our network on the PASCAL dataset and demonstrate

that the network achieves excellent performance.

Keywords:

interactive image segmentation; Markov Decision Process (MDP); Deep Reinforcement

Learning (DRL); inside point localization; Deep Q-Network (DQN)

1. Introduction

Interactive image segmentation allows users to explicitly control the segmentation

mask using human-friendly annotators, which can be formalized via various represen-

tations: bounding boxes, scribbles, clicks, or extreme points. As one of the fundamental

problems in computer vision, it has obtained remarkable results in broad applications, such

as medical image analysis [

], image editing [

], and especially pixel-level annotation [

In the early days, a large number of traditional approaches [

–

] have been developed in

this direction. Boykov et al. [

] considered interactive segmentation problem as an opti-

mization problem and utilized a graph cut-based method to extract the object automatically.

Following, Price et al. [

] improve the graph cut method by applying geodesic distances

for energy minimization. Grady introduces an interactive segmentation algorithm called

random walks [

]. Here, the pixel labels are assigned as the label of the ﬁrst seed that the

walker reaches. All these methods based on low level-features cannot distinguish between

the target object and background in the case of complex and variable scenes.

Over the past few years, deep learning-based algorithms have become popular in

computer vision and have also showed astonishing performances in interactive segmen-

tation problems. Xu et al. [

] put forward a CNN-based model to solve the interactive

Sensors 2021, 21, 6100. https://doi.org/10.3390/s21186100 https://www.mdpi.com/journal/sensors

资源描述：

当前文档最多预览五页，下载文档查看全文

侵权申诉



1 1 2 3 4 5 / 14



此文档下载收益归作者所有

当前文档最多预览五页，下载文档查看全文

版权提示

温馨提示：
1. 部分包含数学公式或PPT动画的文件，查看预览时可能会显示错乱或异常，文件下载后无此问题，请放心下载。
2. 本文档由用户上传，版权归属用户，天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容，确认文档内容符合您的需求后进行下载，若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误，付费完成后未能成功下载的用户请联系客服处理。

大家都在看

近期热门

基于深度强化学习的交互式对象分割自动内点定位-2021年

最近更新

大家都在看

相关文章

相关标签