KD PatchMatch一种基于补丁匹配的自监督训练学习

ID：38578

阅读量：0

大小：91.75 MB

页数：15页

时间：2023-03-11

金币：2

上传者：战必胜

Citation: Tan, Q.; Fang, Z.; Jiang, X.

KD-PatchMatch: A Self-Supervised

Training Learning-Based PatchMatch.

Appl. Sci. 2023, 13, 2224. https://

doi.org/10.3390/app13042224

Academic Editor: Silvia Liberata Ullo

Received: 31 December 2022

Revised: 30 January 2023

Accepted: 6 February 2023

Published: 9 February 2023

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

applied

sciences

Article

KD-PatchMatch: A Self-Supervised Training

Learning-Based PatchMatch

Qingyu Tan, Zhijun Fang * and Xiaoyan Jiang

School of Electronic and Electrical Engineering, Shanghai University of Engineering Science,

Shanghai 201620, China

* Correspondence: zjfang@sues.edu.cn

Abstract:

Traditional learning-based multi-view stereo (MVS) methods usually need to ﬁnd the cor-

rect depth value from a large number of depth candidates, which leads to huge memory consumption

and slow inference. To address these problems, we propose a probabilistic depth sampling in the

learning-based PatchMatch framework, i.e., sampling a small number of depth candidates from a

single-view probability distribution, which achieves the purpose of saving computational resources.

Furthermore, to overcome the difﬁculty of obtaining ground-truth depth for outdoor large-scale

scenes, we also propose a self-supervised training pipeline based on knowledge distillation, which

involves self-supervised teacher training and student training based on knowledge distillation. Ex-

tensive experiments show that our approach outperforms other recent learning-based MVS methods

on DTU, Tanks and Temples, and ETH3D datasets.

Keywords:

multi-view stereo; learning-based PatchMatch; probabilistic depth sampling; knowledge

distillation

1. Introduction

Given multiple RGB images with known camera poses, multi-view stereo (MVS)

intends to reconstruct a 3D dense point cloud of the image scene. Multi-view stereo has

a wide range of applications, including mapping [

], self-driving cars [

], infrastructure

inspection [3], robotics [4], etc.

Convolutional neural networks have demonstrated very powerful capabilities in multi-

view 3D reconstruction problems in recent years, owing to the continuing development

of deep learning. Many learning-based methods [

–

] can incorporate global semantic

information, such as specular prior and reﬂection prior, to improve the robustness of the

matching and thus solve the challenges that cannot be overcome by traditional methods.

However, MVS still has many challenges, such as untextured areas, occlusion, and non-

Lambertian surfaces [9–11].

When MVSNet [

] is proposed, the learning-based MVS domain constructs the cost

volume of image pairs using front-to-parallel and differentiable homography. Many sub-

sequent networks are improved on this basis. For example, R-MVSNet [

] innovates the

regularization of the cost volume in the depth dimension by using Conv-GRU layer-by-

layer processing to reduce the memory consumption; CasMVSNet [

] proposes the ﬁrst

coarse-to-ﬁne structure paradigm to optimize the memory consumption and computational

efﬁciency; Vis-MVSNet [

] and CVP-MVSNet [

] consider in depth the aggregation

approach of cost volume and the range of depth assumptions in the subsequent stages

of coarse-to-ﬁne from multiple views, respectively, resulting in substantial performance

improvements. PatchMatchNet [

] is the ﬁrst model that introduces the traditional stereo

matching algorithm (PatchMatch) into an end-to-end MVS framework.

Most learning-based MVS methods [

] employ the same set of depth hy-

pothesis candidates for all pixels (i.e., sampled between hand-picked limits

min

and

max

Appl. Sci. 2023, 13, 2224. https://doi.org/10.3390/app13042224 https://www.mdpi.com/journal/applsci

资源描述：

当前文档最多预览五页，下载文档查看全文

侵权申诉



1 1 2 3 4 5 / 15



此文档下载收益归作者所有

当前文档最多预览五页，下载文档查看全文

版权提示

温馨提示：
1. 部分包含数学公式或PPT动画的文件，查看预览时可能会显示错乱或异常，文件下载后无此问题，请放心下载。
2. 本文档由用户上传，版权归属用户，天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容，确认文档内容符合您的需求后进行下载，若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误，付费完成后未能成功下载的用户请联系客服处理。

大家都在看

近期热门

KD PatchMatch一种基于补丁匹配的自监督训练学习

最近更新

大家都在看

相关文章

相关标签