Citation: Liu, L.; Chen, E.; Ding, Y. TR-Net: A Transformer-Based Neural Network for Point Cloud Processing. Machines 2022, 10, 517. https://doi.org/10.3390/machines10070517
Academic Editors: Shuai Li, Dechao Chen, Mohammed Aquil Mirza, Vasilios N. Katsikis, Dunhui Xiao and Predrag Stanimirović
Received: 2 May 2022
Accepted: 22 June 2022
Published: 27 June 2022
Article
TR-Net: A Transformer-Based Neural Network for Point
Cloud Processing
Luyao Liu 1, Enqing Chen 1,2 and Yingqiang Ding 1,*

1 School of Information Engineering, Zhengzhou University, No. 100 Science Avenue, Zhengzhou 450001, China; luyao@stu.zzu.edu.cn (L.L.); ieeqchen@zzu.edu.cn (E.C.)
2 Henan Xintong Intelligent IOT Co., Ltd., No. 1-303 Intersection of Ruyun Road and Meihe Road, Zhengzhou 450007, China
* Correspondence: dyq@zzu.edu.cn
Abstract: The point cloud is a versatile geometric representation that can be applied to many computer vision tasks. Because point clouds are unordered, designing a deep neural network for point cloud analysis is challenging. Furthermore, most existing frameworks for point cloud processing either barely consider local neighboring information or ignore context-aware and spatially-aware features. To address these problems, we propose a novel transformer-based point cloud processing architecture named TR-Net, which reformulates point cloud processing as a set-to-set translation problem. TR-Net operates directly on raw point clouds without any data transformation or annotation, which reduces computing-resource consumption and memory usage. Firstly, a neighborhood embedding backbone is designed to effectively extract local neighboring information from the point cloud. Then, an attention-based sub-network is constructed to learn a semantically rich and discriminative representation from the embedded features. Finally, effective global features are obtained by feeding the features extracted by the attention-based sub-network into a residual backbone. For different downstream tasks, we build different decoders. Extensive experiments on public datasets show that our approach outperforms other state-of-the-art methods; for example, TR-Net achieves 93.1% overall accuracy on the ModelNet40 dataset and a mIoU of 85.3% on the ShapeNet dataset for part segmentation.
Keywords: point cloud; deep learning; classification; part segmentation; transformer
1. Introduction
A point cloud is a set of points in 3D space that can be viewed as a representation of an object's surface. Because it greatly compensates for the lack of spatial structure information in 2D images, the point cloud has been used extensively in fields such as automatic driving [1], virtual reality [2], and intelligent robotics [3,4]. These contemporary applications usually call for advanced point cloud processing methods. As is well known, a point cloud is unordered and irregular [5], which distinguishes it from 2D images: it is a collection of unevenly sampled points, and any algorithm for point cloud feature extraction must therefore be independent of the order of the input points. On the one hand, this makes the relationships between points difficult to exploit for feature extraction. On the other hand, convolutional neural networks, which are widely applied in image and video processing, cannot be applied directly to point clouds. This research focuses on shape classification and part segmentation of point clouds, two basic and challenging tasks that have received much attention from researchers in point cloud processing.
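The order-independence requirement above is commonly satisfied with a symmetric aggregation function over the point dimension, as popularized by PointNet-style architectures. A minimal NumPy sketch (the function name and the use of raw coordinates as "features" are illustrative choices, not details from this paper):

```python
import numpy as np

def global_feature(points: np.ndarray) -> np.ndarray:
    """points: (N, 3) array of an unordered point set.
    Max pooling over the point axis is symmetric, so the result
    does not depend on the order in which the points are listed."""
    return points.max(axis=0)

cloud = np.array([[0.0, 1.0, 2.0],
                  [3.0, 0.5, 1.0],
                  [2.0, 2.0, 0.0]])
shuffled = cloud[np.random.permutation(len(cloud))]

# Identical output for any permutation of the input points.
assert np.array_equal(global_feature(cloud), global_feature(shuffled))
```

Any symmetric operator (max, sum, mean, or attention with order-agnostic weights) yields the same guarantee; the challenge the paper addresses is doing this while still capturing local neighborhood structure.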
In the early stages of point cloud research, most researchers converted point cloud data into regular 3D voxel grids [6] or collections of images before feeding them into