基于文本挖掘的事故原因分类的关系图卷积网络

ID:38829

大小:1.14 MB

页数:13页

时间:2023-03-14

金币:2

上传者:战必胜

 
Citation: Chen, Z.; Huang, K.; Wu, L.;
Zhong, Z.; Jiao, Z. Relational Graph
Convolutional Network for
Text-Mining-Based Accident Causal
Classification. Appl. Sci. 2022, 12,
2482. https://doi.org/10.3390/
app12052482
Academic Editors: Nikos D. Lagaros
and Vagelis Plevris
Received: 30 January 2022
Accepted: 24 February 2022
Published: 27 February 2022
Publishers Note: MDPI stays neutral
with regard to jurisdictional claims in
published maps and institutional affil-
iations.
Copyright: © 2022 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
applied
sciences
Article
Relational Graph Convolutional Network for
Text-Mining-Based Accident Causal Classification
Zaili Chen
1,2,†
, Kai Huang
2,3,†
, Li Wu
1,
*, Zhenyu Zhong
1
and Zeyu Jiao
2,
*
1
Faculty of Engineering, China University of Geosciences, Wuhan 430074, China; zl.chen@cug.edu.cn (Z.C.);
zy.zhong@giim.ac.cn (Z.Z.)
2
Guangdong Key Laboratory of Modern Control Technology, Institute of Intelligent Manufacturing,
Guangdong Academy of Sciences, Guangzhou 510070, China; kirehuang@gmail.com
3
School of Economics and Management, Beihang University, Beijing 100191, China
* Correspondence: lwu@cug.edu.cn (L.W.); zy.jiao@giim.ac.cn (Z.J.)
These authors contributed equally to this work.
Abstract:
Accident investigation reports are text documents that systematically review and analyze
the cause and process of accidents after accidents have occurred and have been widely used in the
fields such as transportation, construction and aerospace. With the aid of accident investigation
reports, the cause of the accident can be clearly identified, which provides an important basis for
accident prevention and reliability assessment. However, since accident record reports are mostly
composed of unstructured data such as text, the analysis of accident causes inevitably relies on a lot
of expert experience and statistical analyses also require a lot of manual classification. Although, in
recent years, with the development of natural language processing technology, there have been many
efforts to automatically analyze and classify text. However, the existing methods either rely on large
corpus and data preprocessing methods, which are cumbersome, or extract text information based
on bidirectional encoder representation from transformers (BERT), but the computational cost is
extremely high. These shortcomings make it still a great challenge to automatically analyze accident
investigation reports and extract the information therein. To address the aforementioned problems,
this study proposes a text-mining-based accident causal classification method based on a relational
graph convolutional network (R-GCN) and pre-trained BERT. On the one hand, the proposed method
avoids preprocessing such as stop word removal and word segmentation, which not only preserves
the information of accident investigation reports to the greatest extent, but also avoids tedious
operations. On the other hand, with the help of R-GCN to process the semantic features obtained by
BERT representation, the dependence of BERT retraining on computing resources can be avoided.
Keywords: accident causal classification; accident investigation reports; text mining; R-GCN; BERT
1. Introduction
Accident investigation reports are usually text documents formed by professional
investigators or teams through visits, conversations, viewing video surveillance and an-
alyzing recorded data after accidents occur [
1
] and have been widely used in aviation,
construction, transportation and other fields [
2
]. The process and consequences of the
accident recorded in the reports can be leveraged by experts to analyze the cause of the acci-
dent, which is of great significance for preventing the recurrence of the accident or forming
the accident response plan [
3
]. However, the current analysis of accident investigation
reports mainly relies on expert experience to manually determine the cause of the accident,
which requires a lot of work, and the accuracy is affected by the subjective experience of
experts [
4
]. On 29 October 2018, an Indonesian Lion Air Boeing 737 MAX8 plane carrying
189 passengers and crew was flying from Jakarta’s Soekarno Hatta International Airport
to Penang Port, Bangka Belitung Province. The plane lost contact 13 min after takeoff and
was later confirmed to have crashed in the waters off Karawang, West Java province [
5
].
Appl. Sci. 2022, 12, 2482. https://doi.org/10.3390/app12052482 https://www.mdpi.com/journal/applsci
资源描述:

当前文档最多预览五页,下载文档查看全文

此文档下载收益归作者所有

当前文档最多预览五页,下载文档查看全文
温馨提示:
1. 部分包含数学公式或PPT动画的文件,查看预览时可能会显示错乱或异常,文件下载后无此问题,请放心下载。
2. 本文档由用户上传,版权归属用户,天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容,确认文档内容符合您的需求后进行下载,若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误,付费完成后未能成功下载的用户请联系客服处理。
关闭