Transfer Learning for Sentiment Analysis Using BERT-Based Supervised Fine-Tuning (2022)

Citation: Proasha, N.J.; Sam, A.A.;
Kowsher, M.; Murad, S.A.; Bairagi,
A.K.; Masud, M.; Baz, M. Transfer
Learning for Sentiment Analysis
Using BERT Based Supervised
Fine-Tuning. Sensors 2022, 22, 4157.
hps://doi.org/10.3390/s22114157
Academic Editors: Yangquan Chen, Nunzio Cennamo, M. Jamal Deen, Subhas Mukhopadhyay, Simone Morais, Junseop Lee and Roberto Teti
Received: 1 March 2022
Accepted: 11 May 2022
Published: 30 May 2022
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
sensors
Article
Transfer Learning for Sentiment Analysis Using BERT Based
Supervised Fine-Tuning
Nusrat Jahan Proasha
1
, Abdullah As Sami
2
, Md Kowsher
3,
* , Saydul Akbar Murad
4
,
Anupam Kumar Bairagi
5
, Mehedi Masud
6
and Mohammed Baz
7
1 Department of Computer Science and Engineering, Daffodil International University, Dhaka 1341, Bangladesh; jahannusratprotta@gmail.com
2 Department of Computer Science & Engineering, Chittagong University of Engineering & Technology, Chattogram 4349, Bangladesh; abdullahassami@gmail.com
3 Department of Computer Science, Stevens Institute of Technology, Hoboken, NJ 07030, USA
4 Faculty of Computing, Universiti Malaysia Pahang, Pekan 26600, Malaysia; saydulakbarmurad@gmail.com
5 Computer Science and Engineering Discipline, Khulna University, Khulna 9208, Bangladesh; anupam@ku.ac.bd
6 Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia; mmasud@tu.edu.sa
7 Department of Computer Engineering, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia; mo.baz@tu.edu.sa
* Correspondence: ga.kowsher@gmail.com
Abstract: The growth of the Internet has expanded the amount of data expressed by users across
multiple platforms. The availability of these different worldviews and individuals’ emotions
empowers sentiment analysis. However, sentiment analysis becomes even more challenging due to a
scarcity of standardized labeled data in the Bangla NLP domain. The majority of the existing Bangla
research has relied on deep learning models that focus heavily on context-independent word
embeddings, such as Word2Vec, GloVe, and fastText, in which each word has a fixed representation
irrespective of its context. Meanwhile, context-based pre-trained language models such as BERT have
recently revolutionized the state of natural language processing. In this work, we applied BERT’s
transfer learning ability to a deep integrated model, CNN-BiLSTM, for enhanced performance of
decision-making in sentiment analysis. In addition, we introduced the ability of transfer learning
to classical machine learning algorithms for performance comparison with CNN-BiLSTM.
Additionally, we explore various word embedding techniques, such as Word2Vec, GloVe, and fastText,
and compare their performance to the BERT transfer learning strategy. As a result, we have shown
a state-of-the-art binary classification performance for Bangla sentiment analysis that significantly
outperforms all other embeddings and algorithms.
Keywords: sentiment analysis; Bangla-BERT; transfer learning; transformer; word embedding; Bangla NLP
1. Introduction
Sentiment classication is the process of examining a piece of text to forecast how an
individual’s aitude toward an occurrence or perspective will be oriented. The sentiment is
usually analyzed based on text polarity. Typically, a sentiment classier categorizes positive,
negative, or neutral [
1
]. Sentiment extraction is the backbone of sentiment categorization,
and considerable study has been conducted. The next crucial step is sentiment mining,
which has increased tremendously in recent years in line with the growth of textual data
worldwide. People now share their ideas electronically on various topics, including online
product reviews, book or lm studies, and political commentary. As a result, evaluating
diverse viewpoints becomes essential for interpreting people’s intentions. In general,
Sensors 2022, 22, 4157. https://doi.org/10.3390/s22114157 https://www.mdpi.com/journal/sensors