Augmented Lagrangian-Based Reinforcement Learning for Network Slicing in IIoT


Citation: Qi, Q.; Lin, W.; Guo, B.; Chen, J.; Deng, C.; Lin, G.; Sun, X.; Chen, Y. Augmented Lagrangian-Based Reinforcement Learning for Network Slicing in IIoT. Electronics 2022, 11, 3385. https://doi.org/10.3390/electronics11203385

Academic Editors: Alexandros-Apostolos Boulogeorgos, Panagiotis Sarigiannidis, Thomas Lagkas, Vasileios Argyriou and Pantelis Angelidis

Received: 8 September 2022

Accepted: 17 October 2022

Published: 19 October 2022

Publisher's Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

electronics

Article

Augmented Lagrangian-Based Reinforcement Learning for Network Slicing in IIoT

Qi Qi, Wenbin Lin, Boyang Guo *, Jinshan Chen, Chaoping Deng, Guodong Lin, Xin Sun and Youjia Chen

State Grid Fujian Electric Power Research Institute, Fuzhou 350007, China

Fujian Key Lab for Intelligent Processing and Wireless Transmission of Media Information, College of Physics and Information Engineering, Fuzhou University, Fuzhou 350108, China

* Correspondence: boyangguo_fzu@163.com

Abstract: Network slicing enables the multiplexing of independent logical networks on the same physical network infrastructure to provide different network services for different applications. The resource allocation problem involved in network slicing is typically a decision-making problem, falling within the scope of reinforcement learning. Its ability to adapt to dynamic wireless environments makes reinforcement learning a good candidate for solving this problem. In this paper, to tackle the constrained mixed integer nonlinear programming problem in network slicing, we propose an augmented Lagrangian-based soft actor–critic (AL-SAC) algorithm. In this algorithm, a hierarchical action selection network is designed to handle the hybrid action space. More importantly, inspired by the augmented Lagrangian method, both neural networks for the Lagrange multipliers and a penalty term are introduced to deal with the constraints. Experimental results show that the proposed AL-SAC algorithm can strictly satisfy the constraints and achieve better performance than other benchmark algorithms.

Keywords: network slicing; augmented Lagrangian; reinforcement learning; hybrid action space; soft actor–critic (SAC)

1. Introduction

With the rapid development of the industrial internet of things (IIoT), more and more devices are connected and controlled via wireless networks. Providing precise services for these devices to fulfill their diverse requirements has become a fundamental issue in IIoT. To address this challenge, three application scenarios have been defined by the International Telecommunication Union (ITU) and the Fifth Generation Public Private Partnership (5G-PPP) [ ], namely enhanced mobile broadband (eMBB), ultra-reliable low latency communications (URLLC), and massive machine type communication (mMTC). In more detail, the eMBB scenario serves devices that require a high transmission rate, such as high-definition surveillance video in factories, where the peak rate for each camera can be greater than 10 Gbps [ ]. mMTC refers to scenarios in which a large number of devices connect simultaneously while the requirements on transmission rate and delay are not critical [ ]. In contrast, URLLC serves applications with strict requirements on transmission reliability and latency, such as automatic operators and controllers [5].

To serve these disparate scenarios within one network infrastructure, the network slicing technique was proposed. It divides a physical network into multiple independent logical networks [ ], where each network slice is isolated from the others and provides one kind of network service via dedicated resource allocation. To allocate resources efficiently and adapt to the dynamics of wireless networks, many intelligent algorithms have been proposed. For instance, in [ ], the genetic algorithm, ant colony optimization with a genetic algorithm, and the quantum genetic algorithm were used to jointly allocate radio

