Improved quality of online education using prioritized multi-agent reinforcement learning for video traffic scheduling

Comsa, Ioan-Sorin, Molnar, Andreea, Tal, Irina, Imhof, Christof, Bergamin, Per, Muntean, Gabriel-Miro, Muntean, Cristina Hava and Trestian, Ramona Maria (2023) Improved quality of online education using prioritized multi-agent reinforcement learning for video traffic scheduling. IEEE Transactions on Broadcasting, 69 (2). pp. 436-454. ISSN 0018-9316 [Article] (doi:10.1109/TBC.2023.3246815)

PDF - Published version (with publisher's formatting)
Available under License Creative Commons Attribution 4.0.


The recent global pandemic has transformed the way education is delivered, increasing the importance of video-based online learning. However, this puts a significant pressure on the underlying communication networks and the limited available bandwidth needs to be intelligently allocated to support a much higher transmission load, including video-based services. In this context, this paper proposes a Machine Learning (ML)-based solution that dynamically prioritizes content viewers with heterogeneous video services to increase their Quality of Service (QoS) and perceived Quality of Experience (QoE). The proposed approach makes use of the novel Prioritized Multi-Agent Reinforcement Learning solution (PriMARL) to decide the prioritization order of the video-based services based on networking conditions. However, the performance in terms of QoS and QoE provisioning to learners with different profiles and networking conditions depends on the type of scheduler employed in the frequency domain to conduct the scheduling and the radio resource allocation. To decide the best approach to be followed, we employ the proposed PriMARL solution with different types of scheduling rules and compare them with other state-of-the-art solutions in terms of throughput, delay, packet loss, Peak Signal-to-Noise Ratio (PSNR), and Mean Opinion Score (MOS) for different traffic loads and characteristics. We show that the proposed solution achieves the best user QoE results.

Item Type: Article
Research Areas: A. > School of Science and Technology > Design Engineering and Mathematics
Item ID: 37455
Notes on copyright: This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see
Depositing User: Ramona Trestian
Date Deposited: 15 Feb 2023 11:34
Last Modified: 17 Jun 2023 04:26
