Publication
Title
Parallel reinforcement learning with minimal communication overhead for IoT environments
Author
Abstract
Many Internet of Things (IoT) applications require a distributed architecture for decision making: either because of a lack of a centralized system, failure-prone connectivity to a centralized system, or because the imposed latency to contact such a system is too high for real-time applications. Often, these IoT applications fall in the domain of Reinforcement Learning (RL), e.g., autonomous robot navigation in Smart Factories and Traffic Signal Control in Smart Cities. However, RL-based applications require a long learning time. To overcome this limitation and scale with the number of agents, Parallel Reinforcement Learning (PRL) algorithms run multiple RL agents in parallel and on distributed environments. However, deploying PRL algorithms in such environments entails a communication overhead that increases the (actual) execution time. State-of-the-art PRL algorithms are designed for reducing the learning time while assuming no (or limited) communication overhead. In this work, we present a novel partitioning algorithm that minimizes the communication overhead in PRL running on IoT environments. To the best of our knowledge, this is the first work that focuses on solving the communication overhead of distributing PRL algorithms algorithm without requiring any a priori knowledge about the structure of the problem. The proposed algorithm intelligently combines a dynamic state partitioning strategy, which exploits the agent’s exploration capabilities to build partition knowledge while learning, with an efficient mapping of agents to partitions, which reduces the communication among agents. Performance evaluations show that the proposed algorithm can achieve almost no communication among PRL agents at the converged state.
Language
English
Source (journal)
IEEE internet of things journal. - Piscataway, NJ, 2014, currens
Publication
Piscataway, NJ : Institute of Electrical and Electronics Engineers , 2020
ISSN
2327-4662
DOI
10.1109/JIOT.2019.2955035
Volume/pages
7 :2 (2020) , p. 1387-1400
ISI
000521981800046
Full text (Publisher's DOI)
Full text (publisher's version - intranet only)
UAntwerpen
Faculty/Department
Research group
Publication type
Subject
Affiliation
Publications with a UAntwerp address
External links
Web of Science
Record
Identifier
Creation 07.01.2020
Last edited 25.12.2024
To cite this reference