short-paper

TinyRL: Towards Reinforcement Learning on Tiny Embedded Devices

Authors:
Tomasz Szydlo

AGH University of Science and Technology, Krakow, Poland

AGH University of Science and Technology, Krakow, Poland
View Profile

,
Prem Prakash Jayaraman

Swinburne University of Technology, Melbourne, Australia

Swinburne University of Technology, Melbourne, Australia
View Profile

,
Yinhao Li

Newcastle University, Newcastle upon Tyne, United Kingdom

Newcastle University, Newcastle upon Tyne, United Kingdom
View Profile

,
Graham Morgan

Newcastle University, Newcastle upon Tyne, United Kingdom

Newcastle University, Newcastle upon Tyne, United Kingdom
View Profile

,
Rajiv Ranjan

Newcastle University, Newcastle upon Tyne, United Kingdom

Newcastle University, Newcastle upon Tyne, United Kingdom
View Profile

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge ManagementOctober 2022Pages 4985–4988https://doi.org/10.1145/3511808.3557206

Published:17 October 2022Publication History

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Pages 4985–4988

ABSTRACT

We observe significant interest in reinforcement learning methods for real-world sensing-control scenarios driven by the sensor data streams. However, the delay introduced to the data by the communication channels may degrade the system's performance. It is especially crucial in the internet of things (IoT), where devices with constraint resources and low throughput networks are used.

We demonstrate TinyRL framework, a different approach to this problem, by transferring RL algorithms knowledge to resource-limited devices. Our initial experiments point towards a successful demonstration of our technique using common microcontrollers used in IoT systems. Such devices have limited resource capability, and their regulation by processing data directly on devices without their transmission to the cloud can play a crucial role in their lifespan and usefulness.

Supplemental Material

CIKM22-fp9746.mp4

mp4

15.7 MB

Download

References

Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy. 2018. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). USENIX Association, Carlsbad, CA, 578--594.Google Scholar
Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. 2016. Binarized Neural Networks. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.), Vol. 29. Curran Associates, Inc.Google Scholar
Lei Lei, Yue Tan, Kan Zheng, Shiwen Liu, Kuan Zhang, and Xuemin Shen. 2020. Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges. IEEE Communications Surveys Tutorials, Vol. 22, 3 (2020), 1722--1760.Google ScholarCross Ref
Norbert Mitschke, Michael Heizmann, Klaus-Henning Noffz, and Ralf Wittmann. 2019. A Fixed-Point Quantization Technique for Convolutional Neural Networks Based on Weight Scaling. In 2019 IEEE International Conference on Image Processing (ICIP). 3836--3840.Google ScholarCross Ref
Muhammad Shafique, Theocharis Theocharides, Vijay Janapa Reddy, and Boris Murmann. 2021. TinyML: Current Progress, Research Challenges, and Future Roadmap. In 2021 58th ACM/IEEE Design Automation Conference (DAC). 1303--1306.Google ScholarDigital Library
Jiecao Yu, Andrew Lukefahr, David J. Palframan, Ganesh S. Dasika, Reetuparna Das, and Scott A. Mahlke. 2017. Scalpel: Customizing DNN pruning to the underlying hardware parallelism. 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (2017), 548--560. iGoogle Scholar

Index Terms

TinyRL: Towards Reinforcement Learning on Tiny Embedded Devices

Recommendations

HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning
WiSec '23: Proceedings of the 16th ACM Conference on Security and Privacy in Wireless and Mobile Networks

As IoT devices are becoming widely deployed, there exist many threats to IoT-based systems due to their inherent vulnerabilities. One effective approach to improving IoT security is to deploy IoT honeypot systems, which can collect attack information ...
Read More
Recent Reinforcement Learning and Blockchain Based Security Solutions for Internet of Things: Survey
Abstract
Users’ security is one of the most important issues in Internet of Things (IoT) due to the high number of IoT devices involved in different applications. Security threats are evolving at a rapid pace that make the current security and privacy ...
Read More
Reward Shaping in Episodic Reinforcement Learning
AAMAS '17: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems

Recent advancements in reinforcement learning confirm that reinforcement learning techniques can solve large scale problems leading to high quality autonomous decision making. It is a matter of time until we will see large scale applications of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
October 2022
5274 pages
ISBN:9781450392365
DOI:10.1145/3511808
General Chairs:
Mohammad Al Hasan
Indiana University Purdue University, Indianapolis, USA
,
Li Xiong
Emory University, Atlanta, USA
Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 October 2022
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
embedded devices
internet of things
reinforcement learning
Qualifiers
- short-paper
Conference

Acceptance Rates
CIKM '22 Paper Acceptance Rate621of2,257submissions,28%Overall Acceptance Rate1,861of8,427submissions,22%
More
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 766
  Total Downloads
- Downloads (Last 12 months)216
- Downloads (Last 6 weeks)37
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

TinyRL: Towards Reinforcement Learning on Tiny Embedded Devices

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning

Recent Reinforcement Learning and Blockchain Based Security Solutions for Internet of Things: Survey

Reward Shaping in Episodic Reinforcement Learning

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

TinyRL: Towards Reinforcement Learning on Tiny Embedded Devices

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning

Recent Reinforcement Learning and Blockchain Based Security Solutions for Internet of Things: Survey

Reward Shaping in Episodic Reinforcement Learning

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media