Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control

Alegre, Lucas Nunes; Bazzan, Ana Lucia Cetertich; Silva, Bruno Carreiro da

dc.contributor.author	Alegre, Lucas Nunes	pt_BR
dc.contributor.author	Bazzan, Ana Lucia Cetertich	pt_BR
dc.contributor.author	Silva, Bruno Carreiro da	pt_BR
dc.date.accessioned	2023-04-07T03:26:12Z	pt_BR
dc.date.issued	2021	pt_BR
dc.identifier.issn	2376-5992	pt_BR
dc.identifier.uri	http://hdl.handle.net/10183/256805	pt_BR
dc.description.abstract	In reinforcement learning (RL), dealing with non-stationarity is a challenging issue. However, some domains such as traffic optimization are inherently non-stationary. Causes for and effects of this are manifold. In particular, when dealing with traffic signal controls, addressing non-stationarity is key since traffic conditions change over time and as a function of traffic control decisions taken in other parts of a network. In this paper we analyze the effects that different sources of non-stationarity have in a network of traffic signals, in which each signal is modeled as a learning agent. More precisely, we study both the effects of changing the context in which an agent learns (e.g., a change in flow rates experienced by it), as well as the effects of reducing agent observability of the true environment state. Partial observability may cause distinct states (in which distinct actions are optimal) to be seen as the same by the traffic signal agents. This, in turn, may lead to sub-optimal performance. We show that the lack of suitable sensors to provide a representative observation of the real state seems to affect the performance more drastically than the changes to the underlying traffic patterns.	en
dc.format.mimetype	application/pdf	pt_BR
dc.language.iso	eng	pt_BR
dc.relation.ispartof	PeerJ Computer Science. New York : PeerJ, 2021. Vol. 7, (mar. 2021), 20 p.	pt_BR
dc.rights	Open Access	en
dc.subject	Multiagent systems	en
dc.subject	Sistemas multiagentes	pt_BR
dc.subject	Aprendizado por reforço	pt_BR
dc.subject	Reinforcement learning	en
dc.subject	Traffic signal control	en
dc.subject	Informatica : Transportes	pt_BR
dc.subject	Non-stationarity	en
dc.title	Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control	pt_BR
dc.type	Artigo de periódico	pt_BR
dc.identifier.nrb	001143089	pt_BR
dc.type.origin	Estrangeiro	pt_BR

Files in this item

Name:: 001143089.pdf
Size:: 3.410Mb
Format:: PDF
Description:: Texto completo (inglês)

View/Open

This item is licensed under a Creative Commons License

Journal Articles (40281)

Exact and Earth Sciences (6158)

Show simple item record