Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

Chang, Michael; Kaushik, Sid; Weinberg, S Matthew; Griffiths, Tom; Levine, Sergey

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1pk1g

Full metadata record

DC Field	Value	Language
dc.contributor.author	Chang, Michael	-
dc.contributor.author	Kaushik, Sid	-
dc.contributor.author	Weinberg, S Matthew	-
dc.contributor.author	Griffiths, Tom	-
dc.contributor.author	Levine, Sergey	-
dc.date.accessioned	2021-10-08T19:51:14Z	-
dc.date.available	2021-10-08T19:51:14Z	-
dc.date.issued	2020	en_US
dc.identifier.citation	Chang, Michael, Sid Kaushik, S. Matthew Weinberg, Tom Griffiths, and Sergey Levine. "Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions." In Proceedings of the 37th International Conference on Machine Learning 119 (2020): pp. 1437-1447.	en_US
dc.identifier.issn	2640-3498	-
dc.identifier.uri	http://proceedings.mlr.press/v119/chang20b.html	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1pk1g	-
dc.description.abstract	This paper seeks to establish a framework for directing a society of simple, specialized, self-interested agents to solve what traditionally are posed as monolithic single-agent sequential decision problems. What makes it challenging to use a decentralized approach to collectively optimize a central objective is the difficulty in characterizing the equilibrium strategy profile of non-cooperative games. To overcome this challenge, we design a mechanism for defining the learning environment of each agent for which we know that the optimal solution for the global objective coincides with a Nash equilibrium strategy profile of the agents optimizing their own local objectives. The society functions as an economy of agents that learn the credit assignment process itself by buying and selling to each other the right to operate on the environment state. We derive a class of decentralized reinforcement learning algorithms that are broadly applicable not only to standard reinforcement learning but also for selecting options in semi-MDPs and dynamically composing computation graphs. Lastly, we demonstrate the potential advantages of a society’s inherent modular structure for more efficient transfer learning.	en_US
dc.format.extent	1437 - 1447	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Proceedings of the 37th International Conference on Machine Learning	en_US
dc.rights	Final published version. This is an open access article.	en_US
dc.title	Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
DecenReinforcementLearning.pdf		1.39 MB	Adobe PDF
