Heterogeneous stochastic interactions for multiple agents in a multi-armed bandit problem
Author(s): Madhushani, U; Leonard, Naomi E
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr10w10
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Madhushani, U | - |
dc.contributor.author | Leonard, Naomi E | - |
dc.date.accessioned | 2021-10-08T20:20:03Z | - |
dc.date.available | 2021-10-08T20:20:03Z | - |
dc.date.issued | 2019 | en_US |
dc.identifier.citation | Madhushani, U, Leonard, NE. (2019). Heterogeneous stochastic interactions for multiple agents in a multi-armed bandit problem. 3502 - 3507. doi:10.23919/ECC.2019.8796036 | en_US |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr10w10 | - |
dc.description.abstract | We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors. Neighbors are defined by a network graph with heterogeneous and stochastic interconnections. These interactions are determined by the sociability of each agent, which corresponds to the probability that the agent observes its neighbors. We design an algorithm for each agent to maximize its own expected cumulative reward and prove performance bounds that depend on the sociability of the agents and the network structure. We use the bounds to predict the rank ordering of agents according to their performance and verify the accuracy analytically and computationally. | en_US |
dc.format.extent | 3502 - 3507 | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartof | 2019 18th European Control Conference | en_US |
dc.rights | Author's manuscript | en_US |
dc.title | Heterogeneous stochastic interactions for multiple agents in a multi-armed bandit problem | en_US |
dc.type | Conference Article | en_US |
dc.identifier.doi | doi:10.23919/ECC.2019.8796036 | - |
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Heterogeneous stochastic interactions for multiple agents in a multi-armed bandit problem.pdf | 493.95 kB | Adobe PDF | View/Download |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.