Heterogeneous stochastic interactions for multiple agents in a multi-armed bandit problem

Author(s): Madhushani, U; Leonard, Naomi E

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr10w10
Abstract: We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors. Neighbors are defined by a network graph with heterogeneous and stochastic interconnections. These interactions are determined by the sociability of each agent, which corresponds to the probability that the agent observes its neighbors. We design an algorithm for each agent to maximize its own expected cumulative reward and prove performance bounds that depend on the sociability of the agents and the network structure. We use the bounds to predict the rank ordering of agents according to their performance and verify the accuracy analytically and computationally.
Publication Date: 2019
Citation: Madhushani, U, Leonard, NE. (2019). Heterogeneous stochastic interactions for multiple agents in a multi-armed bandit problem. 2019 18th European Control Conference, 3502-3507. doi:10.23919/ECC.2019.8796036
DOI: 10.23919/ECC.2019.8796036
Pages: 3502-3507
Type of Material: Conference Article
Journal/Proceeding Title: 2019 18th European Control Conference
Version: Author's manuscript

Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.