One-Point Sampling for Distributed Bandit Convex Optimization with Time-Varying Constraints

This paper considers the distributed bandit convex optimization problem with
time-varying constraints. In this problem, the global loss function is the
average of all the local convex loss functions, which are unknown beforehand.
Each agent iteratively makes its own decision subject to time-varying
inequality constraints which can be violated but are fulfilled in the long run.
For a uniformly jointly strongly connected time-varying directed graph, a
distributed bandit online primal-dual projection algorithm with one-point
sampling is proposed. We show that sublinear dynamic network regret and network
cumulative constraint violation are achieved if the path-length of the
benchmark also increases in a sublinear manner. In addition, an
$\mathcal{O}({T^{3/4 + g}})$ static network regret bound and an $\mathcal{O}(
{{T^{1 – {g}/2}}} )$ network cumulative constraint violation bound are
established, where $T$ is the total number of iterations and $g \in ( {0,1/4}
)$ is a trade-off parameter. Moreover, a reduced static network regret bound
$\mathcal{O}( {{T^{2/3 + 4g /3}}} )$ is established for strongly convex local
loss functions. Finally, a numerical example is presented to validate the
theoretical results.

Este artículo explora los viajes en el tiempo y sus implicaciones.

Descargar PDF:

2504.16211v2

One-Point Sampling for Distributed Bandit Convex Optimization with Time-Varying Constraints

Plataforma Online

Enlaces

Verbalus Mater

One-Point Sampling for Distributed Bandit Convex Optimization with Time-Varying Constraints

One-Point Sampling for Distributed Bandit Convex Optimization with Time-Varying Constraints

Plataforma Online

Enlaces

Verbalus Mater

Signo en

Regístrate

— PRÓXIMO CURSO ONLINE EMPIEZA EL 15 DE ENERO —

La Ciencia Real Detrás de los Viajes Temporales 25% DTO

La Ciencia Real Detrás de
los Viajes Temporales
25% DTO