Contextual Bandit

SPARKLE: A Nonparametric Approach for Online Decision-Making with High-Dimensional Covariates

Personalized services are central to today's digital economy, and their sequential decisions are often modeled as contextual bandits. Modern applications pose two main challenges: high-dimensional covariates and the need for nonparametric models to …

Dynamic Selection in Algorithmic Decision-making

This paper identifies and addresses dynamic selection problems in online learning algorithms with endogenous data. In a contextual multi-armed bandit model, a novel bias (*self-fulfilling bias*) arises because the endogeneity of the data influences …