This paper identifies and addresses dynamic selection problems in online learning algorithms with endogenous data. In a contextual multi-armed bandit model, a novel bias (*self-fulfilling bias*) arises because the endogeneity of the data influences …