regret bounds Papers - BytesArchive

Optimal cross-learning for contextual bandits with unknown context distributions

root January 4, 2024 0

The paper by Jon Schneider and Julian Zimmert from Google Research addresses the problem of designing contextual bandit algorithms in cross-learning settings, where the learner observes the loss for the…

Press ESC to close

regret bounds

Please allow ads on our site