Asymptotically Optimal Contextual Bandit Algorithm Using Hierarchical Structures

IEEE Transactions on Neural Networks and Learning Systems - United States
doi: 10.1109/tnnls.2018.2854796