(PDF) Efficient Counterfactual Learning From Bandit Feedback

Efficient Counterfactual Learning From Bandit Feedback

Proceedings of the AAAI Conference on Artificial Intelligence

doi 10.1609/aaai.v33i01.33014634

Full Text

Abstract

Available in full text

Date

July 17, 2019

Authors

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Related search

Sample and Feedback Efficient Hierarchical Reinforcement Learning From Human Preferences

English

Learning Effective State-Feedback Controllers Through Efficient Multilevel Importance Samplers

International Journal of Control

Systems Engineering

Computer Science Applications

English

Bandit Structured Prediction for Neural Sequence-To-Sequence Learning

English

Online Learning to Diversify From Implicit Feedback

English

A Bandit From Uşak: Acemoğlu Ahmet

Afyon Kocatepe Üniversitesi Sosyal Bilimler Dergisi

English

LIMSI Submission for WMT'17 Shared Task on Bandit Learning

English

Effects of Mobile Learning in Medical Education: A Counterfactual Evaluation

Journal of Medical Systems

Health Information Management

Information Systems

Health Informatics

English

Learning a Neural Semantic Parser From User Feedback

English

Bandit Framework for Systematic Learning in Wireless Video-Based Face Recognition

IEEE Journal on Selected Topics in Signal Processing

Electronic Engineering

Signal Processing

English