Amanote Research

Amanote Research

    RegisterSign In

Efficient Counterfactual Learning From Bandit Feedback

Proceedings of the AAAI Conference on Artificial Intelligence
doi 10.1609/aaai.v33i01.33014634
Full Text
Open PDF
Abstract

Available in full text

Date

July 17, 2019

Authors
Yusuke NaritaShota YasuiKohei Yata
Publisher

Association for the Advancement of Artificial Intelligence (AAAI)


Related search

Sample and Feedback Efficient Hierarchical Reinforcement Learning From Human Preferences

2018English

Learning Effective State-Feedback Controllers Through Efficient Multilevel Importance Samplers

International Journal of Control
ControlSystems EngineeringComputer Science Applications
2018English

Bandit Structured Prediction for Neural Sequence-To-Sequence Learning

2017English

Online Learning to Diversify From Implicit Feedback

2012English

A Bandit From Uşak: Acemoğlu Ahmet

Afyon Kocatepe Üniversitesi Sosyal Bilimler Dergisi
2015English

LIMSI Submission for WMT'17 Shared Task on Bandit Learning

2017English

Effects of Mobile Learning in Medical Education: A Counterfactual Evaluation

Journal of Medical Systems
Health Information ManagementMedicineInformation SystemsHealth Informatics
2016English

Learning a Neural Semantic Parser From User Feedback

2017English

Bandit Framework for Systematic Learning in Wireless Video-Based Face Recognition

IEEE Journal on Selected Topics in Signal Processing
Electronic EngineeringSignal ProcessingElectrical
2015English

Amanote Research

Note-taking for researchers

Follow Amanote

© 2025 Amaplex Software S.P.R.L. All rights reserved.

Privacy PolicyRefund Policy