Amanote Research

Amanote Research

    RegisterSign In

Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning

doi 10.18653/v1/n18-2125
Full Text
Open PDF
Abstract

Available in full text

Date

January 1, 2018

Authors
Xin WangYuan-Fang WangWilliam Yang Wang
Publisher

Association for Computational Linguistics


Related search

Listen, Watch, Learn: SeisSound Video Products

Seismological Research Letters
Geophysics
2012English

Multi-Task Video Captioning With Video and Entailment Generation

2017English

Locally and Globally Explainable Time Series Tweaking

Knowledge and Information Systems
Information SystemsHuman-Computer InteractionHardwareArchitectureArtificial IntelligenceSoftware
2019English

Switching Locally or Globally

Science
MultidisciplinaryPhilosophy of ScienceHistory
2015English

On Locally and Globally Conformal Kähler Manifolds

Transactions of the American Mathematical Society
MathematicsApplied Mathematics
1980English

Deep Learning Based, a New Model for Video Captioning

International Journal of Advanced Computer Science and Applications
Computer Science
2020English

Thinking and Acting Both Locally and Globally: New Issues for Teacher Education

Journal of Education for Teaching
Education
2011English

Strategies to Be Globally Visible and Locally Engaged

Drying Technology
Theoretical ChemistryChemical EngineeringPhysical
2016English

MeCP2: Phosphorylated Locally, Acting Globally

Neuron
Neuroscience
2011English

Amanote Research

Note-taking for researchers

Follow Amanote

© 2026 Amaplex Software S.P.R.L. All rights reserved.

Privacy PolicyRefund Policy