AVAST: Attentive Variational State Tracker in a Reinforced Navigator

Je-Wei Jang, Mahdin Rohmatillah, Jen-Tzung Chien


Abstract
Recently, emerging approaches have been proposed to deal with robotic navigation problems, especially vision-and-language navigation task which is one of the most realistic indoor navigation challenge tasks. This task can be modelled as a sequential decision-making problem, which is suitable to be solved by deep reinforcement learning. Unfortunately, the observations provided from the simulator in this task are not fully observable states, which exacerbate the difficulty of implementing reinforcement learning. To deal with this challenge, this paper presents a novel method, called as attentive variational state tracker (AVAST), a variational approach to approximate belief state distribution for the construction of a reinforced navigator. The variational approach is introduced to improve generalization to the unseen environment which barely achieved by traditional deterministic state tracker. In order to stabilize the learning procedure, a fine-tuning process using policy optimization is proposed. From the experimental results, the proposed AVAST does improve the generalization relative to previous works in vision-and-language navigation task. A significant performance is achieved without requiring any additional exploration in the unseen environment.
Anthology ID:
2022.aacl-main.33
Volume:
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month:
November
Year:
2022
Address:
Online only
Editors:
Yulan He, Heng Ji, Sujian Li, Yang Liu, Chua-Hui Chang
Venues:
AACL | IJCNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
424–433
Language:
URL:
https://aclanthology.org/2022.aacl-main.33
DOI:
10.18653/v1/2022.aacl-main.33
Bibkey:
Cite (ACL):
Je-Wei Jang, Mahdin Rohmatillah, and Jen-Tzung Chien. 2022. AVAST: Attentive Variational State Tracker in a Reinforced Navigator. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 424–433, Online only. Association for Computational Linguistics.
Cite (Informal):
AVAST: Attentive Variational State Tracker in a Reinforced Navigator (Jang et al., AACL-IJCNLP 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.aacl-main.33.pdf
Software:
 2022.aacl-main.33.Software.zip