[Master's Thesis] Improving Instruction Generation for Vision-Language Navigation by Reward Designing
A new method to enhance the performance of language generation model for vision-language navigation by using REINFORCE with multi-modal rewards
A new method to enhance the performance of language generation model for vision-language navigation by using REINFORCE with multi-modal rewards
A blue sky paper points out challenges and opportunities in human-robot interaction, in the context of vision-language navigation
A shape completion network utilizes graph attention and surface normal
Vanilla policy gradient methods could solve issues in classical MLE training, but how effective are they?
We compared the performance of visual SLAM packages. As for my task, I built the visual-odometry pipeline from scratch and compared it with SOFT-odometry.