Spoken Dialog - Honda Research Institute USA

Spoken Dialog

Spoken Dialog

Probabilistic model to track dialog state and provide information to driver viaspoken dialog

We built a belief maintenance and update system to track user goals during interaction. We learnt a policy to select optimal actions based on the belief. We developed Dynamic Probabilistic Ontology Trees (POT), a new probabilistic model to track dialog state. Our model captured both the user goal and the history of user dialog acts using a unified Bayesian Network. We performed efficient inference using a form of blocked Gibbs sampling designed to exploit the structure of the model.

Later, we combined this DPOT semantic belief tracker for categorical concepts with a kernel density estimator that incorporated landmark evidence from multiple turns and landmark hypotheses, into a posterior probability over candidate locations. Used a deterministic policy to select actions based on the belief. Our system was demonstrated via android app.

We also built a hybrid system to send text messages by voice, where we used Google Speech API to recognize general messages. We used Nuance with our own language model as a fallback technique when Google Speech confidence was low, such as when there were corrections starting with No.

Publications

Open Journal Vehicular Technology 2024 2025
Samuel Thornton, Nithin Santhanam, Rajeev Chhajer, Sujit Dey
IEEE Conference on Decision and Control (CDC) 2024
Sooyung Byeon, Danyang Tian, Jackie Ayoub, Miao Song, Ehsan Moradi Pari, Inseok Hwang
Neural Information Processing Systems (NeurIPS), 2024. 2024
Seunggeun Chi, Pin-Hao Huang, Enna Sachdeva, Hengbo Ma, Karthik Ramani, Kwonjoon Lee
NeurIPS 2024 2024
Huao Li, Hossein Nourkhiz Mahjoub, Behdad Chalaki, Vaishnav Tadiparthi, Kwonjoon Lee, Ehsan Moradi-Pari, Charles Michael Lewis, Katia P. Sycara
Robotics and Automation Letters (RA-L) 2024
Jinning Li, Jiachen Li, Sangjae Bae, and David Isele
Conference on Robot Learning (CoRL) 2024 Learning Robot Fine and Dexterous Manipulation Workshop 2024
Thomas Power, Abhinav Kumar, Fan Yang, Sergio Aguilera Marinovic, Soshi Iba, Rana Soltani Zarrin, Dmitry Berenson
Empirical Methods in Natural Language Processing (EMNLP 2024) 2024
Muhan Lin, Shuyang Shi, Yue Guo, Behdad Chalaki, Vaishnav Tadiparthi, Simon Stepputtis, Joseph Campbell, Katia P. Sycara, Ehsan Moradi-Pari
Frontiers in Robotics and Automation 2024
Hifza Javed, Weinan Wang, Affan Bin Usman, and Nawid Jamali
International Journal on Robotics Research 2024
Muchen Sun, Francesca Baldini, Pete Trautman, Todd Murphey
Robotics and Automation Letters (RA-L) 2024
Mansur M. Arief, Mike Timmerman, Jiachen Li, David Isele, and Mykel J. Kochenderfer
Nat. Commun. 15, 10080 (2024) 2024
Xufan Li, Samuel Wyss, Emanuil Yanev, Qing-Jie Li, Shuang Wu, Yongwen Sun, Raymond R. Unocic, Joseph Stage, Matthew Strasbourg, Lucas M. Sassi, Yingxin Zhu, Ju Li, Yang Yang, James Hone, Nicholas Borys, P. James Schuck, Avetik R. Harutyunyan
NeurIPS 2024 Workshop Open-World Agents 2024
Nikki_Lijing_Kuang, Songpo Li, Soshi Iba
Intelligent Robots and Systems (IROS) 2024
Hongyu Li, Snehal Dikhale, Jinda Cui, Soshi Iba, and Nawid Jamali
IROS 2024 2024
Viet-Anh Le​, Vaishnav Tadiparthi, Behdad Chalaki,​ Hossein Nourkhiz Mahjoub, Jovin D’sa, Ehsan Moradi-Pari​
arXiv preprint arXiv:2409.09415 (2024) 2024
Lingo, Ryan, Martin Arroyo, and Rajeev Chhajer
Conference on Robot Learning (CoRL) 2024
Patrick Naughton, Jinda Cui, Karankumar Patel, and Soshi Iba
European Conference on Computer Vision (ECCV), 2024 2024
Yuchen Yang, Kwonjoon Lee, Behzad Dariush, Yinzhi Cao, Shao-Yuan Lo
European Conference on Computer Vision (ECCV), 2024 2024
Seunggeun Chi, Hyung-gun Chi, Hengbo Ma, Nakul Agarwal, Faizan Siddiqui, Karthik Ramani, Kwonjoon Lee
European Conference on Computer Vision (ECCV), 2024 2024
Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun