Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

Daily Publication

Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

Better model and token fusion strategy

Info

Title: Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks
Group: Purdue
Keywords: Speech as feedback
Venue: TMLR

Comments

Method

They provide vocal features to LLM as evidence for decision making.
They idenfity a specific task and dataset called Disfluent Human Audio-Guided Instructions