Skip to main navigation Skip to search Skip to main content

Exploring the Capabilities and Limitations of Large Language Models for Zero-Shot Human-Robot Interaction

  • Kelvin Olaiya
  • , Giovanni Delnevo
  • , Chan Tong Lam
  • , Giovanni Pau
  • , Paola Salomoni
  • University of Bologna
  • Technology Innovation Institute

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Human-robot interaction (HRI) is an evolving field with a growing emphasis on enabling robots to understand and perform tasks based on natural language commands. Recently, Large Language Models (LLMs) have emerged as a promising tool for such tasks, offering the potential to enable zero-shot learning and flexible interaction without task-specific training. In this paper, we explore the use of LLMs for zero-shot navigation and exploration tasks in robotic systems, specifically evaluating their performance with the PR2 Clearpath and Khepera IV robots in a simulated environment. Our findings demonstrate promising results, particularly in the LLM's ability to exhibit exploratory behavior and iterative reasoning when faced with ambiguous or incomplete visual input. These capabilities suggest a strong potential for LLMs in human-robot interaction. However, challenges were also identified, such as difficulties with target recognition, object misidentification, hallucination of information, and issues with movement execution, highlighting the need for improvements in these areas for real-world applications.

Original languageEnglish
Title of host publication30th IEEE Symposium on Computers and Communications, ISCC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331524203
DOIs
Publication statusPublished - 2025
Event30th IEEE Symposium on Computers and Communications, ISCC 2025 - Bologna, Italy
Duration: 2 Jul 20255 Jul 2025

Publication series

NameProceedings - IEEE Symposium on Computers and Communications
ISSN (Print)1530-1346

Conference

Conference30th IEEE Symposium on Computers and Communications, ISCC 2025
Country/TerritoryItaly
CityBologna
Period2/07/255/07/25

Keywords

  • AI for Robotics
  • Human-Robot Interaction
  • Large Language Models
  • Zero-Shot Learning

Fingerprint

Dive into the research topics of 'Exploring the Capabilities and Limitations of Large Language Models for Zero-Shot Human-Robot Interaction'. Together they form a unique fingerprint.

Cite this