Integration of Large Language Models with Proximal Policy Optimization for Autonomous Mobile Robot Control in Dynamic Environments

Chae, SongHwa; Lim, Yujin

doi:10.3745/JIPS.04.0372

상세 보기

Integration of Large Language Models with Proximal Policy Optimization for Autonomous Mobile Robot Control in Dynamic Environments

Chae, SongHwa;
Lim, Yujin

Citations

WEB OF SCIENCE

0

Citations

SCOPUS

0

초록

Autonomous mobile robots (AMRs) must operate in dynamic, unstructured environments where traditional control and reinforcement learning (RL) face adaptability and reward design limitations. This study proposes a hybrid framework combining proximal policy optimization (PPO) with a large language model (LLM) as an adaptive reward designer. Using GPT-4o-mini, the LLM dynamically shapes rewards based on performance logs, improving exploration and stability. Experiments in complex indoor navigation show the LLM-PPO model reduces collisions by 38%, shortens completion time by 21%, and increases rewards by 8% over PPO. Results demonstrate LLM-RL integration enhances safety, efficiency, and consistency, offering a promising paradigm for AMR control.

키워드

Autonomous Mobile Robots; Dynamic Environments; Large Language Models; Proximal Policy Optimization; Reinforcement Learning; Reward Shaping

제목: Integration of Large Language Models with Proximal Policy Optimization for Autonomous Mobile Robot Control in Dynamic Environments

저자: Chae, SongHwa; Lim, Yujin

DOI: 10.3745/JIPS.04.0372

발행일: 2026-04

유형: Article

저널명: JIPS(Journal of Information Processing Systems)

권: 22

호: 2

페이지: 197 ~ 208