Multi-Modal Interactive Control of Robotic Arm Based on Offline Large Language Models (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v40i48.42200Abstract
Large Language Models (LLMs) have revolutionized the modern society significantly with the numerous advanced interactions between humans and AI agents, whereas the usage of most large language models including ChatGPT are not friendly open-sourced and must require users paying a lot for such AI service continuously. Therefore, deploying open-sourced large language models on local servers can be considered as an efficient method to design and implement creative embodied AI algorithms with lower cost and more stable free usage. Inspired by this ordinary motivation, we originally propose and implement the “Socratic Models-ChatGLM”, which is a well-performed algorithm for multi-modal interactive control of robotic arm based on offline large language models via the facile PyBullet platform, even presents extraordinary potential to address complicated text-image multi-step long-horizon robotic manipulation tasks.Downloads
Published
2026-03-14
How to Cite
Chen, H. (2026). Multi-Modal Interactive Control of Robotic Arm Based on Offline Large Language Models (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41164–41165. https://doi.org/10.1609/aaai.v40i48.42200
Issue
Section
AAAI Student Abstract and Poster Program