A curated list of awesome papers on Embodied AI and related research/industry-driven resources, inspired by awesome-computer-vision.
Embodied AI has led to a new breakthrough, and this repository will keep tracking and summarizing the research or industrial progress.
- Contribution is highly welcome and feel free to submit a pull request or contact me.
If you find this repository helpful, please consider Stars ⭐ or Sharing ⬆️.
- CVPR-Workshop
- ICCV-Workshop
- CS539-OregonStateUniversity
- ChatGPT for Robotics: Design Principles and Model Abilities
Please do consider this fantastic paper : Agent AI: Surveying the Horizons of Multimodal Interaction
- Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
- Vision-Language Navigation with Embodied Intelligence: A Survey
- The Rise and Potential of Large Language Model Based Agents: A Survey
- A Survey of Embodied AI: From Simulators to Research Tasks
- A Survey on LLM-based Autonomous Agents
- Mindstorms in Natural Language-Based Societies of Mind
- Tianxing CHEN's repository
- Wenxuan Song's repository
- Qiang (Jony) ZHANG's repository
- GT-RIPL's repository
- Jacob Rintamaki's repository
- Jiankai-Sun's repository
- Yafei Hu's repository
- Thanks to Changan's repository
- Thanks to Rui's repository
- An Interactive Agent Foundation Model
- AutoGen, EcoOptiGen
- AgentTuning: Enabling Generalized Agent Abilities For LLMs
- AgentBench: Evaluating LLMs as Agents
- The Rise and Potential of Large Language Model Based Agents: A Survey
- An Open-source Framework for Autonomous Language Agents
- MetaGPT: Meta Programming for Multi-Agent Collaborative Framework
- AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents
- ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models
- Embodied Task Planning with Large Language Models
- Building Cooperative Embodied Agents Modularly with Large Language Models
- State-Maintaining Language Models for Embodied Reasoning
- Embodied Executable Policy Learning with Language-based Scene Summarization
- Voyager: An Open-Ended Embodied Agent with Large Language Models
- Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning
- Vision-Language Tasks
- Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf
- Language Guided Generation of 3D Embodied AI Environments
- CogAgent: Visual Expert for Pretrained Language Models
- ProAgent: from Robotic Process Automation to Agentic Process Automation
- Waymax: An accelerated simulator for autonomous driving research
- HOW FAR ARE LARGE LANGUAGE MODELS FROM AGENTS WITH THEORY-OF-MIND?
- AgentBench: Evaluating LLMs as Agents
- MINDAGENT: EMERGENT GAMING INTERACTION
- Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI
- Emergent Communication for Embodied Control
- Simple but Effective: CLIP Embeddings for Embodied AI
- Embodied AI-Driven Operation of Smart Cities: A Concise Review
- Modeling Dynamic Environments with Scene Graph Memory
- An Open-source Framework for Autonomous Language Agents
- MetaGPT: Meta Programming for Multi-Agent Collaborative Framework