Embodied AGI Group

team.png

Welcome to the UMass Embodied AGI Group!

Our goal is to develop intelligent agents capable of understanding and interacting with the world in a human-like manner. By combining physical and social intelligence with advanced models, we aim to push the boundaries of embodied general intelligence for real-world and virtual environments.

LGRC A101, 740 N. Pleasant Street, Amherst, MA 01003



Selected Publications

Physical Reasoning and Interaction

  1. RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation

Multimodal Language Models

  1. 3D-LLM: Injecting the 3D World into Large Language Models
    Advances in Neural Information Processing Systems

Learning World Models

  1. 3D-VLA: A 3D Vision-Language-Action Generative World Model

Large Language Models

  1. Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
    Advances in Neural Information Processing Systems

Audio-Visual Learning

  1. soundofpixels.png
    The Sound of Pixels
    In The European Conference on Computer Vision (ECCV)