Skip to content
@InternRobotics

Intern Robotics

Building inclusive infrastructure for Embodied AI, from Shanghai AI Lab.

Toolchain - Training, Inference and Evaluation

  • InternUtopia: A simulation platform for versatile Embodied AI research and developments.
  • InternManip: An all-in-one robot manipulation learning suites (5 pretrained models, 3 benchmarks, and more coming soon).
  • InternNav: A open platform for building generalized navigation foundation models (with 6 mainstream benchmarks and 10+ baselines).
  • InternHumanoid: A versatile, all-in-one toolbox for whole-body humanoid robot contorl.
  • InternSR: A open-source toolbox for vision-based embodied spatial intelligence.

Models, Datasets and Research

  • Humanoids/Legged Robots

    • Datasets:
      • InternData-H1: The largest open-sourced 3D human motion dataset with text annotation, including 2.5k hours 1.9M episodes.
    • Models and Research:
      • UniHSI: Unified Human-Scene Interaction via Prompted Chain-of-Contacts
      • HIMLoco: Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response
      • 🏆HoST [Best Systems Paper Finalist at RSS 2025]: Learning Humanoid Standing-up Control across Diverse Postures
      • HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit
  • Manipulation

    • Datasets:
      • InternData-A1: A hybrid synthetic-real manipulation dataset integrating 5 heterogeneous robots, 15 skills, and 200+ scenes, emphasizing multi-robot collaboration under dynamic scenarios.
      • InternData-M1: A large-scale synthetic dataset for generalizable pick-and-place over 80K objects, with open-ended instructions covering object recognition, spatial and commonsense reasoning, and long-horizon tasks.
    • Models and Research:
      • InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
      • InternVLA-M1: An Spatially Grounded Foundation Model for Generalist Robot
      • F1-VLA: Visual foresight generation for planning-based control
      • VLAC: A generalist vision-language-action-critic model for robotic real-world reinforcement learning
      • Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
      • RoboSplat: Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation
      • GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
  • Navigation

    • Datasets:
      • InternData-N1: A high-quality navigation dataset with the most diverse scenes and extensive randomization across embodiments/viewpoints, including 3k+ scenes and 830k VLN data.
    • Models and Research:
      • InternVLA-N1: An Open Dual-System Vision-Language Navigation Foundation Model with Learned Latent Plans
      • NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance
      • StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling
      • VLN-PE: A Holistic Study of Physical and Visual Disparities in Vision-and-Language Navigation
  • AIGC for Embodied AI

    • Datasets:
      • OmniWorld: A large-scale, multi-domain, multi-modal dataset, enables significant performance improvements in 4D reconstruction and video generation.
    • Models and Research:
      • MeshCoder: Generate Structured 3D Object Blender Code from Point Clouds
      • Infinite-Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation
      • Aether: Geometric-Aware Unified World Modeling
  • 3D Vision and Embodied Perception

    • EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
    • 🏆PointLLM [Best Paper Candidate at ECCV 2024]: Empowering Large Language Models to Understand Point Clouds
    • MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
    • OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
  • 3D Assets for Embodied AI

    • InternScenes: A large-scale interactive indoor scene dataset with realistic layouts, 40,000 diverse scenes and 1.96M 3D objects.

Pinned Loading

  1. InternUtopia InternUtopia Public

    A simulation platform for versatile Embodied AI research and developments.

    Python 1k 63

  2. InternScenes InternScenes Public

    InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.

    Python 174 5

  3. InternManip InternManip Public

    An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.

    Python 136 9

  4. InternNav InternNav Public

    InternRobotics' open platform for building generalized navigation foundation models.

    Python 306 23

  5. InternHumanoid InternHumanoid Public

    A versatile, all-in-one toolbox for whole-body humanoid robot control.

    Python 110 3

  6. InternSR InternSR Public

    InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.

    Python 41

Repositories

Showing 10 of 48 repositories
  • InternVLA-M1 Public

    InternVLA-M1: A Spatially Grounded Foundation Model for Generalist Robot Policy

    InternRobotics/InternVLA-M1’s past year of commit activity
    Python 120 MIT 2 2 0 Updated Sep 28, 2025
  • StreamVLN Public

    Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

    InternRobotics/StreamVLN’s past year of commit activity
    Python 237 13 8 1 Updated Sep 28, 2025
  • MesaTask Public

    [NeurIPS 2025 Spotlight] MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

    InternRobotics/MesaTask’s past year of commit activity
    Python 16 MIT 0 1 0 Updated Sep 28, 2025
  • InternNav Public

    InternRobotics' open platform for building generalized navigation foundation models.

    InternRobotics/InternNav’s past year of commit activity
    Python 306 23 7 0 Updated Sep 28, 2025
  • VLAC Public

    VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

    InternRobotics/VLAC’s past year of commit activity
    Python 161 MIT 4 3 0 Updated Sep 27, 2025
  • .github Public
    InternRobotics/.github’s past year of commit activity
    0 3 0 0 Updated Sep 26, 2025
  • EgoHOD Public

    Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024

    InternRobotics/EgoHOD’s past year of commit activity
    Python 22 Apache-2.0 2 1 0 Updated Sep 24, 2025
  • NavDP Public

    Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"

    InternRobotics/NavDP’s past year of commit activity
    Python 237 10 5 0 Updated Sep 23, 2025
  • InternHumanoid Public

    A versatile, all-in-one toolbox for whole-body humanoid robot control.

    InternRobotics/InternHumanoid’s past year of commit activity
    Python 110 MIT 3 1 0 Updated Sep 19, 2025
  • InternRobotics/internvla-m1.github.io’s past year of commit activity
    Astro 0 0 0 0 Updated Sep 19, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.