Skip to content

Askdroid

Menu
  • Home
    • About Us
    • Contact us
  • AI
  • Robotics
  • Podcasts
  • News
  • Blog
Menu
  • Home
    • About Us
    • Contact us
  • AI
  • Robotics
  • Podcasts
  • News
  • Blog
Physical Intelligence 1 768x375
Previous Next
Ai Category: Vision-Language-Action ModelsAi Tags: Embodied AI Foundation Model Manipulation Navigation research vision-language-action
  • Profile
  • Title
  • Short Description
  • Description
  • Tags
  • Company Name
  • Category
  • Country
  • License
  • Stage
  • Model Size
  • Hardware Requirement
  • API
  • Documentation
  • GitHub
  • Paper / Publication
  • Robots Using

?0.5 keeps the ?0 recipe — a vision-language-model backbone with a flow-matching action expert — but is built around generalisation to unseen environments. Where ?0 and most contemporary VLAs are evaluated in settings that closely match training, ?0.5 is trained on a deliberately heterogeneous mixture of knowledge sources: multi-robot demonstrations, web-scale vision-language data, verbal-feedback data and high-level semantic subtask labels. The model can break a long-horizon instruction into intermediate subtasks and act on them, and can improve from verbal corrections given by a human. Physical Intelligence reports that ?0.5 can control a mobile manipulator to tidy kitchens and bedrooms it has never seen before, which the company frames as a meaningful step toward broadly generalisable physical intelligence. Limitations: it remains a research model; the company notes substantial work remains on knowledge transfer, autonomous self-improvement and reliability; and unlike ?0, full open weights for ?0.5 were not released on the same terms at announcement, so teams should verify current availability. A later ?*0.6 model — a VLA that learns from its own experience — has since been described by the company.

Physical Intelligence ?0.5

?0.5 extends Physical Intelligence's ?0 model with a focus on open-world generalisation. It is designed to control a mobile manipulator in entirely new homes -- cleaning an unseen kitchen or bedroom -- rather than only environments close to its training data.

?0.5 keeps the ?0 recipe — a vision-language-model backbone with a flow-matching action expert — but is built around generalisation to unseen environments. Where ?0 and most contemporary VLAs are evaluated in settings that closely match training, ?0.5 is trained on a deliberately heterogeneous mixture of knowledge sources: multi-robot demonstrations, web-scale vision-language data, verbal-feedback data and high-level semantic subtask labels. The model can break a long-horizon instruction into intermediate subtasks and act on them, and can improve from verbal corrections given by a human. Physical Intelligence reports that ?0.5 can control a mobile manipulator to tidy kitchens and bedrooms it has never seen before, which the company frames as a meaningful step toward broadly generalisable physical intelligence. Limitations: it remains a research model; the company notes substantial work remains on knowledge transfer, autonomous self-improvement and reliability; and unlike ?0, full open weights for ?0.5 were not released on the same terms at announcement, so teams should verify current availability. A later ?*0.6 model — a VLA that learns from its own experience — has since been described by the company.

Embodied AI, Foundation Model, Manipulation, Navigation, research, and vision-language-action
Physical Intelligence
Vision-Language-Action Models
United States
Closed / research (verify current weight availability)
Research prototype
Undisclosed
Cloud / high-end GPU (not publicly benchmarked)
None (research release; check openpi repository)
Documentation URL
GitHub URL
Physical Intelligence, 'pi-0.5: a VLA with Open-World Generalization' (2025) - arXiv:2504.16054
Mobile manipulators in Physical Intelligence's research fleet (wheeled bases with bimanual arms) shown tidying unseen homes. Research demonstrations only -- no commercial deployments disclosed.

Recent Posts

  • Wayve Robotaxi: How a Cambridge Startup Is Rivaling Waymo Without a Single LiDAR
  • Versius Plus and the Gynecology Frontier: CMR Surgical’s FDA Submission and the Future of U.S. Surgical Robotics
  • Autonomous Drone Inspection in 2026: How Industrial Drones Are Replacing Human Inspectors
  • Amazon Sequoia: The Next-Generation Warehouse Robot Arriving in 2026
  • Pudu Robotics Raises 50M and Pivots to Industrial AMR Market in 2026

Recent Comments

No comments to show.

Archives

  • May 2026
  • April 2026
  • October 2024
  • August 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023

Categories

  • Blog
  • News
  • Podcast
  • Uncategorized

Agriculture & Farming
AI Software & SaaS
Autonomous Systems
Aviation & Aerospace
Civil Engineering & Geospatial
Construction & Infrastructure
Defense & Security
Energy & Renewables
General Purpose & Humanoid
Hardware & Components
Healthcare & Medical
Hospitality & Wellness
Industries
Logistics & Warehousing
Manufacturing & Industrial
Product Type
Public Safety & Emergency
R&D & Developer Tools
Robotics Integration & Services
Robots & Automated Systems

Edge AI Hardware for Droids
Motion Planning & Control
Multimodal LLMs for Embodied AI
Robot Foundation Models
Safety & Alignment for Physical Robots
Simulation Platforms
Speech & Dialogue for Droids
Teleoperation & Data Collection Tools
Vision & Perception AI
Vision-Language-Action Models

Let's get in touch with us

At the intersection of innovation and technology, we are pioneers crafting a landscape for the digital age.
Please enable JavaScript in your browser to complete this form.
Name *
Loading

Contact Us

Call Us

+44 (0) 1483 870170

Email:

info@askdroid.com

Follow Us on

Copyright © 2026, Askdroid. All Rights Reserved
  • Home
    • About Us
    • Contact us
  • AI
  • Robotics
  • Podcasts
  • News
  • Blog
Change Location
Find awesome listings near you!