Robotics & AI

Caltech X1: The Multi-modal Machine Walking, Flying, and Rolling into the Future of Industry

Dillip Chowdary

Dillip Chowdary

April 05, 2026 • 10 min read

**Caltech** researchers have unveiled the **X1**, a breakthrough in embodied AI and robotic locomotion. Designed for complex industrial environments, the X1 is the first robot capable of seamlessly transitioning between walking, flying, and rolling modes to overcome any obstacle. This "Swiss Army Knife" of robotics represents a fundamental shift from specialized hardware to generalized, multi-modal platforms powered by foundation models.

1. Multi-modal Locomotion: Beyond Humanoid and Drone Limits

Traditional robots are typically optimized for a single mode of travel. Humanoids (like **Tesla Optimus** or **Figure 02**) walk for dexterity, drones fly for speed and perspective, and AMRs (Autonomous Mobile Robots) roll for energy efficiency. The X1 integrates all three using a modular limb system that features wheels at the base of its four legs and integrated high-density electric thrusters.

On flat factory floors, the X1 rolls like a traditional AMR, consuming minimal power. When faced with stairs or debris, it locks its wheels and transitions to a quadrupedal walk with the agility of a **Boston Dynamics Spot**. For vertical gaps, large obstacles, or rapid reconnaissance, it deploys its thrusters for sustained flight. This versatility makes it the ultimate platform for search-and-rescue, hazardous waste management, and automated infrastructure maintenance.

2. Foundation Model-Driven Autonomy: The "Brain" of the X1

The intelligence behind the X1 is a specialized **Robotics Foundation Model (RFM)** that has been trained on millions of hours of physics simulations and real-world sensor data. Unlike legacy robots that require scripted behaviors for every movement, the X1 does not need to be told how to navigate a new environment. It perceives its surroundings using a multimodal sensor suite—including LiDAR, thermal cameras, and binaural audio—and autonomously selects the most efficient locomotion mode.

The RFM utilizes "System 2" reasoning to anticipate obstacles. For example, if the robot identifies a slippery surface, it will proactively lower its center of gravity and increase wheel friction before contact. If it detects a structural gap that is too wide to jump, it will calculate the energy requirements for flight and execute a takeoff without human intervention. This real-time decision-making capability is what separates the X1 from previous multi-modal prototypes.

3. Hazardous Environments: The X1's Killer App

In industrial settings, the X1 can perform inspections in areas that are inaccessible or too dangerous for humans. Caltech has already demonstrated the X1 performing autonomous maintenance tasks in a decommissioned nuclear facility. The robot was able to fly over high-voltage equipment, walk through radioactive rubble, and roll along narrow catwalks to close a leaking valve—a task that would have previously required a team of specialized, single-mode robots.

The X1's chassis is radiation-hardened and features a modular payload bay, allowing it to carry sensors for gas detection, structural integrity analysis, or emergency medical supplies. Its ability to "become what the environment requires" makes it an invaluable asset for first responders and industrial safety officers.

4. The Path to Commercialization

While the X1 is currently a research project at Caltech's **Center for Autonomous Systems and Technologies (CAST)**, several aerospace and defense contractors have already expressed interest in licensing the technology. The modular nature of the X1's limbs and thrusters means that it can be scaled up for heavy cargo transport or scaled down for indoor search operations.

The real value, however, lies in the RFM software. Caltech plans to release an "X1 SDK" later this year, allowing third-party developers to build custom "skills" for the robot, such as specialized welding or precision planting for vertical farms. This ecosystem approach could turn the X1 into the primary hardware platform for the emerging Embodied AI industry.

Conclusion: The Future of Embodied AI

The Caltech X1 represents a shift from "specialized" to "generalized" robotic hardware. By combining AI-driven autonomy with multi-modal physical capabilities, Caltech has created a platform that can finally keep up with the complexity of the real world. The X1 is not just a robot; it is the blueprint for the next generation of industrial workers—machines that are as adaptable as humans but with the durability and speed of advanced hardware.

As embodied AI continues to mature, the distinction between "robot" and "intelligence" will continue to blur. The X1 is the first machine to prove that with enough compute and the right physical architecture, no environment is off-limits.