As hardware advances, the “brain” of the robot is catching up. Alibaba has just unveiled Qwen 3.5, a new AI model explicitly built for the “Agentic Era.” Unlike standard chatbots, Qwen 3.5 features native “visual agentic capabilities,” allowing it to see, understand, and operate independently across digital and physical interfaces. This is a crucial step towards true robotic autonomy.
Why It Matters for Robotics
Robots need to understand the world, not just text. Qwen 3.5’s ability to process video, images, and text simultaneously in a single model (native multimodal) means robots can react faster and with more context. Plus, with a claimed 60% reduction in inference costs, deploying smart robots just got significantly cheaper.
This move positions Alibaba as a serious contender against OpenAI and Google DeepMind in the race to build the operating system for physical AI.
Source: Reuters