Tech News

Nvidia’s ‘Cosmos’ AI Helps Humanoid Robots Navigate the World

Nvidia announced today that it is releasing a family of basic AI models called Cosmos that can be used to train humanoids, industrial robots, and self-driving cars. While language models learn how to create text by training on large amounts of books, articles, and social media posts, Cosmos is designed to create images and 3D models of the virtual world.

During a keynote presentation at the annual CES conference in Las Vegas, Nvidia CEO Jensen Huang showed examples of Cosmos being used to simulate operations inside warehouses. Cosmos was trained on 20 million hours of real footage of “people walking, hands moving, manipulating,” Jensen said. “It’s not about producing creative content, but teaching AI to understand the virtual world.”

Researchers and startups hope that these kinds of basic models can give robots used in factories and homes more sophisticated capabilities. Cosmos, for example, can produce virtual video boxes that fall on shelves inside a warehouse, which can be used to train a robot to spot hazards. Users can also tune the models using their own data.

Many companies are already using Cosmos, Nvidia said, including humanoid robotics startups Agility and Figure AI and self-driving companies like Uber, Waabi, and Wayve.

Examples of warehouse images produced by Cosmos.

Hosted by Nvidia

Nvidia also announced software designed to help different types of robots learn to perform new tasks more efficiently. The new feature is part of Nvidia’s existing Isaac robot simulation platform that will allow robot builders to take a small number of examples of a desired task, such as grasping an object, and generate large amounts of artificial training data.

Nvidia hopes that Cosmos and Isaac will attract companies that want to build and use human robots. Jensen was joined on stage at CES by life-size models of 14 different humanoid robots developed by companies including Tesla, Boston Dynamics, Agility, and Figure.

Along with Cosmos, Nvidia also announced Project Digits, a $3,000 “personal AI supercomputer” that can run a large language model of up to 200 billion parameters without the need for cloud services from the likes of AWS or Microsoft. It also announced the much-anticipated next-generation RTX Blackwell GPUs, and incoming software tools to help build AI agents.


Source link

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button