Nvidia reveals new Cosmos models and infrastructure for robotics and physical applications

Nvidia introduces Cosmos Reason, a 7-billion-parameter model for robotics, enhancing AI capabilities with new libraries and servers.

: Nvidia unveiled new AI models, highlighted by Cosmos Reason, a 7-billion-parameter model designed for robotics and AI applications. The announcement also included Cosmos Transfer-2, which optimizes synthetic data generation and offers a distilled version for improved speed. Alongside these models, Nvidia introduced neural reconstruction libraries and new servers such as the RTX Pro Blackwell Server for robotics workflows. These innovations position Nvidia to expand AI uses beyond data centers into robotics and physical applications.

Nvidia has made a strategic move into the robotics and physical AI application domains by unveiling its latest Cosmos models and accompanying infrastructure. This announcement was made at the SIGGRAPH conference on Monday. At the forefront of this release is the Cosmos Reason, a 7-billion-parameter reasoning vision language model that aims to enhance a robot's ability to understand and interact with physical environments. As described by Nvidia, Cosmos Reason integrates memory and physics comprehension to support planning and reasoning processes in AI agents, allowing for applications such as data curation, robot planning, and video analytics.

Accompanying Cosmos Reason is the Cosmos Transfer-2, designed to accelerate synthetic data generation from 3D simulation scenes or through spatial control inputs. This model also offers a distilled version that prioritizes speed, allowing developers to create datasets for training more efficiently. The role of these models in generating synthetic text, images, and video datasets is critical as they lay the foundation for training AI agents and robots.

Nvidia's initiative goes beyond just model updates; they also announced new neural reconstruction libraries. These libraries include rendering techniques aimed at simulating the real world in 3D using sensor data. The integration of this technology into the open-source simulator CARLA exemplifies Nvidia's commitment to providing the tools necessary for developers to create realistic simulations. Furthermore, Nvidia has made an update to its Omniverse software development kit, enhancing the capabilities available to developers using this platform.

To support these software advancements, Nvidia presented new server solutions for robotics workflows. The Nvidia RTX Pro Blackwell Server represents a versatile architecture addressing the demands of robotic development applications. Additionally, the Nvidia DGX Cloud introduces a cloud-based management platform, enabling scalable and efficient management of these robotic workflows.

Nvidia's focus on robotics marks a strategic shift as the company seeks applications for its AI GPUs beyond conventional AI data centers. With the introduction of these models and infrastructure, Nvidia is setting a course towards becoming a dominant player in the physical AI space. This move not only highlights their innovation in model capabilities but also their comprehensive approach to providing both the software and hardware required to advance robotics technology.

Sources: Nvidia, TechCrunch