OpenAI is launching a new general purpose AI agent in ChatGPT, which the company says can complete a wide variety of computer-based tasks on behalf of users. OpenAI says the agent can automatically navigate a user’s calendar, generate editable presentations and slideshows, and run code.

The tool, called ChatGPT agent, combines several capabilities from OpenAI’s previous agentic tools, including Operator’s ability to click around on websites, as well as Deep Research’s ability to synthesize information from dozens of websites into a concise research report. OpenAI says users will be able to interact with the agent simply by prompting ChatGPT in natural language.

ChatGPT agent is rolling out on Thursday to subscribers to OpenAI’s Pro, Plus, and Team plans. To activate the tool, users can select “agent mode” in ChatGPT’s dropdown menu of tools.

ChatGPT can now do work for you using its own computer.

Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths. pic.twitter.com/7uN2Nc6nBQ
— OpenAI (@OpenAI) July 17, 2025

The launch of ChatGPT agent represents OpenAI’s boldest attempt yet to turn ChatGPT into an agentic product that can take actions and offload tasks for users, rather than just answering questions. In recent years, Silicon Valley companies including OpenAI, Google, and Perplexity have unveiled dozens of AI agents that have promised to do just that. However, these early version AI agents have proven to struggle with complex tasks, and they seem less compelling as products than the ultimate vision tech executives pitch around AI agents.

That said, OpenAI says ChatGPT agent is far more capable than its previous offerings.

The company’s new agent can access ChatGPT connectors, allowing users to connect apps like Gmail and GitHub so that the agent can find relevant information to your prompts. OpenAI says ChatGPT agent has access to a terminal, and it can use APIs to access certain apps.

OpenAI suggests that users can tap ChatGPT agent to “plan and buy ingredients to make Japanese breakfast for four,” as well as “analyze three competitors and create a slide deck.” These kinds of capabilities requires ChatGPT agent to parse through websites, plan a course of action, and use tools — much more complicated tasks than OpenAI has previously tried to tackle with agents.

Techcrunch event

San Francisco | October 27-29, 2025

REGISTER NOW

The model underlying ChatGPT agent offers state-of-the-art performance on several benchmarks, according to OpenAI.

The company says the ChatGPT agent model scores 41.6% on Humanity’s Last Exam (pass@1), a difficult test made up of thousands of questions across more than one hundred subjects. That’s roughly double what OpenAI’s o3 and o4-mini scored on the test.

On FrontierMath, one of the hardest known math benchmarks, OpenAI says ChatGPT agent scores 27.4% when it has access to tools, such as a terminal for code execution. The previous state-of-the-art score comes from o4-mini, which scored just 6.3%.

OpenAI notes that it developed ChatGPT agent with safety in mind, largely because the product presents some newfound capabilities that could make it more dangerous in the hands of a bad actor. OpenAI has previously warned that agentic models could present more dangerous capabilities.

In a safety report for ChatGPT agent, OpenAI says it’s designated the model as “high capability” in biological and chemical weapon domains, which is defined in OpenAI’s Preparedness Framework as a model with the ability to “amplify existing pathways to severe harm.” OpenAI notes that it does not have direct evidence of this, but it’s decided to take a precautionary approach and activate new safeguards to mitigate these risks.

The new safeguards for ChatGPT agent include a monitor that works in real time as users interact with the product. OpenAI says it runs a classifier across every prompt entered into ChatGPT agent, determining whether the request is related to biology. If so, OpenAI runs ChatGPT agent’s response through a second monitor that determines whether the content could be used to evoke a biological threat.

OpenAI also says it disabled ChatGPT’s memory feature for this agent to prevent misuse. In other parts of ChatGPT, OpenAI’s memory feature allows the chatbot to reference information from previous user chats. However, OpenAI says bad actors could use the feature in ChatGPT agent to exfiltrate sensitive data through prompt injection attacks. The company says it may revisit adding the feature in the future, however.

While ChatGPT agent sounds impressive, it remains to be seen how capable it truly is in the real world. Until now, agent technology has proven relatively brittle when interacting with the real world. That said, OpenAI says it’s developed a more capable model that’s able to deliver on the promise of AI agents.

This story was updated with more information.