OpenAI is launching a new general purpose AI agent in ChatGPT which the company says can complete a wide variety of computer-based tasks on behalf of users. OpenAI says the agent can automatically navigate a user’s calendar, generate editable presentations and slideshows, and run code.
The tool, called ChatGPT agent, combines several capabilities from OpenAI’s previous agentic tools, including Operator’s ability to click around on websites, as well as Deep Research’s ability to synthesize information from dozens of websites into a concise research report. OpenAI says users will be able to interact with the agent simply by prompting ChatGPT in natural language.
On Thursday, OpenAI is rolling out ChatGPT agent to subscribers to its Pro, Plus, and Team plans. To activate the tool, users can select “agent mode” in ChatGPT’s dropdown menu of tools.
The launch of ChatGPT agent represents OpenAI’s boldest attempt yet to turn ChatGPT into an agentic product that can take actions and offload tasks for users, rather than just answering questions. In recent years, Silicon Valley companies including OpenAI, Google, and Perplexity have unveiled dozens of AI agents that have promised to do just that. However, these early version of AI agents have proven to struggle with complex tasks, and seem less compelling as products than the ultimate vision tech executives pitch around AI agents.
That said, OpenAI says ChatGPT agent is far more capable than its previous offerings.
OpenAI’s new agent can access ChatGPT connectors, allowing users to connect apps like Gmail and GitHub so that the agent can find relevant information to your prompts. Furthermore, OpenAI says ChatGPT agent has access to a terminal, and can use APIs to access certain apps.
The model underlying ChatGPT agent offers st …