Introducing OpenAI "Agents API" (computer use, web search, multi-agent, open-source!)

OpenAI introduced its “Agents API,” enabling developers to create autonomous systems that can perform tasks using built-in tools like web search, file search, and computer use. The new “responses API” and the rebranded “agents SDK” facilitate the development of sophisticated applications by allowing multiple agents to work together seamlessly, marking a significant advancement in AI capabilities.

In a recent live stream, OpenAI introduced its new “Agents API,” which aims to empower developers to create autonomous systems capable of performing tasks on behalf of users. The concept of an “agent” is defined as a system that can act independently, utilizing tools, memory, and advanced reasoning to execute complex workflows. The presentation highlighted the importance of building a robust infrastructure around core models, enabling developers to create reliable agents without the hassle of cobbling together various low-level APIs.

The live stream showcased three new built-in tools within the API: the web search tool, the file search tool, and the computer use tool. The web search tool allows models to access real-time information from the internet, enhancing the accuracy and relevance of responses. The file search tool enables users to upload and filter documents, making it easier to retrieve specific information. The computer use tool allows agents to control computers or applications without API access, facilitating automation of tasks across different environments.

The team also introduced the new “responses API,” designed to support multiple interactions and tool calls in a single request. This API allows developers to create more sophisticated applications, such as a personal stylist assistant that can recommend clothing based on user preferences and current trends. The demonstration illustrated how the responses API can integrate various tools, enabling agents to gather information from both user data and web searches seamlessly.

Additionally, OpenAI announced the rebranding of its previous “swarm” SDK to the “agents SDK,” which focuses on agent orchestration. This framework allows developers to create multiple specialized agents that can work together, enhancing the complexity and functionality of applications. The SDK supports Python functions, automatically generating the necessary schemas for model interactions, making it easier for developers to build and test their agents.

In conclusion, OpenAI’s new tools and APIs mark a significant step toward creating more capable and intelligent agents. The emphasis on open-source development and the integration of various functionalities aims to streamline the process for developers, allowing them to build innovative applications that can perform real-world tasks. With the anticipation of 2025 being the “year of the agent,” OpenAI is committed to advancing its technology and providing developers with the resources needed to harness the full potential of AI.