OpenAI’s open source models are finally here

OpenAI has released open-weight GPT models with 20 billion and 120 billion parameters that offer strong performance comparable to GPT-3.5 and GPT-4 Mini, enabling efficient local use on consumer hardware while prioritizing privacy and cost-effectiveness. Although these models lack built-in safety layers and require developer adaptation for integration and moderation, they represent a significant step toward democratizing advanced AI access and fostering innovation.

OpenAI has released their open-weight GPT OSS models, featuring a 120 billion parameter model and a 20 billion parameter model. The smaller 20 billion parameter model is remarkably efficient, capable of running on consumer-grade hardware including smartphones, while the larger 120 billion parameter model requires more powerful gaming hardware or high-end GPUs like the Nvidia 5090. Both models utilize a mixture of experts architecture, activating only a subset of parameters per request, which helps optimize performance and resource usage. This release is significant for privacy-conscious users and developers who want to run advanced AI models locally without relying on cloud APIs.

Performance-wise, the 20 billion parameter model delivers intelligence comparable to OpenAI’s GPT-3.5 Mini, making it a powerful option for mobile and lightweight applications. The 120 billion parameter model offers performance close to GPT-3 and GPT-4 Mini in many benchmarks, though it is not as strong in some areas like front-end coding tasks. Running the larger model on laptops is challenging due to high memory and processing demands, but it performs well on desktops and cloud providers. The models are also very cost-effective, with inference costs significantly lower than many proprietary alternatives, making them accessible to smaller organizations and emerging markets.

One of the key challenges with these open models is the lack of an intermediary safety and moderation layer that OpenAI traditionally used in their hosted models. This layer filtered unsafe or illegal content before it reached the user. Since the open-weight models are fully accessible, users and providers must implement their own safety mechanisms. The models use a unique bracket-bar syntax for input and output formatting, which requires adaptation by developers to integrate with existing tools and workflows. Tool calling capabilities vary across providers, with some showing reliability issues, but improvements are ongoing.

In terms of coding ability, the GPT OSS models perform moderately well but fall short compared to specialized models like Horizon, which excel at front-end development tasks. The open models tend to produce more errors and require more prompt engineering and provider-specific adjustments to achieve reliable results. Benchmarks show that while these models are not the absolute best in every category, they strike a strong balance between intelligence, efficiency, and cost. They are particularly strong in scientific and health-related tasks, where privacy and local processing are critical.

Overall, OpenAI’s release of these open-weight models marks a major milestone in democratizing access to powerful AI. They enable developers to run advanced language models on personal hardware, fostering innovation and privacy. While not perfect, these models offer a compelling alternative to proprietary APIs, with competitive performance and much lower costs. The community and providers are actively working to improve tooling, safety, and integration, making this an exciting time for AI enthusiasts and developers. Future updates and models like GPT-5 remain topics of interest but are yet to be detailed.