OpenAI founder admits AI isn’t working

OpenAI co-founder Andre Karpathy candidly acknowledged that while AI coding tools have improved productivity, they remain unreliable and prone to errors, often producing bloated and brittle code that requires careful oversight. He emphasized the evolving challenges for developers in collaborating with AI, advocating for new hiring practices focused on project-based assessments and precise specifications, while admitting that the future of AI in software engineering remains uncertain.

Andre Karpathy, a co-founder of OpenAI and a key figure in modern AI development, shared candid insights about the current state of AI coding tools at a conference. He revealed that he has stopped meticulously checking AI-generated code because the tools have improved to the point where constant corrections are less necessary. However, this shift is framed as a productivity tip rather than a sign of perfection, suggesting a trade-off where developers might be giving up on thorough review. Despite this, Karpathy highlighted a paradox: while AI models can handle complex tasks like refactoring massive codebases, they still struggle with simple problems, such as counting letters in a word or making basic decisions.

Karpathy recounted an experience with an AI agent making a nonsensical assumption in an app he was developing, which could have led to serious issues if unnoticed. This anecdote underscores the fragility and unpredictability of AI-generated code, which, despite its impressive capabilities, can produce critical errors. He described the code produced by AI as often bloated, repetitive, and brittle—functional but far from elegant or reliable. This reality contrasts sharply with the hype around AI, revealing a hidden struggle among engineers who must manage and mitigate these imperfections while maintaining productivity.

The conversation also touched on the evolving nature of software development workflows. Karpathy humorously suggested that future engineering teams might need medical support on hand due to the stress induced by reviewing AI-generated code. He emphasized that pair programming and standups might transform to accommodate the new challenges posed by AI-assisted coding, blending technical work with health-conscious practices. This metaphor highlights the emotional and cognitive toll that current AI tools can impose on developers, reflecting a broader industry tension between innovation and practical usability.

Karpathy shared practical advice for software developers seeking employment in this AI-driven landscape. He noted that many companies have yet to adapt their hiring processes to evaluate “agentic engineering” skills—those involving effective collaboration with AI agents. Instead of traditional coding puzzles, he advocates for project-based assessments where candidates write detailed specifications and manage complex projects, such as building a secure Twitter clone. Mastery in writing precise specs and anticipating edge cases is becoming a critical skill, as it directly impacts the quality of AI-generated code and the efficiency of the development process.

Ultimately, Karpathy expressed a sense of uncertainty about the future of AI in software engineering. When asked about valuable skills to learn moving forward, he admitted that even experts like himself do not have definitive answers. This honesty reflects the nascent and rapidly evolving nature of AI technology, where many challenges remain unresolved. For those feeling uncertain about their place in this shifting landscape, Karpathy’s openness offers reassurance that confusion and adaptation are part of the journey as the industry continues to explore the potential and limitations of AI-assisted development.