Nick Joseph, Head of Pretraining at Anthropic, discusses the evolution and engineering challenges of training large language models, emphasizing the dominance of autoregressive next-token prediction and the importance of scaling compute alongside efficient infrastructure. He also highlights the complexities of alignment, the need for robust evaluation methods, and encourages aspiring AI practitioners to develop strong engineering skills while considering the societal impacts of AI.
In this insightful discussion, Nick Joseph, Head of Pretraining at Anthropic, shares his journey into AI and the evolving landscape of pretraining large language models (LLMs). Starting from his early work at Vicarious on computer vision for robotics, through his time at OpenAI focusing on safety and code models, Nick emphasizes the importance of practical engineering skills over purely academic pursuits. He highlights how pretraining, particularly next-token prediction on vast internet text data, has been central to AI progress, driven by scaling laws that show predictable improvements as compute, data, and model size increase.
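The "predictable improvements" from scaling laws can be made concrete with a toy power-law model. The functional form below is the standard one from the scaling-laws literature; the constants `a` and `b` are purely illustrative, not Anthropic's actual fit:

```python
import math

# Hypothetical power-law scaling: loss falls predictably as compute grows.
# L(C) = a * C**(-b); the constants here are illustrative only.
a, b = 10.0, 0.05

def predicted_loss(compute_flops: float) -> float:
    """Predicted pretraining loss for a given compute budget (in FLOPs)."""
    return a * compute_flops ** (-b)

# Doubling compute shrinks loss by the fixed ratio 2**(-b) -- this
# predictability is what makes scaling laws useful for planning runs.
ratio = predicted_loss(2e21) / predicted_loss(1e21)
assert abs(ratio - 2 ** (-b)) < 1e-9
```

Because the improvement per doubling is a constant ratio, teams can extrapolate from small experiments to decide whether a much larger (and more expensive) run is worth launching.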
Nick explains that while many pretraining objectives were explored historically, autoregressive next-token prediction emerged as the dominant approach due to its empirical success and natural fit for generating coherent text. He stresses that throwing more compute at these models generally yields better results, with architectural tweaks and hyperparameter tuning offering incremental gains. Early on, Anthropic operated with relatively modest resources but focused heavily on efficiency, building custom distributed training infrastructure and closely collaborating with cloud providers to optimize hardware utilization and overcome challenges like network latency and hardware failures.
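The autoregressive objective Nick describes is just average cross-entropy on the next token. A minimal sketch, using a toy three-token vocabulary and hand-picked logits (not any real model's outputs):

```python
import math

def softmax(logits):
    """Convert raw logits to a probability distribution over the vocabulary."""
    m = max(logits)                      # subtract max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def next_token_loss(logit_rows, targets):
    """Average negative log-likelihood of each target token given the
    model's logits at that position -- the autoregressive objective."""
    nll = 0.0
    for logits, target in zip(logit_rows, targets):
        probs = softmax(logits)
        nll += -math.log(probs[target])
    return nll / len(targets)

# Toy example: at each of two positions the model emits logits over a
# 3-token vocabulary, and we score the true next tokens [0, 1].
logit_rows = [[2.0, 0.5, 0.1], [0.1, 3.0, 0.2]]
targets = [0, 1]
loss = next_token_loss(logit_rows, targets)
assert 0.0 < loss < 1.0   # confident (but imperfect) predictions
```

Driving this single scalar down, at ever-larger scale, is what pretraining amounts to; generation then falls out naturally by sampling from the same next-token distribution.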
As pretraining has scaled to thousands of GPUs and beyond, Nick describes how the team has become more specialized, balancing deep expertise with maintaining a holistic understanding of the system. He discusses the complexities of managing large-scale distributed training, including fault tolerance and the need for sophisticated profiling tools. Despite advances, debugging remains a major challenge due to the scale and complexity of the code and hardware stack, requiring engineers who can navigate from high-level model behavior down to low-level hardware issues.
On the topic of alignment, Nick defines it as ensuring AI models share human goals and values, especially as models approach or surpass human-level intelligence. While alignment is primarily addressed in post-training through techniques like reinforcement learning and constitutional AI, there is potential for incorporating alignment signals into pretraining to improve robustness. He acknowledges the difficulty of defining whose values to encode and advocates for democratic control and models that can understand diverse perspectives. Evaluation remains a critical and challenging area, with loss serving as a surprisingly effective metric, though more nuanced, low-noise, and meaningful evaluations are needed for complex tasks.
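One reason loss works surprisingly well as a metric is that small changes in average loss correspond to meaningful changes in perplexity, the effective number of tokens the model is "unsure between". A small worked example (the specific loss values are illustrative):

```python
import math

def perplexity(avg_nll: float) -> float:
    """Perplexity is exp of the average per-token negative log-likelihood
    (in nats); it reads as an effective branching factor."""
    return math.exp(avg_nll)

# A drop in loss from 2.1 to 2.0 nats/token looks small, but it shrinks
# perplexity by roughly 10% -- the metric is low-noise and sensitive.
improvement = 1 - perplexity(2.0) / perplexity(2.1)
assert 0.09 < improvement < 0.10
```

This sensitivity is why loss remains the workhorse during pretraining, even though harder downstream tasks still need the richer, lower-noise evaluations Nick calls for.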
Looking ahead, Nick anticipates continued paradigm shifts, such as increased use of reinforcement learning, but believes scale and careful engineering will remain key drivers. He sees opportunities for startups to innovate in areas that benefit from smarter models and to provide specialized services like hardware validation and debugging support. Finally, reflecting on career advice, Nick encourages aspiring AI practitioners to focus on engineering skills and to think deeply about the societal impacts of AGI, emphasizing that practical problem-solving and ethical considerations will be crucial in shaping the future of AI.