Our AI Engineer Curriculum Leaked

The video outlines a free AI Engineer Curriculum focused on transitioning from full stack to AI engineering by leveraging existing large language models, Retrieval Augmented Generation techniques, and vector stores for building practical AI applications. It emphasizes the importance of hands-on projects, proper data handling, model configuration, observability, and evaluation, while encouraging learners to actively build real-world AI solutions to enhance their careers.

The video provides a comprehensive breakdown of the essential skills, concepts, and projects needed to transition from full stack engineering to AI engineering. The speaker emphasizes that while the curriculum is shared for free, the real value of paid programs lies in guidance, troubleshooting, and hands-on support. The speaker’s own journey involved trial and error, building AI agents used in production, and leveraging these skills to boost career opportunities. Understanding large language models (LLMs) at a high level is crucial, as AI engineers typically build applications on top of existing models like OpenAI, Anthropic, or Grock rather than creating models from scratch.

A key concept introduced is Retrieval Augmented Generation (RAG), which involves collecting relevant data and feeding it into vector stores—specialized databases designed for semantic search. The speaker provides a practical example project using their own LinkedIn posts and articles to create an AI that mimics their writing style. This approach can be adapted to any data source, such as PDFs, company documents, or Jira tickets, making it a versatile playground for learning. The importance of chunking data properly before embedding it into vector stores like Pinecone, Weaviate, or Quadrant is stressed to maintain context and improve search relevance.

Vector stores work by converting text into numerical vectors using embedding models, enabling semantic similarity searches rather than exact keyword matches. This allows AI systems to find contextually relevant information even if the exact words differ. The speaker advises against using certain vector databases like Chroma or Google’s Vertex AI due to usability issues, recommending instead the more established options. Once data is embedded and stored, queries are embedded similarly and matched against stored vectors to retrieve the most relevant pieces of information, which are then passed to a large language model to generate context-aware responses.

The video also covers practical implementation details, such as using SDKs like Vercel’s AI SDK for TypeScript or LangChain for Python to interact with LLMs programmatically. The speaker highlights the importance of structured output using schema validation tools like Zod to ensure reliable and predictable AI responses. They also discuss configuring model parameters like temperature and token limits to control response creativity and length. Budgeting for API usage is recommended, with $5 to $20 being sufficient for most projects, emphasizing that building functional AI applications is accessible and affordable.

Finally, the speaker underscores the necessity of observability and evaluation in AI projects. Tools like Langsmith provide monitoring of model calls, user interactions, token usage, and latency, which are critical for maintaining quality. Evaluation methods such as “LLM as judge” involve scoring AI outputs against a golden set of answers to detect model drift or degradation early. The speaker encourages building real projects like chatbots despite skepticism, as these skills are increasingly in demand and will likely become foundational in software engineering. They conclude by inviting viewers to join their live sessions and reach out for further guidance, stressing that now is the ideal time to learn AI engineering for career growth.