πŸ—οΈ 🚒 πŸš€ RAG: The 2025 Best-Practice Infra Stack


​

Hey, AIM community!

​

Tomorrow, we'll cover Enterprise Agents with OpenAI! What does the agents SDK look like from OpenAI? How does it build on previous work they've done? Are they officially in the end-to-end platform game competing with orchestration frameworks like LangChain, LlamaIndex, CrewAI, and others? Join us live to find out!


Last week, we discussed RAG: The 2025 Best-Practice Stack This is the year of Practical RAG, and we kicked it off by unpacking the Minimum Viable Production-Ready RAG Stack. We walked through the essential toolsβ€”LangChain’s LangGraph, QDrant, Cohere’s Rerank, RAGAS, and moreβ€”and how they fit together from retrieval to evaluation. Attendees saw firsthand how to build, baseline, and deploy a high-quality RAG app, with a ready-to-use template to jumpstart their journey. We also teed ourselves for Part II, where we'll dig even further into scaling and deployment options from cloud service providers.

🧰 Resources


πŸ”­ Coming Up!

​

Everyone is obsessed with Model Context Protocol, or MCP. Everyone is sharing the awesome-mcp-servers repo. OpenAI just adopted it. What do we really need to know about this emerging standard?

Does it really change anything for us as AI Engineers and builders? If so, does it just make everything easier?

​Join us to learn about what you need to know about MCP, and how you should think about leveraging it (or not) in production LLM applications you build in 2025 and beyond!


🌐 Around the Community!

πŸ’‘ Transformation Spotlight: Andrew White, a technology professional with 20-plus years of AI experience. Learn how he helps companies navigate the complexities of an ever-changing AI landscape. Read more!

video preview​

​

πŸ€“ See what the community is πŸ“Ή building, shipping, and sharing this week. Join us in the Lounge every Monday at 9 AM PT for some Build, Ship, Share accountability, and now every Friday at 11 AM PT for Job Search accountability!

​

Want to join the AIM community? Hop into Discord and share your intro!


​

πŸ–ΌοΈ Meme of the Week


Today, we launch Cohort 6 of The AI Engineering Bootcamp!

It's not too late to enroll, as long as you finish The AI Engineering Bootcamp Challenge first! Not the right time? Cohort 7 will kick off this summer on June 24.

​

Keep building πŸ—οΈ shipping 🚒 and sharing πŸš€,

​

​Dr. Greg, The Wiz, Seraacha, and Lusk​
​AI Makerspace​

​
​Unsubscribe Β· Preferences​

The LLM Edge

Read more from The LLM Edge
DeepSeek Week

Hey, AIM community! On Wednesday, we'll cover the infra stack that we recommend for RAG in 2025. Then, we'll build, ship, and share a best-practice RAG app. We'll also discuss important production tradeoffs and implications that you should consider before and after deployment when going from zero to production RAG! Last week, we discussed the latest open-source repo drops from DeepSeek Week, and we covered how they're being used as a new best-practice way to do inference on MoE models via...

Optimization of LLMs

Hey, AIM community! Next Wednesday, we begin a new series on Optimization of LLMs! We'll tackle an important topic from first principles: building and optimizing LLMs before they make it to production. What are the essential concepts and code that underlie the technology, from loss functions and gradient descent to LSTMs, RLHF, and GRPO? Join us to kick off our new series - which we will continue monthly - about Optimization of LLMs. Last week, we put PydanticAI to the test! πŸš€ The team behind...

Cursor: An AI Engineer’s Guide to Vibe Coding and Beyond

Hey, AIM community! Next Wednesday, we cover a new agent orchestration framework: PydanticAI. The team "built PydanticAI to bring that FastAPI feeling to Gen AI app development" because everything else out there wasn't good enough. Join Dr. Greg and The Wiz to help us assess whether or not they're accomplishing their mission as we learn to build, ship, and share a multi-agent application. Last week, we explored Cursor An AI Engineer’s Guide to Vibe Coding and Beyond. In 2025, top engineers in...