πŸ—πŸš’πŸš€ RAG Evaluation


​

Hey, AIM community!

​

Join Dr. Greg and The Wiz as they cover Large Reasoning Models next Wednesday, Jan 18. With the release of o1, o3, and the new Gemini, everyone is talking about Chain-of-Thought Reasoning and Test-Time Compute. What are these things, anyway?

And what are the implications for building production LLM applications with models like this in 2025 and beyond? Join the discussion live on Wed. at 10 AM PT!

​


Last week, we dove into the latest RAG Evaluation metrics and RAGAS framework updates for building, shipping, and sharing production RAG applications in 2025!

🧰 Resources


πŸ”­ Coming Up!

Agent Evaluation with RAGAS

From RAG to Agents with RAGAS - it is 2025 after all! There are some interesting new metrics for assessing how agents use tools to solve problems. How can you ensure that your apps stay focused on the topic at hand while calling the right tools to achieve the goal? Find out live with us!

Multimodality with Llama 3.2

Llama 3.2, Meta's first multimodal model, combines vision and text to tackle tasks like image captioning and document understanding. See how these models differ from text-only LLMs, from training to application. Learn how to harness Llama 3.2 for your workflows and projects.


🌐 Around the Community!

πŸ’‘ Transformation Spotlight: Vincent Kienzler, entrepreneur, CTO, and venture builder, has significant experience building companies, technical teams, and developing gen AI applications. Learn more about his journey!​

video preview​

πŸ€“ See what the community is building, shipping, and sharing this week. Join us in the Lounge every Monday at 9 AM PT for some accountability!

​

Want to join the AIM community? Hop into Discord and share your intro!


​

AIE Cohort 5 Starts Jan. 14!

A few days remain to apply, complete the challenge, and grab a spot! Check out the schedule to learn more!


​

πŸ–ΌοΈ Meme of the Week


🌟 Want to start building, shipping, and sharing but unsure how to begin? Check out our LLM Foundations - a 5-day email-based course to help you understand the inside of LLMs.

​

Keep building πŸ—οΈ shipping 🚒 and sharing πŸš€,

​

​Dr. Greg, The Wiz, Seraacha, and Lusk​
​AI Makerspace​

​
​Unsubscribe Β· Preferences​

The LLM Edge

Read more from The LLM Edge
CODEX

Hey, AI Makerspace community! Tomorrow, we're hosting a special event entitled "What is an Agent?" We've thought deeply about this problem for years, and we were inspired by the recent Matter of OpenAI vs. LangGraph. We will break down all of the definitions that different industry players are using (including our own!), and have a rich discussion about what categories of agents matter most to think about today, and how we expect this to evolve in a world of agentic fabrics, MCP, and...

The Llama 4 Herd

Hey, AI Makerspace community! This Wednesday, we'll cover the new release from OpenAI: Codex CLI, a lightweight coding agent that runs in your terminal. Carrying the same name as the original Codex models released in 2021, this new open-source tooling is a CLI-based coding agent that bears little resemblance to the model that originally powered GitHub Copilot. Join us to understand coding agents and best practices for their use in 2025 πŸ” Coming Up: What is an Agent? OpenAI just released their...

RAG: The 2025 Best Practice Stack

Hey, AIM community! Tomorrow, we'll cover Enterprise Agents with OpenAI! What does the agents SDK look like from OpenAI? How does it build on previous work they've done? Are they officially in the end-to-end platform game competing with orchestration frameworks like LangChain, LlamaIndex, CrewAI, and others? Join us live to find out! Last week, we discussed RAG: The 2025 Best-Practice Stack This is the year of Practical RAG, and we kicked it off by unpacking the Minimum Viable...