🏗🚢🚀 Virtual LLM

👋 Hey, AIM community!

Dr. Greg and The Wiz will go on-prem with LangGraph next week! Join us for our last YouTube Live event before the New Year 🎆!

Last Wednesday, Dr. Greg and The Wiz joined Malikeh from Arcee as guests on the SLM Show to recap the year at the LLM Edge and talk about what to expect in 2025!

We also explored vLLM! We learned that vLLM (short for Virtual LLM) relieves memory bottlenecks when serving LLMs through PagedAttention, which stores the attention key-value (KV) cache in fixed-size blocks, much as virtual memory relieves memory bottlenecks in operating systems through paging. We also discussed alternatives and saw how vLLM works with best-practice tools like FlashAttention-2 and Activation-aware Weight Quantization (AWQ).
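
To make the paging analogy concrete, here is a toy sketch of the bookkeeping behind it. This is not vLLM's code or API: `BlockAllocator`, `Sequence`, and `BLOCK_SIZE` are hypothetical names made up for illustration, and no real tensors are involved. It only shows how a per-sequence block table can map token positions to fixed-size blocks drawn from a shared pool, so memory is claimed one block at a time rather than reserved up front for the maximum length.

```python
from dataclasses import dataclass, field

BLOCK_SIZE = 4  # tokens per block; an illustrative value, not a vLLM default


class BlockAllocator:
    """Hypothetical pool of fixed-size physical blocks shared by all sequences."""

    def __init__(self, num_blocks: int) -> None:
        self.free_blocks = list(range(num_blocks))

    def allocate(self) -> int:
        if not self.free_blocks:
            raise MemoryError("no free blocks left in the pool")
        return self.free_blocks.pop()

    def release(self, block_id: int) -> None:
        self.free_blocks.append(block_id)


@dataclass
class Sequence:
    """Hypothetical request whose cache grows one block at a time."""

    allocator: BlockAllocator
    num_tokens: int = 0
    block_table: list[int] = field(default_factory=list)  # logical -> physical

    def append_token(self) -> None:
        # Claim a new physical block only when the current one is full.
        if self.num_tokens % BLOCK_SIZE == 0:
            self.block_table.append(self.allocator.allocate())
        self.num_tokens += 1

    def locate(self, token_index: int) -> tuple[int, int]:
        # Translate a token position into (physical block, offset in block).
        if not 0 <= token_index < self.num_tokens:
            raise IndexError("token index out of range")
        return self.block_table[token_index // BLOCK_SIZE], token_index % BLOCK_SIZE

    def free(self) -> None:
        # Return every block to the pool once the request finishes.
        for block_id in self.block_table:
            self.allocator.release(block_id)
        self.block_table.clear()
        self.num_tokens = 0


allocator = BlockAllocator(num_blocks=8)
seq_a, seq_b = Sequence(allocator), Sequence(allocator)
for _ in range(5):
    seq_a.append_token()  # 5 tokens need 2 blocks
for _ in range(3):
    seq_b.append_token()  # 3 tokens need 1 block
```

In this sketch the two sequences hold three blocks between them and the other five stay free for new requests. The blocks a sequence owns do not need to be contiguous in the pool, which is the same trick paging uses to avoid fragmentation.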

🧰 Resources

🧑‍🏫 Concepts: Slides
🧑‍💻 Code: vLLM-Event-AIM
📜 Blog: vLLM Blog

🔭 Coming Up!

On-Prem Agents with LangGraph

Join us next Wednesday, December 18, to build, ship, and share at our last YouTube Live event of the year! Get ready for On-Prem Agents with LangGraph and LangServe, and get your questions answered live by Dr. Greg and The Wiz!

RSVP

🌐 Around the Community!

💡 Transformation Spotlight: David Stampfli, a tech veteran with over 30 years of experience, is now an AI Engineer. Listen as David shares his thoughts on how to stay relevant in 2025 and beyond. Read more here!