๐Ÿ—๐Ÿšข๐Ÿš€ Virtual LLM


โ€‹

๐Ÿ‘‹ Hey, AIM community!

โ€‹

Dr. Greg and the Wiz will go on-prem with LangGraph next week! Join us for our last YouTube Live event before the New Year ๐ŸŽ†!

โ€‹


Last Wednesday, Dr. Greg and The Wiz guest spoke with Malikeh from Arcee on the SLM Show about the year in summary at the LLM Edge, and what to expect in 2025!

We also explored vLLM! We learned that Virtual LLM helps us relieve memory bottlenecks when serving LLMs through PagedAttention, just like Virtual Memory relieves memory bottlenecks in operating systems through paging. We also discussed alternatives and saw how vLLM works with best-practice tools like FlashAttention2 and Activation-aware Weight Quantization.

โ€‹

๐Ÿงฐ Resources


๐Ÿ”ญ Coming Up!

On-Prem Agents with LangGraph

Join us to build, ship, and share our last YouTube Live event of the year next Wednesday, December 18! Get ready for On-Prem Agents with LangGraph and LangServe, and get your questions answered live by Dr. Greg and The Wiz!

๐ŸŒ Around the Community!

๐Ÿ’ก Transformation Spotlight: David Stampfli, a tech veteran with over 30 years of experience, is now an AI Engineer. Listen as David shares his thoughts on how to stay relevant in 2025 and beyond. Read more here!โ€‹

video previewโ€‹

๐Ÿค“ See what the community is building, shipping, and sharing this week. Join us in the Lounge every Monday at 9 AM PT for some accountability!

โ€‹

Want to join the AIM community? Hop into Discord and share your intro!



๐Ÿ–ผ๏ธ Meme of the Week


Keep building ๐Ÿ—๏ธ shipping ๐Ÿšข and sharing ๐Ÿš€,

โ€‹

โ€‹Dr. Greg, The Wiz, Seraacha, and Luskโ€‹
โ€‹AI Makerspaceโ€‹

โ€‹
โ€‹Unsubscribe ยท Preferencesโ€‹

The LLM Edge

Read more from The LLM Edge

Hey, AIM community! Join Dr. Greg and The Wiz as they cover Large Reasoning Models next Wednesday, Jan 18. With the release of o1, o3, and the new Gemini, everyone is talking about Chain-of-Thought Reasoning and Test-Time Compute. What are these things, anyway? And what are the implications for building production LLM applications with models like this in 2025 and beyond? Join the discussion live on Wed. at 10 AM PT! Last week, we dove into the latest RAG Evaluation metrics and RAGAS...

๐Ÿ‘‹ Hey, AIM community! As we near the end of 2024, our team is looking back at all we've accomplished as a community this year. Thanks to all of you for learning ๐Ÿ“š, building ๐Ÿ—, shipping ๐Ÿšข, and sharing ๐Ÿš€ with us at the open-source LLM Edge! We'll be rooting for you to take your AI career to the next level in 2025, and when you do, we hope you'll lean on us to amplify your story and showcase your best work. In this way, you'll help the AI Makerspace community achieve its mission of becoming the...

๐Ÿ‘‹ Hey, AIM community! Dr. Greg and the Wiz will unlock vLLM for you next week with a full breakdown of "Easy, fast, and cheap LLM serving for everyone." Last Wednesday, we explored AG2: AutoGen, Evolved with co-creator Qingyun Wu. The origin story was fascinating - from MathChat to going viral! AutoGen is all about conversations - which effectively constitute reasoning - by going full send on messages. ๐Ÿงฐ Resources ๐Ÿง‘๐Ÿซ Concepts: Slides ๐Ÿง‘๐Ÿ’ป Code: CaptainAgent Notebook ๐Ÿ“œ Paper: AutoGen The AutoGen...