๐Ÿ—๐Ÿšข๐Ÿš€ FA2: Flash Attention


โ€‹

โ€‹

๐Ÿ‘‹ Hey, AIM community!

โ€‹

Next Wednesday, Dr. Greg & The Wiz ๐Ÿช„ will explore the concepts and code behind On-Prem Agentic RAG!


Last Wednesday, they explored FA2: Next-Level Attention. They dug all the way down into the "shadow of the warp groups" on GPU hardware. It was epic. S/o to @Allan Tan with the awesome community recap.

โ€‹

๐Ÿงฐ Resources

โ€‹


๐Ÿ”ญ Coming Up!

AG2: AutoGen, Evolved

December 4, 2024

The co-creators of AutoGen have officially launched AG2. New features just dropped this week that we'll check out, including SwarmAgent and CaptainAgent! We'll explore both live!

vLLM: Virtual LLM

December 11, 2024

vLLM is for efficient inference AND serving. A great way to think about it is that while vLLM is building the racecar, FlashAttention enhances the engine and Quantization provides the light-weight, high-performance tires.

๐ŸŒ Around the Community!

๐Ÿ’ก Transformation Spotlight: Hear about how Tshwanelo took the opportunity to serve as a thought leader and "grab it with both hands" for a corporate investment bank after learning AI Engineering.

video previewโ€‹

โ€‹

๐ŸŒ Check out what the AIM community is building, shipping, and sharing!

๐Ÿค“ See what the community is building, shipping, and sharing this week. Join us in the Lounge every Monday at 9 AM PT for some accountability!

โ€‹

Want to join the AIM community? Hop into Discord and share your intro now!



๐Ÿ–ผ๏ธ Meme of the Week


Keep building ๐Ÿ—๏ธ shipping ๐Ÿšข and sharing ๐Ÿš€,

โ€‹

โ€‹Dr. Greg, The Wiz, Seraacha, and Luskโ€‹
โ€‹AI Makerspaceโ€‹

โ€‹
โ€‹Unsubscribe ยท Preferencesโ€‹

The LLM Edge

Read more from The LLM Edge

Hey, AIM community! Join Dr. Greg and The Wiz as they cover Large Reasoning Models next Wednesday, Jan 18. With the release of o1, o3, and the new Gemini, everyone is talking about Chain-of-Thought Reasoning and Test-Time Compute. What are these things, anyway? And what are the implications for building production LLM applications with models like this in 2025 and beyond? Join the discussion live on Wed. at 10 AM PT! Last week, we dove into the latest RAG Evaluation metrics and RAGAS...

๐Ÿ‘‹ Hey, AIM community! As we near the end of 2024, our team is looking back at all we've accomplished as a community this year. Thanks to all of you for learning ๐Ÿ“š, building ๐Ÿ—, shipping ๐Ÿšข, and sharing ๐Ÿš€ with us at the open-source LLM Edge! We'll be rooting for you to take your AI career to the next level in 2025, and when you do, we hope you'll lean on us to amplify your story and showcase your best work. In this way, you'll help the AI Makerspace community achieve its mission of becoming the...

๐Ÿ‘‹ Hey, AIM community! Dr. Greg and the Wiz will go on-prem with LangGraph next week! Join us for our last YouTube Live event before the New Year ๐ŸŽ†! Last Wednesday, Dr. Greg and The Wiz guest spoke with Malikeh from Arcee on the SLM Show about the year in summary at the LLM Edge, and what to expect in 2025! We also explored vLLM! We learned that Virtual LLM helps us relieve memory bottlenecks when serving LLMs through PagedAttention, just like Virtual Memory relieves memory bottlenecks in...