📰 News
Launch of industry consortium: AI Alliance
Notable participants: Meta, IBM, AMD, ServiceNow, Hugging Face, Dell and Intel. The k8s moment for AI?
Beijing Academy of Artificial Intelligence releases Aquila2-70B
They also released `bge`, the highest-scoring embedding model on the MTEB leaderboard. A minimal sketch below of computing sentence embeddings with one of the published `bge` checkpoints via `sentence-transformers` (the exact model id is illustrative and may differ from the release referenced here).
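```python
from sentence_transformers import SentenceTransformer

# Model id is one of the published bge checkpoints; adjust as needed.
model = SentenceTransformer("BAAI/bge-large-en-v1.5")

sentences = [
    "Aquila2-70B was released by BAAI.",
    "BAAI released a 70B-parameter language model.",
]
embeddings = model.encode(sentences, normalize_embeddings=True)

# With normalized vectors, cosine similarity is just a dot product.
print(embeddings.shape, embeddings[0] @ embeddings[1])
```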
📦 Repos
80% faster QLoRA LLM fine-tuning using custom Triton kernels
Created by an ex-NVIDIA intern. How many VCs have emailed Daniel is left as an exercise for the reader. For context, the sketch below shows the vanilla QLoRA setup (4-bit base weights via `bitsandbytes`, LoRA adapters via `peft`) that the repo's custom Triton kernels accelerate; the model id and hyperparameters are illustrative, not taken from the repo.
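```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize base weights to 4-bit
    bnb_4bit_quant_type="nf4",              # NormalFloat4, per the QLoRA paper
    bnb_4bit_compute_dtype=torch.bfloat16,  # dequantize to bf16 for matmuls
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",             # illustrative model id
    quantization_config=bnb_config,
)

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],    # adapt only attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()          # only the LoRA adapters train
```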
Apple launches Apple Silicon-specific PyTorch/JAX alternative
If you're building specifically for Apple platforms, there are probably gains to be had from using their framework. Assuming this refers to MLX, here is a tiny sketch of its NumPy-like, JAX-flavored API: arrays live in Apple Silicon's unified memory, and computation is lazy until forced.
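```python
import mlx.core as mx

def loss(w, x, y):
    # mean squared error of a linear model
    return mx.mean((x @ w - y) ** 2)

x = mx.random.normal((64, 8))
w = mx.random.normal((8,))
y = mx.random.normal((64,))

grad_fn = mx.grad(loss)  # JAX-style function transformation
g = grad_fn(w, x, y)     # builds a lazy compute graph
mx.eval(g)               # materializes the result
print(g.shape)
```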
📄 Papers
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
A structured state space model (SSM) approach that outperforms Transformers on various downstream tasks (language benchmarks like HellaSwag, DNA sequence modeling, audio generation). One of the co-authors also invented FlashAttention. Importantly, this model scales linearly in sequence length. To make that concrete, see the toy selective SSM recurrence below: a single O(L) scan where the B/C projections depend on the input (the "selection" mechanism). It omits the paper's discretization step and hardware-aware scan, so it's purely illustrative.
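```python
import numpy as np

def selective_ssm(x, A, W_B, W_C):
    """Single O(L) scan; h is a (d, state_dim) per-channel diagonal state."""
    L, d = x.shape
    state_dim = W_B.shape[1]
    h = np.zeros((d, state_dim))
    ys = np.empty_like(x)
    for t in range(L):                    # one linear pass over the sequence
        B_t = x[t] @ W_B                  # selection: B depends on the input
        C_t = x[t] @ W_C                  # selection: C depends on the input
        h = A * h + np.outer(x[t], B_t)   # decaying diagonal state update
        ys[t] = h @ C_t                   # input-dependent readout
    return ys

x = np.random.randn(1024, 32)
A = 0.9 * np.ones(16)                     # stable diagonal transition
W_B = 0.01 * np.random.randn(32, 16)
W_C = 0.01 * np.random.randn(32, 16)
print(selective_ssm(x, A, W_B, W_C).shape)  # (1024, 32)
```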
📱 Demos
3D LLM visualization in your browser
This is really cool! And educational if you're not too familiar with the Transformer architecture.
📚 Resources
Thought piece about Agents using tools through a marketplace of tools
Reminds me of https://agentprotocol.ai/
Extensive Vector DB Feature Matrix in a Google Sheet
From this LinkedIn thread
Overview of recent progress in Instruction Tuning
From a PhD student at Princeton. Expect a bunch of pointers for diving deeper and a general survey of what's considered SotA. For reference, a minimal sketch below of the (instruction, input, output) record format commonly used in instruction tuning, rendered into a single training string; the template is illustrative, not taken from the survey.
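```python
# Hypothetical record; the field names mirror the common Alpaca-style format.
example = {
    "instruction": "Summarize the text in one sentence.",
    "input": "Mamba is a selective state space model with linear-time inference.",
    "output": "Mamba is a linear-time selective SSM.",
}

TEMPLATE = (
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

# The model is trained to continue the prompt with the target output.
training_text = TEMPLATE.format(**example) + example["output"]
print(training_text)
```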
Making Llama inference fast with PyTorch: 25 tok/s to 107 tok/s
If you allow for some quality degradation (int4 weights), you can even get to 244 tok/s. Cool post about using `torch.compile` to maximum advantage. Blog post. The core trick, sketched below with Hugging Face `transformers` as a stand-in (the post itself uses a pure-PyTorch implementation): compile the forward pass so the latency-bound decode loop runs as fused kernels with CUDA graphs.
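```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # illustrative; the post uses Llama-7B
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16
).cuda().eval()

# "reduce-overhead" enables CUDA graphs, which matter most when decoding
# one token at a time; expect some warm-up recompiles on dynamic shapes.
model.forward = torch.compile(model.forward, mode="reduce-overhead")

inputs = tok("PyTorch can be fast:", return_tensors="pt").to("cuda")
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=32)
print(tok.decode(out[0], skip_special_tokens=True))
```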
Want more? Follow me on Twitter! @ricklamers