Simulating entire worlds using Diffusion Models: GameNGen

Aug 29, 2024

📰 News

📦 Repos

Liger-Kernel Medusa head training
Created by the LinkedIn engineering team.
Salesforce releases more "Large Action Models" on HF
MoE, 32k context. Why these are not called function calling models beats me. Still doesn't allow commercial use.

📄 Papers

📚 Resources

Ape - your first prompt engineer
Anthropic is leaning into prompt generation support in their console; this might be the open source version of that.
Berkeley Function Calling Leaderboard V2
Looks promising! I’m taking a closer look next week, more to follow…
Llama 3.1 8B (quantized) with 128k context in 18.9GB VRAM
[Video] MLSys conference talk about AI systems by Jeff Dean
Cross-Architecture Distillation Part I - The MOHAWK Framework (Transformer->SSM)
[Video] Interview with Jürgen Schmidhuber – the father of generative AI
Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)
Hotshot: 4-person team creates Sora/Luma alternative
Anthropic adds system prompts to docs
Love how they’re like “you can prompt jailbreak extract them anyway, let’s own that they are public” instead of pretending they’re invisible to users.
GPUd by Lepton AI
OpenDevin rebrands to OpenHands
The Devin competitor (the OSS team is now a company too).
Llama 3.1 405b bf16 base model
Some AI sommeliers apparently really dig what you can get a bf16 precision Llama 3.1 405b base model to do. If you find cool examples, please share them (as a GitHub gist) and I'll promote them! Hosting courtesy of Hyperbolic.
Local Speech-to-Speech models
By the excellent Trelis Research

Want more? Follow me on X! @ricklamers

Coding with Intelligence