Drama at OpenAI continues … just kidding, here are some actually useful LLM/AI resources!
Week 47 of Coding with Intelligence
📰 News
Anthropic announces Claude 2.1: 200K context window, reduced hallucination (it actually learned to say "I don't know based on the provided information"), tool use (OpenAI Functions equivalent), and a new playground in the console.
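A minimal call sketch, assuming the Anthropic Python SDK's classic Text Completions endpoint of that era (the prompt content is hypothetical):

```python
# Sketch only: assumes the `anthropic` SDK's Text Completions API;
# ANTHROPIC_API_KEY is read from the environment.
from anthropic import Anthropic, HUMAN_PROMPT, AI_PROMPT

client = Anthropic()
resp = client.completions.create(
    model="claude-2.1",
    max_tokens_to_sample=300,
    prompt=f"{HUMAN_PROMPT} Summarize the key risks in the report below."
           f" If the answer isn't in the text, say you don't know. {AI_PROMPT}",
)
print(resp.completion)
```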
Stability releases Stable Video Diffusion as an open-source model
Beats Pika Labs in some benchmarks, on par with RunwayML
📦 Repos
Open alternative to OpenAI's Assistants API
Really awesome project, from the same developer who shipped a ChatGPT API before ChatGPT officially had one.
Python-based OpenAI load balancer
Use multiple OpenAI keys to increase your effective rate limits.
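A minimal sketch of the idea, not the linked repo's API: rotate requests across several keys so each draws on a separate per-key rate limit (the keys below are hypothetical placeholders).

```python
# Round-robin rotation across multiple OpenAI API keys.
import itertools
from openai import OpenAI

API_KEYS = ["sk-key-one", "sk-key-two", "sk-key-three"]  # hypothetical keys
clients = itertools.cycle([OpenAI(api_key=k) for k in API_KEYS])

def chat(prompt: str) -> str:
    client = next(clients)  # pick the next key/account in the rotation
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

print(chat("Say hello."))
```

Note that keys under the same organization typically share a limit, and a real balancer would also add retries and backoff on 429 responses.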
StyleTTS 2: towards human-level text-to-speech, by Columbia University. Samples at https://styletts2.github.io/.
Metasploit modules, Nuclei templates and CSRF templates.
Google quietly open-sourced a 1.6-trillion-parameter MoE model
Trained on only 570B tokens, the model does appear undertrained, although the MoE structure might shift the Chinchilla-optimal scaling laws.
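For a rough sense of "undertrained," apply Chinchilla's ~20-tokens-per-parameter heuristic (derived for dense models, so only indicative for an MoE where far fewer parameters are active per token):

```latex
D_{\text{opt}} \approx 20\,N = 20 \times 1.6\times10^{12}
              = 3.2\times10^{13}\ \text{tokens}
\quad\text{vs.}\quad 5.7\times10^{11}\ \text{tokens actually used}
```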
Accelerating Generative AI with PyTorch: Segment Anything, Fast
Reimplementation of Meta's Segment Anything yields an 8x speedup using PyTorch optimization features and a custom Triton kernel.
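A hedged sketch of the kinds of optimizations the post combines (bfloat16, torch.compile, and PyTorch's fused scaled_dot_product_attention), using a stand-in attention block rather than their actual SAM code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyAttention(nn.Module):
    """Stand-in block showing the fused attention call the post leans on."""
    def __init__(self, dim: int = 64):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim)

    def forward(self, x):
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Dispatches to Flash/memory-efficient kernels when available
        return F.scaled_dot_product_attention(q, k, v)

model = TinyAttention().eval()
model = model.to(torch.bfloat16)                   # lower-precision inference
model = torch.compile(model, mode="max-autotune")  # fusion + autotuned kernels

x = torch.randn(1, 16, 64, dtype=torch.bfloat16)
with torch.no_grad():
    out = model(x)
```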
Neural-Cherche: fine-tune neural search models for retrieval or ranking
Jsonformer: A Bulletproof Way to Generate Structured JSON from Language Models
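Usage sketch adapted from the repo's README (verify against the current API): Jsonformer emits the schema's fixed tokens (braces, keys) itself and only lets the model generate the values, so the output always parses.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from jsonformer import Jsonformer

# Smaller stand-in for the README's example model
model = AutoModelForCausalLM.from_pretrained("databricks/dolly-v2-3b")
tokenizer = AutoTokenizer.from_pretrained("databricks/dolly-v2-3b")

json_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "number"},
        "is_student": {"type": "boolean"},
    },
}
prompt = "Generate a person's information based on the following schema:"
jsonformer = Jsonformer(model, tokenizer, json_schema, prompt)
print(jsonformer())  # always valid JSON matching the schema
```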
📄 Papers
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
You can now ask questions about videos through open-source models. Paper: https://arxiv.org/abs/2311.10122
Stanford paper exploring DPO fine-tuning for improving SLM factuality
SLM = small language model. They report a reduction in factual errors of 40% to 58%.
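For reference, the standard DPO objective from the original DPO paper (here applied with preference pairs ranked by factuality scores rather than human labels):

```latex
\mathcal{L}_{\mathrm{DPO}}(\pi_\theta;\pi_{\mathrm{ref}})
= -\,\mathbb{E}_{(x,\,y_w,\,y_l)\sim\mathcal{D}}
\left[\log\sigma\!\left(
\beta\log\frac{\pi_\theta(y_w\mid x)}{\pi_{\mathrm{ref}}(y_w\mid x)}
-\beta\log\frac{\pi_\theta(y_l\mid x)}{\pi_{\mathrm{ref}}(y_l\mid x)}
\right)\right]
```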
LLMs cannot find reasoning errors, but can correct them
Useful paper if you're working with self-correction prompting techniques.
Proving Test Set Contamination in Black Box Language Models
Very important result: you can verify whether a test set appears in the pre-training data without access to model weights or the pre-training corpus. This can be used to check many benchmarks for contamination.
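A toy sketch of the core idea (the actual paper uses a sharded permutation test with careful statistics): under no contamination the model should be indifferent to the ordering of exchangeable test examples, so a canonical (published) ordering that scores significantly higher than shuffles is evidence of contamination.

```python
import random
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in; the paper probes much larger models
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

def seq_logprob(examples: list[str]) -> float:
    """Total log-probability of the examples concatenated in the given order."""
    ids = tok("\n".join(examples), return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)  # loss = mean NLL over shifted tokens
    return -out.loss.item() * (ids.shape[1] - 1)

test_set = ["example 1 ...", "example 2 ...", "example 3 ..."]  # canonical order
canonical = seq_logprob(test_set)
shuffled = []
for _ in range(100):
    perm = test_set[:]
    random.shuffle(perm)
    shuffled.append(seq_logprob(perm))

# Permutation-test p-value: fraction of shuffles scoring at least as high
p = (1 + sum(s >= canonical for s in shuffled)) / (1 + len(shuffled))
print(f"permutation-test p-value: {p:.3f}")
```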
Exponentially Faster Language Modelling
Very interesting conditional-activation approach (only a small fraction of feedforward neurons fire per input), opening up the potential for large inference speedups. By ETH Zurich.
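A hedged sketch of the fast-feedforward idea as I read the paper, not the authors' code: a binary tree of learned routing directions picks one small leaf MLP per input, so inference touches O(log n) of the layer's parameters (training the hard routing needs a differentiable relaxation, omitted here).

```python
import torch
import torch.nn as nn

class FastFeedforward(nn.Module):
    """Binary-tree-routed feedforward layer: each input traverses `depth`
    routing nodes, then runs through exactly one small leaf MLP."""
    def __init__(self, dim: int, depth: int, leaf_hidden: int):
        super().__init__()
        self.depth = depth
        n_nodes = 2 ** depth - 1   # internal routing nodes
        n_leaves = 2 ** depth      # leaf MLPs
        self.node_dirs = nn.Parameter(torch.randn(n_nodes, dim) / dim ** 0.5)
        self.w1 = nn.Parameter(torch.randn(n_leaves, dim, leaf_hidden) / dim ** 0.5)
        self.w2 = nn.Parameter(torch.randn(n_leaves, leaf_hidden, dim) / leaf_hidden ** 0.5)

    def forward(self, x):  # x: (batch, dim)
        idx = torch.zeros(x.size(0), dtype=torch.long, device=x.device)
        for _ in range(self.depth):
            # route left/right by the sign of a learned projection
            score = (x * self.node_dirs[idx]).sum(-1)
            idx = 2 * idx + 1 + (score > 0).long()
        leaf = idx - (2 ** self.depth - 1)  # node index -> leaf index
        h = torch.relu(torch.einsum('bd,bdh->bh', x, self.w1[leaf]))
        return torch.einsum('bh,bhd->bd', h, self.w2[leaf])

layer = FastFeedforward(dim=64, depth=4, leaf_hidden=32)
out = layer(torch.randn(8, 64))  # visits 4 of 15 nodes and 1 of 16 leaves per input
```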
🛠️ Products
📚 Resources
Multiple documents reveal significant limitations of OpenAI's Assistants API for RAG
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Great getting-started guide from Zapier AI lead on building LLM apps & getting to production
Open Source LLM capability benchmark by LlamaIndex
Categories: Basic Query Engines, Router Query Engine, SubQuestion Query Engine, Text2SQL, Pydantic Programs, Data Agents
Want more? Follow me on Twitter! @ricklamers