Optimizing RLHF and llamafile 0.9.2
Mar 3, 2026 · 12:58 AM · 1 min read
🔥 What's hot right now
RLHFless is optimizing RLHF training with serverless computing, achieving up to 1.35x speedup and 44.8% cost reduction. By pre-computing shared prefixes, it tackles resource variability and idle time effectively.
🚀 Just shipped
llamafile v0.9.2 is a major update adding support for DeepSeek Distil R1, Gemma 3, and IBM Granite. It also introduces LocalScore, a benchmarking utility to evaluate LLM performance on your specific hardware.
🛠 Useful for the array
AMA-Bench is a new evaluation framework for long-horizon memory in LLM agents, focusing on continuous interaction. It proposes AMA-Agent using causality graphs to identify gaps in current memory systems.