An open-source library that makes LLM fine-tuning 2× faster and cuts VRAM use by 70%.
Demand for local fine-tuning is surging; several open models shipped same-day support, and weekly new contributors hit a record.
Sits on the most cost-sensitive curve — GPU spend. If a cloud vendor integrates it, distribution scales exponentially.