Cumulus
Homepage
Docs
←Back to all articles
Tag

#reinforcement-learning

1 article tagged with "reinforcement-learning"

#inference4#model-hosting4#serverless-gpu3#cuda2#grace-hopper2#gpu2#pricing2#gpu-cloud2#reinforcement-learning1#fine-tuning1#visual-generation1#pipeline1#mamba1#qwen3.51#linear-attention1
March 11, 20267 min read

SFT and Online RL for Visual Generation: How We Built CoSprite's Training Pipeline

How Cumulus built a production pipeline for consistent AI-generated game previews using best-of-N sampling, deterministic rendering, pairwise judging, supervised fine-tuning, and online reinforcement learning with GRPO.

inferencereinforcement-learningfine-tuning+3
Read article

Cumulus Labs

© 2026 Cumulus Compute Labs Corporation. All rights reserved.