SaladCloud Blog

INSIDE SALAD

Flux.1-Dev benchmark: 992 images per dollar on SaladCloud

Maksim Gorkii

Flux.1-Dev: An Introduction

Flux.1 is a new series of text-to-image models from Black Forest Labs that has set the new standard in quality and prompt adherence, and it can even render legible text. The Flux.1-Dev version of the model – a 12 billion parameter rectified flow transformer – generates high quality images in about 20 steps, and is released under a non-commercial license. In this benchmark, we measure the speed and cost performance of this new model on SaladCloud.

* An earlier benchmark of Flux.1-Schnell delivered 5243 images per dollar on Saladcloud.

Benchmark Design

The benchmark was conducted using k6, a modern load testing tool from Grafana Labs, to simulate a gradually increasing load from 7 to 12 virtual users over approximately 1.9 hours. See the exact configuration in GitHub. The test environment consisted of a container group on SaladCloud with 8-10 replicas (most commonly running 9 replicas).

An example of an image generated in the benchmark

Each virtual user submitted continuous consecutive image generation requests to the container group, and response time and failures were measured. Image generation requests consisted of 20 steps at a resolution of 1024×1024.

Each node was configured with

  • NVIDIA RTX 4090 (24GB)
  • 4 vCPUs
  • 30GB Ram
  • 1GB ephemeral storage
Flux.1-dev benchmark design on SaladCloud

Deploying Flux.1-Dev on SaladCloud

To reproduce this benchmark, deploy the “Flux.1-Dev (FP8) – ComfyUI API” recipe from the “Create container group” page in the SaladCloud portal. Set the priority to “Batch” to optimize for cost-effectiveness.

Benchmark results

Conclusion

Deploying Flux.1-Dev on RTX 4090 (24 GB) GPUs on SaladCloud (batch priority) delivers 992 images per dollar. As with other text-to-image models on SaladCloud, deploying Flux.1-Dev in production results in more inferences per dollar and significant cost savings. Cost per image is just $0.00101.

The Flux1-Dev model demonstrates impressive stability and efficiency in this benchmark. With a 99.78% success rate and consistent response times averaging under 18 seconds, the model can readily be used for production deployments. The system showed moderate scalability, maintaining performance as virtual users increased from 7 to 12, with peak throughput achieved at 10 VUs. Under sudden spikes of traffic, an increased timeout and error rate should be expected.

Interested in free credits to try SaladCloud for Image Generation? Contact our support team today.

Have questions about enterprise pricing for SaladCloud?

Book a 15 min call with our team.

Related Blog Posts

Salad will become a Render Subnet, Salad and Render Partnership

RNP-023 Approved: Salad Is Joining the Render Network

It's official. RNP-023 has passed the community vote, and Salad will now become an exclusive subnet on the Render Network. A few weeks ago we shared our proposal to fully...
Read More

Use Cline with SaladCloud: Building Real Apps for Under $0.01

At SaladCloud, we've been working on easy-to-deploy recipes designed to cover most agentic use cases out of the box. When you run LLMs on Salad, you're not worried about token...
Read More

Salad Proposes Integration with the Render Network

I’m excited to share that Salad has submitted a formal proposal alongside the Render Network Foundation to become a subnet on the Render Network. This would involve fully transitioning our...
Read More

Don’t miss anything!

Subscribe To SaladCloud Newsletter & Stay Updated.