PerFit Scaling Demo
Cloudflare
USR: End user sending a try-on request.
MOB/WEB: Mobile app or web browser client.
KEYGUARD (401 Block): API key validation; rejects invalid keys with 401 before they reach the origin.
APIGW: API Gateway; 1 server routing requests to backend services.
TRY-ON: Try-On API; 1 server running the virtual fitting logic in parallel.
IMGGEN (3 servers, 2.8/s × 1s each): Image generation. Each server takes 1 s per image, so 2.8 images/sec × 1 s per image = 2.8 busy servers, rounded up to 3.
VIDGEN (216 servers, 1.2/s × 3min each): Video generation. Each server takes 3.0 min (180 s) per clip and clips arrive at 1.2/sec, so 1.2 × 180 = 216 servers must be running at all times.
R2CACHE: Cloudflare R2 object store; caches generated images and videos by jobId.
EVTBUS: Events bus; aggregates structured events from all services (Kafka / Kinesis / Pub/Sub).
ANALYTICS: Analytics store; time-series metrics, distributed traces, structured logs.
DASHBOARD: Mock dashboard metrics.

medium · 10 req/s · 1 region
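The image and video server counts follow from multiplying the arrival rate by the per-job service time and rounding up (Little's law). A minimal sketch of that arithmetic, assuming steady arrivals and no utilization headroom; the function name `servers_needed` is hypothetical and not part of the demo:

```python
import math


def servers_needed(arrival_rate_per_s: float, service_time_s: float) -> int:
    """Average number of busy servers (rate x service time), rounded up.

    Assumption: each server handles one job at a time and arrivals are steady.
    This is a floor; a real deployment would add headroom for bursts.
    """
    busy = arrival_rate_per_s * service_time_s
    # Round first so floating-point noise cannot push ceil() one server too high.
    return math.ceil(round(busy, 9))


image_servers = servers_needed(2.8, 1.0)       # 2.8 images/s, 1 s per image
video_servers = servers_needed(1.2, 3.0 * 60)  # 1.2 clips/s, 3.0 min per clip
print(image_servers, video_servers)            # 3 216
```

The same function shows why video dominates the fleet: a 180x longer service time needs far more concurrent servers even at a lower arrival rate.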
Cost breakdown / hr
Edge / CDN: $0.0007 (0%)
API compute: $0.0477 (0%)
Queue: $0.0058 (0%)
DB: $0.0213 (0%)
Replication: $0.0000 (0%)
Image servers: $302.40 (22%)
Video servers: $1.1k (78%)
TOTAL INFRA: $1.4k (100%)
Unit economics
Infra cost / request: $0.0960
Revenue / request: $0.0880
Gross margin: -9.1% ($-0.008005)

At 10 req/s, revenue is $3.2k/hr against $1.4k/hr infra cost.
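The margin and hourly revenue figures can be reproduced from the per-request values. A small sketch, assuming gross margin is expressed as a fraction of revenue (this matches the displayed -9.1%) and that load is a constant 10 req/s; the variable and function names are illustrative only:

```python
def gross_margin(revenue: float, cost: float) -> float:
    """Gross margin as a fraction of revenue (assumed definition)."""
    return (revenue - cost) / revenue


revenue_per_request = 0.0880     # from the unit economics panel
infra_cost_per_request = 0.0960  # from the unit economics panel

margin = gross_margin(revenue_per_request, infra_cost_per_request)
print(f"{margin:.1%}")  # -9.1%

requests_per_hour = 10 * 3600    # 10 req/s sustained for one hour
hourly_revenue = requests_per_hour * revenue_per_request
print(f"${hourly_revenue:,.0f}/hr")  # $3,168/hr, shown as $3.2k/hr
```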

Cost vs load, all providers (chart)