Pay as you go
DefaultPer-token billing with no commit. Top up credits or run invoiced billing once you hit consistent volume.
Pricing
No subscriptions, no minimums, no seat fees. Pay only for the tokens you use. Rates scale with model tier, and volume discounts kick in automatically.
| Model | Input / 1M | Output / 1M | Context |
|---|---|---|---|
| wlfv-v1-flash | $0.12$0.12 | $0.35$0.35 | 128K |
| wlfv-v1-code | $0.18$0.18 | $0.45$0.45 | 200K |
| wlfv-v1-pro | $0.40$0.40 | $0.90$0.90 | 262K |
All prices in USD. No minimum spend. Volume rates (20% off) apply automatically above 50M tokens/month across your whole account.
Rate cards
Start pay-as-you-go and graduate to volume or batch rates as you scale. The same rate card underpins every option.
Per-token billing with no commit. Top up credits or run invoiced billing once you hit consistent volume.
Cross 50M tokens in a billing month and every rate drops 20% for the rest of the period. No contract needed.
Submit large, non-interactive jobs with up to 24h turnaround. Same models and quality at half the rate.
Cost examples
Token counts vary by workload, but the math is simple: input tokens × input rate + output tokens × output rate.
1,500 input tokens (prompt + history) and 300 output tokens for a typical assistant reply.
4,000 input tokens (file context) and 1,200 output tokens for a refactored function with comments.
20,000 input tokens (long document) and 1,500 output tokens for a structured summary and answers.
FAQ
The short version: you pay for tokens, you can switch models anytime, and there are no hidden fees.
Per token. Input and output are billed separately at each model's rate, metered to the exact count — not rounded estimates. Short requests cost cents, not dollars.
Yes. Change the model field in your request and billing follows automatically. No SDK change, no plan migration, no downtime.
Aggregated across every model and project on your account within a calendar month. Hit 50M tokens and the 20% discount kicks in for the rest of that month.
Waitlist members get a launch-period credit to try every model. After that, pay-as-you-go starts at $0 with no minimum spend and no seat fees.
Set per-project budgets and hard limits. We stop accepting requests when you hit your cap — no overruns, ever. Invoices break down usage by model and project.
For dedicated capacity, custom SLAs, or committed-use discounts, reach out after joining the waitlist. Same rate card, tailored terms.
These rates take effect the moment WLFV AI opens. Join the waitlist to lock in early access and a launch-period credit to test every model tier.