Question 1

What is the Sonic Inference Engine®?

Accepted Answer

A fully custom hardware and software stack, built specifically for AI inference. We tune it from BIOS and kernel up, run models on hardware we own, and keep them preloaded across regions. Because we operate the engine end to end, you get higher throughput and lower latency than generic cloud GPUs, at lower cost.

Question 2

What does “one API for all AI” mean in practice?

Accepted Answer

One auth, one endpoint, one bill. Switching model is a string change. Image, video, audio, 3D, vision and LLM chat all hit the same POST /v1. Connect over REST, or use WebSockets for persistent connections. If you have your own weights, you can upload LoRAs, checkpoints, safetensors and LyCORIS through Model Upload.

Question 3

How does Runware's pricing work?

Accepted Answer

Pay per request. No subscriptions, no commitments. Open-source models are billed on optimised compute time, so faster generations automatically cost less. Closed-source and partner models are fixed per-request, at rates we negotiate down with the provider. In practice, open-source typically runs up to 10× cheaper and 40% faster than other providers; closed-source lands 10–40% below the published rates thanks to bulk-execution. The slider above is live against competitor pricing.

Question 4

What models does Runware support?

Accepted Answer

400K+ open and proprietary models, including Flux 2, Veo 3.1, Kling, Seedance, ElevenLabs, Claude Opus 4.7, GPT-5.4, Llama, Qwen and DeepSeek, plus a wide range of vision models. You can also bring your own LoRAs, checkpoints, safetensors and LyCORIS through Model Upload. Nothing is artificially restricted for speed, and there's no hidden caching that alters outputs.

Question 5

Can I use Runware for commercial projects?

Accepted Answer

Official models on Runware include commercial usage rights under our partner agreements, so you can use leading models in production without worrying about separate licence fees. For community models, commercial use is governed by the licence published by the model creator. We always link to the source so the terms are easy to review.

Question 6

Is my data private and secure?

Accepted Answer

Inputs and outputs are never used for training. Encrypted in transit. SOC 2 and ISO 27001 certified, GDPR aligned. Generated content is automatically purged from our servers unless you explicitly opt in to storage. Your data always belongs to you and is never reused, resold, or used for any other purpose.

Question 7

What SLA and support do you offer?

Accepted Answer

Enterprise plans come with managed infrastructure, dedicated capacity, custom SLAs, priority routing, volume-based pricing, 24/7 on-call engineering, and a shared support channel. Contact sales to scope it.

Question 8

Can I test it before committing?

Accepted Answer

Yes. $2 in credits when you sign up with a business email, no card required. That's hundreds to thousands of generations at our per-request pricing. Need more to evaluate? Get in touch.

One API for all AI.
We run infra
while you ship.

All AI models

Lowest cost

Instant scale

Any use case. Any task.

Image generation & editing

How much would you save?

Model collections.
Pick your task.

SOTA Models

Best Image Models

Best Video Models

Best Audio Models

Best 3D Models

Best LLM Models

Request → Route → Optimize → Execute.

Requests

Runware API

Orchestration

Inference Pods

The stack you already ship on.

Ship to millions of users in days, not months.

FAQ

One API for all AI.We run infrawhile you ship.

The AI inference platform, by the numbers