POST https://api.wiro.ai/v1/Run/openai/sora-2
const request = {
"prompt": "A woman drives along a quiet countryside road at sunset, golden light reflecting on her face through the car window. The engine hums softly, trees blur past. She sighs and says quietly: 'I wonder if he's watching this same sunset.' Acoustic guitar plays gently in the background, natural camera handheld movement.",
}



Google's Gemini 3 Pro Image Preview, also known as Nano Banana, model for text-to-image and image-to-image generation.

Qwen Image Edit Plus powered by Pruna AI

wan-image-small is a highly optimized, resource-efficient AI model that rapidly generates high-quality, cinematic images from text using the Pruna AI framework.

Pruna AI P-Image-Edit Model

Pruna AI P-Image Model

Combine text prompts with reference images to create new variations. Blend styles, concepts, and visual elements.

Modify existing images using text instructions. Upload an image and describe the changes you want to make.

Combine text prompts with reference images to create new variations. Blend styles, concepts, and visual elements.

Modify existing images using text instructions. Upload an image and describe the changes you want to make.

Generate images from text descriptions. Perfect for creating original artwork, illustrations, and visual content from your imagination.

wiro/camera-angle-editor is an advanced AI tool that instantly changes the camera perspective and angle of any existing image. Leveraging sophisticated spatial reconstruction, it eliminates the need for reshoots by synthesizing photorealistic new viewpoints, making it the fastest way for creators to maximize the versatility of their visual content.

Save time and production costs with AI Product Photoshoot. Generate polished product images featuring adaptive lighting, varied angles, and contextual scenes. Ideal for online stores, marketing teams, and agencies looking to accelerate content creation with consistent, high-quality visuals.

Integrate the Wiro Virtual Try-On API to deliver hyper-realistic apparel fitting directly in your web, mobile, or SaaS platform. Generate lifelike visuals of users wearing new garments with precise texture mapping, pose alignment, and fabric simulation — ideal for online retail and fashion tech solutions.
OpenAI's Sora 2 Pro model for text-to-video or image-to-video generation.
OpenAI's Sora 2 model for text-to-video or image-to-video generation.
Ovi is a veo-3 like, video+audio generation model that simultaneously generates both video and audio content.

Google's Gemini 2.5 Flash Text To Speech Preview model

Access the Seedream 4.0 API for fast high resolution image generation and editing. Simple integration, clear pricing, and support for text to image, multi image input, and advanced creative workflows.
Wan-S2V is a video model that generates high-quality videos from static images and audio, with realistic facial expressions, body movements, and professional camera work for film and television applications

Google's Gemini 2.5 Flash Image Preview, also known as Nano Banana, model for text-to-image and image-to-image generation.

HiDream AI I1 Text to Image Fast Version

Qwen Image Edit, the image editing version of Qwen-Image.

An image-to-image model for editing images using ByteDance's seededit v3 model
The fastest Wan 2.2 text-to-video model.
The fastest Wan 2.2 image-to-video model.
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video model.
Wan2.2-TI2V-5B model supports both text-to-video and image-to-video generation at 720P resolution with 24fps. This is Wan2.2-TI2V-5B text-to-video tool.
Wan2.2-TI2V-5B model supports both text-to-video and image-to-video generation at 720P resolution with 24fps. This is Wan2.2-TI2V-5B image-to-video tool.
Wan2.2-I2V-A14B model, designed for image-to-video generation, supporting both 480P and 720P resolutions. Built with a Mixture-of-Experts (MoE) architecture, it achieves more stable video synthesis with reduced unrealistic camera movements and offers enhanced support for diverse stylized scenes.
Wan2.2-T2V-A14B model, which supports generating 5s videos at both 480P and 720P resolutions. Built with a Mixture-of-Experts (MoE) architecture, it delivers outstanding video generation quality.

Text to speech model from ElevenLabs

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalise to many datasets and domains without the need for fine-tuning.

Wiro/AI-Resume-CV-Feedback-Generator enables you to generate detailed, personalized feedback on resumes by comparing them against specific job descriptions using AI-powered insights. Effortlessly analyze candidate CVs, identify strengths and improvement areas, and provide comprehensive feedback reports that help candidates enhance their applications. Supporting both Turkish and English languages, automate CV review processes, reduce manual feedback workload, and deliver actionable insights by matching candidate qualifications with job requirements to improve hiring quality and candidate experience efficiently.

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

Industry leading face manipulation.
It generates a video from the given prompt.

NSFW video detection automatically analyzes video content to identify inappropriate or explicit material, ensuring compliance with content policies and a safe viewing environment.
LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time.
LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time.

AI-powered tool for recognizing Turkish license plates from images.

Wiro/AI-Job-Description-Generator helps you create tailored job descriptions based on detailed job specifications. Using AI-driven analysis, effortlessly transform job details into clear, professional, and well-structured job postings. Simplify your hiring process by automating job description creation, ensuring consistency, and aligning your descriptions with industry standards. Streamline recruitment efforts with accurate, data-driven job descriptions that attract the right talent.

Wiro/AI-Resume-CV-Evaluator-JobDesc helps you evaluate and score resumes (CVs) based on job descriptions using AI-driven analysis. Effortlessly parse and structure data from resumes, PDF, and Word files, ensuring precise candidate-job matching. Streamline your recruitment process by automating CV screening, reducing manual effort, and making data-driven hiring decisions with confidence.
MMAudio generates synchronized audio given video and/or text inputs.
UltraPixel is designed to create exceptionally high-quality, detail-rich images at various resolutions, pushing the boundaries of ultra-high-resolution image synthesis.

This face anonymization tool detects and obscures faces in images.
HunyuanVideo is an advanced video generation tool capable of producing high-quality, precise, and visually compelling videos.

BLIP is a model that is able to perform various multi-modal tasks including visual question answering and image captioning. This is the blip image captioning base model.

It is an image-to-text tool for running different vision language models.
Hotshot-XL is a tool that allows you to generate GIFs from given text. Additionally, it can be controlled by another GIF.
Convert the video data into textual descriptions.
Convert the video data into textual descriptions.
Interior design is a tool that fills an empty room with a provided prompt.

Text-Prompted Generative Audio Model
Larger model with higher video generation quality and better visual effects.
Entry-level model, balancing compatibility. Low cost for running and secondary development.

Change your hair color with this AI tool.
BirefNet is a GenAI tool that removes backgrounds with high precision.

This image-to-image model removes the background of your image.
Make your portrait alive! It is a tool that turns your portrait into a video using a provided reference video.
Hotshot-XL is a tool that allows you to generate GIFs from given text. Additionally, it can be controlled by another GIF.
It generates a video from the given prompt.

More Lighting! IC-Light is used to manipulate the illumination of images using a text-conditioned model.
Outpainting is the process to extend beyond the boundaries of an existing image canvas.

CodeFormer is a tool designed for image restoration and enhancement, particularly focusing on faces.
It turns the input image into an animated form by applying the prompt.
It creates a video by incorporating the movements and audio of a person from a given video into the input image.

A text-to-speech tool by Coqui that generates natural-sounding speech from text and performs voice cloning by taking a 6-second voice sample to produce new voice outputs.

Zest is a tool for material transfer to an object in the input image.

SDXL-Turbo is a fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation.

Stable Video Diffusion (SVD) Image-to-Video is a diffusion model that takes in a still image as a conditioning frame, and generates a video from it.

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalise to many datasets and domains without the need for fine-tuning.

Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration.
curl -X POST "https://api.wiro.ai/v1/Run/bytedance/seedream-v4" \
-H "Content-Type: application/json" \
-H "x-api-key: ${YOUR_API_KEY}" \
-d '{
"prompt": "A serene desert landscape at dusk, a lone traveler riding a camel toward ancient ruins, golden sky with stars starting to appear.",
"size": "2048x2048",
"maxImages": "2"
}'
curl -X POST "https://api.wiro.ai/v1/Run/qwen/qwen-image-edit-fast" \
-H "Content-Type: application/json" \
-H "x-api-key: ${YOUR_API_KEY}" \
-d '{
"inputImage": "https://cdn.wiro.ai/uploads/sampleinputs/qwen-qwen-image-edit-fast_input_2.jpg",
"prompt": "Replace the cat in the image with a koala",
}'; curl -X POST "https://api.wiro.ai/v1/Task/Detail" \
-H "Content-Type: application/json" \
-H "x-api-key: ${YOUR_API_KEY}" \
-d '{
"taskid": "534574"
}';
Wiro's fast interface makes AI instantly accessible and delivers reliable results.
A single integration unlocks access to hundreds of AI models.




Engineered to process huge workloads and peak traffic effortlessly.
Clear docs, SDKs, and seamless onboarding built for developers.
Compare model costs and estimate how many runs you can generate based on your selected budget.















