Blog
Generative AI Blog by Wiro
Camera Angle Editor: 4 Viewpoint Changes on One Photo
wiro/camera-angle-editor changes perspective on a single photo. This post runs four angles on one input and shows the raw outputs. The goal:…
Live Avatar: Audio-Driven Talking Head Videos in 6 Tests
Live Avatar: Audio-Driven Talking Head Videos in 6 Tests Live Avatar generates a talking head video from a still image and an…
Stable Diffusion 3.5 Large: 6 Prompt Tests (1024px)
Stable Diffusion 3.5 Large: 6 Prompt Tests (1024px) Stable Diffusion 3.5 Large is positioned as a stronger general-purpose text-to-image model with better…
GPT-5.2 vs GPT-5 Mini vs GPT-5 Nano: 6 Constraint Tests
GPT-5.2 vs GPT-5 Mini vs GPT-5 Nano: 6 Constraint Tests GPT-5.2 vs GPT-5 Mini vs GPT-5 Nano comes down to one question:…
GPT-5 Mini: 6 Practical Text Generation Tests
GPT-5 Mini: 6 Practical Text Generation Tests GPT-5 Mini targets fast, low-friction text generation. This review runs six small tests that show…
Seedance Pro v1.5: 8 Prompts for Text-to-Video and Image-to-Video
Seedance Pro v1.5: 8 Prompts for Text-to-Video and Image-to-Video Seedance Pro v1.5 generates short videos from text prompts. It can also animate…
Moondream3 Caption: Image Captioning in 6 Tests
Moondream3 Caption: Image Captioning in 6 Tests Moondream3 Caption turns an image into a plain-language description. It is built for fast image…
Camera Angle Editor: 6 Before/After Viewpoint Changes
Camera Angle Editor: 6 Before/After Viewpoint Changes Camera Angle Editor changes perspective on an existing image. It does not just rotate a…
What Seedream V5 Lite Uncensored Can Actually Edit: 5 Real Examples
Seedream V5 Lite Uncensored: 5 Before and After Edits Seedream V5 Lite Uncensored was tested with five real editing tasks to show…
3D Text Animations: 8 EffectType Presets for Kinetic Typography Videos
3D Text Animations: 8 EffectType Presets for Kinetic Typography Videos 3D Text Animations turns short captions into vertical 9:16 text videos. It…
DreamOmni2: 6 Before/After Image Edits
DreamOmni2: 6 Before/After Image Edits DreamOmni2 is an image editing model. It takes one or more reference images and a short instruction.…
PersonaPlex Realtime: Real-time Speech-to-Speech for Live Voice Agents
PersonaPlex Realtime: Real-time Speech-to-Speech for Live Voice Agents PersonaPlex Realtime targets live speech-to-speech workflows. The model runs as a streaming WebSocket service…
FireRed Image Edit 1.1: 3 Before and After Edits
FireRed Image Edit 1.1 demonstrates improved identity consistency and multi image conditioning. This post shows three real before and after edits that…
Seedream 4.5: 6 Before/After Image Edits
Seedream 4.5: 6 before and after image edits Seedream 4.5 supports both text-to-image and image editing. This post focuses on editing. Each…
Chatterbox Turbo: Fast TTS with Paralinguistic Tags in 6 Tests
Chatterbox Turbo: fast TTS with paralinguistic tags in 6 tests Chatterbox Turbo targets low-latency text-to-speech, but it still tries to sound natural.…
Product with Model: 8 EffectType Recipes for Image-to-Video Ads
8 effect recipes for Product with Model (image-to-video) Product with Model turns a product photo plus a model photo into a short…
LongCat-Image: Multilingual Text Rendering in 6 Tests
LongCat-Image: multilingual text rendering in 6 tests LongCat-Image targets a tricky combo: photoreal images plus readable text in multiple languages. This review…
dots.ocr-1.5: OCR in 6 Screenshot Tests
dots.ocr-1.5 targets OCR and document parsing with a single vision-language model. This post runs 6 screenshot-style tests and shows the raw extracted…
Seedream 4.5 vs Seedream V5 Lite: 6 Prompt Test
Seedream 4.5 vs Seedream V5 Lite: 6 prompt test Seedream 4.5 and Seedream V5 Lite both target fast, high resolution image generation.…
Wan2.2 Animate vs VACE vs Hailuo 2.3: 6 Motion Tests
Wan2.2 Animate vs VACE vs Hailuo 2.3: 6 motion tests This test compares three different ways to animate a still image into…
Kling V3 Motion Control: Motion Transfer in 3 Tests
Kling V3 Motion Control turns a single photo plus a driving video into an action-consistent clip. This post runs three real tests…
Hunyuan Flux SRPO: Text-to-Image Quality in 6 Tests
Hunyuan Flux SRPO: a fast text-to-image model in 6 tests Hunyuan Flux SRPO is a text-to-image model that targets clean aesthetics and…