No login required. We will surgically analyze your ad and reconstruct it for maximum retention.
WHY THIS IS A LOSER: This ad lacks a "Hook Cut" in the critical first 3 seconds, meaning users scroll past before the premise is established. The scene chaos is high (10.8 context resets), creating visual cognitive overload.
HOW IT CAN BECOME A WINNER: To salvage this creative, a human editor must inject a hard visual cut within the first 2 seconds to instantly reset user attention. The pacing must be slowed down to reduce the context reset rate below 8.0. This will theoretically boost the win probability to over 35%.
> System initialized...
STATUS: LOSER
STATUS: IMPROVED (+6.5%)
| METRIC DEFINITION | VARIANT A (ORIGINAL) | VARIANT B (EDITED) | IMPACT |
|---|---|---|---|
| A. COMPUTER VISION & PIXEL MATH | |||
| HOOK CUT DENSITY Number of hard cuts in the critical first 3 seconds. |
0 | 1 | Added cut in first 3s retains attention. |
| CONTEXT RESETS How frequently the entire scene completely changes. |
10.80 | 7.20 | Reduced scene chaos makes it easier to watch. |
| OVERALL CUT DENSITY Pacing of the ad across its total duration. |
0.42 | 0.33 | Pacing slowed down slightly. |
| CONTRAST DELTA Change in contrast over time. |
33.74 | 29.56 | Visual contrast smoothed. |
| LUMINANCE DELTA Change in brightness over time. |
27.48 | 24.46 | Visual brightness smoothed. |
| VIDEO DURATION Total length of the ad in seconds. |
33.32s | 33.32s | Identical length. |
| B. AUDIO SIGNAL PROCESSING | |||
| AUDIO LOUDNESS (LUFS) Total perceived loudness relative to broadcast standards. |
-18.53 | -18.53 | Original audio perfectly remuxed. |
| AUDIO TRANSIENT SPIKES Sudden loud impacts (beats, sound effects, claps). |
37 | 37 | Original audio perfectly remuxed. |
| SPEECH WPM The speed of the speaker's delivery. |
0.00 | 0.00 | Original audio correctly identified as pure music. |
| VOCAL-TO-MUSIC RATIO Ensures voice isn't drowned by background track. |
0.00 | 0.00 | Original audio correctly identified as pure music. |
| C. PSYCHOLOGICAL SEMANTICS | |||
| VIBE AESTHETIC Overall visual mood and grading. |
Cinematic luxury with slow motion | Aspirational lifestyle with warm grading | Aesthetic adjusted to lifestyle. |
| CAPTION STYLE Typography used for subtitles. |
Bold full-sentence overlays | Minimalist white subtitles | Text simplified for readability. |
| TEXT OVERLAY PRESENCE Proportion of frames containing text/graphics. |
0.90 | 1.00 | More consistent text visibility. |
| HOOK ANGLE Psychological opening trigger. |
Visual curiosity | Visual curiosity | Maintained unusual hook. |
We built a multi-agent orchestration pipeline. First, we feed the 41 extracted backend metrics (OpenCV, Librosa) and the original video to Gemini 3.1 Flash Lite. Gemini acts as the "Director," analyzing why the video is a mathematical loser (e.g., 0 hook cuts, too much scene chaos) and writing a precise blueprint for how to fix it.
Once the blueprint is generated, we trigger the Omni Flash API to perform the actual video generation based strictly on Gemini's framing instructions.