Overview
You can now generate still-image and video creative directly within Triple Whale. Bringing creative production together with creative analysis simplifies workflows and helps brands get more value from their data.
More of a visual learner? Watch the video overview:
Key Benefits
Easily generate new photos and videos that are inspired by your winning ads
Go from creative idea to prototype in mere minutes
Transform any product photo into vivid video ready for Meta Ads
Get Connected
In chat
Go to triplewhale.com/chat and describe what you want Moby to generate
Make sure to specify whether you want an image or video
When requesting video, also specify how long you want the video to be
Best Practices
Upload an existing photo or photos as "reference files"
Ask Moby to review the image or video assets it created, critique them, and iterate on that feedback until it is satisfied with the result
You can download any asset Moby generates, and access all of them in one place by clicking the plus button in Moby chat
Refer to our Prompting Guide for further insights about how to best prompt Moby.
Image Prompt Framework
Watch this video to see this in practice
Placing products or models in new environments
Step 1: Upload your current product photo or lifestyle image to Moby
Step 2: Copy the following prompt into Moby and replace all text between the double curly brackets "{{ }}" with your direction for what you want to achieve.
Step 3: Run the prompt with your product photo or lifestyle image attached
Prompt (edit anything within the {{ }} )
Use the first reference file as the exact product source. Replicate the {{ product description }} precisely, including all label text, fonts, colors, proportions, and container shape, without any changes or distortions. Do not alter the product design in any way. The product must be identical to the reference, sharp, and in full focus.
Place the product centered on a {{ environment description }}. The environment should feature {{ surface description }} with {{ surface texture or color details }}, clean edges, and a reflective or matte surface depending on the lighting style.
Include {{ secondary object description }} in the frame to add balance and atmosphere, but ensure it remains softly out of focus.
In the background, show {{ background elements }} that align with the {{ aesthetic style }}. These should be slightly blurred with shallow depth of field so they are noticeable but not distracting.
The overall scene should feel {{ mood or tone }}, filled with {{ lighting description }} to create balanced highlights and soft shadows.
The composition should look like a premium lifestyle advertisement, with the {{ product description }} as the hero, rendered in ultra high resolution with photorealistic studio-quality lighting, crisp details, and natural depth of field.
**Negative Prompt:** Do not alter or distort the product label, text, fonts, or colors. Do not change the container shape. Do not invent or replace any branding. Avoid blurry, low-resolution, or stylized outputs. Do not obscure the product with background objects. No cartoon, painting, illustration, or abstract effects.
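If you reuse this template often, you can fill in the placeholders with a small script before pasting the result into Moby. The sketch below is only an illustration: `fill_prompt` is a hypothetical helper, not part of Triple Whale or Moby, and the shortened template and sample values are made up for the example. It replaces each `{{ name }}` with the value you supply and stops with an error if any placeholder is left unfilled.

```python
import re

# Matches a placeholder such as "{{ product description }}" and captures its name.
PLACEHOLDER = re.compile(r"\{\{\s*(.*?)\s*\}\}")


def fill_prompt(template: str, values: dict[str, str]) -> str:
    """Replace every {{ name }} in the template with values[name].

    Hypothetical helper for illustration; raises KeyError if a placeholder
    in the template has no matching entry in values.
    """
    missing = sorted({name for name in PLACEHOLDER.findall(template) if name not in values})
    if missing:
        raise KeyError(f"No value supplied for: {', '.join(missing)}")
    return PLACEHOLDER.sub(lambda match: values[match.group(1)], template)


# Shortened version of the prompt above, with made-up sample values.
template = (
    "Replicate the {{ product description }} precisely. "
    "Place the product centered on a {{ environment description }}."
)
prompt = fill_prompt(template, {
    "product description": "amber glass serum bottle",
    "environment description": "sunlit marble bathroom counter",
})
print(prompt)
```

Checking for unfilled placeholders before you run the prompt helps avoid sending Moby a literal "{{ }}" by mistake.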
Output Example
Combining Images
Watch this video to see this in practice
Step 1: Upload your product or model photo (image 1) as well as a secondary photo that you want to combine or blend your product/model image into (image 2)
Step 2: Copy the following prompt into Moby and replace all text between the double curly brackets "{{ }}" with your direction for what you want to achieve.
Step 3: Run the prompt with both images attached (image 1 first, then image 2)
Prompt (edit anything within the {{ }} )
Use the **first reference image** ({{ primary reference description }}) as the **exact source** for the {{ product or subject being added }}. Replicate every visible detail precisely — including color, material, proportions, and branding — with no distortion or alteration. The product must appear sharp, detailed, and identical to the original.
Use the **second reference image** ({{ target scene description }}) as the **base environment**. Seamlessly blend the {{ product or subject being added }} into {{ how it should appear in the scene }} so it looks naturally part of the photo. Match lighting direction, brightness, and color tone between both images to ensure cohesive realism.
Ensure proper scale, natural hand or surface interaction, and realistic **contact shadows and reflections**. The product should integrate as if it were physically photographed in the same studio or environment as the base image.
Maintain a {{ mood or style description }} aesthetic with {{ lighting description }} lighting. The overall look should feel cinematic yet natural, with crisp detail, lifelike texture, and realistic depth of field.
Render the image in **ultra-high resolution** with **photorealistic lighting, accurate shadow blending, and perfect color consistency**. The final composition should appear like a single professionally shot photograph.
**Negative Prompt:**
Do not distort or recolor the {{ product or subject being added }}.
Do not alter text, logos, or labels.
Avoid mismatched lighting, unrealistic reflections, or flat compositing.
Exclude stylized, painted, or cartoonish effects.
Maintain full sharpness and photoreal clarity throughout.
Output Example
Video Prompt Framework
Watch this video to see it in practice
While the SMART framework keeps Triple Whale prompts business-focused, Google’s new Veo 3 video model offers a complementary way to think about creative prompts for generating video content.
Video Prompt Framework (using Veo 3 for rich media & beyond)
| Building block | What it covers | Mini-example |
| --- | --- | --- |
| 1. Subject | Who/what is on screen (people, animals, objects, combos). | “A playful golden-retriever puppy” |
| 2. Action | The movement or behaviour. | “Leaps to catch a frisbee” |
| 3. Scene / Context | Location, time of day, weather, era. | “At sunset on a misty beach” |
| 4. Camera Angle | Eye-level, low/high angle, bird’s-eye, etc. | “Low-angle tracking shot” |
| 5. Lighting & Style | Natural vs. artificial light, overall tone/mood, artistic style or color palette. | “Golden-hour glow, cinematic 35 mm look” |
| 6. Ambiance | Atmospheric & textural details (fog, rain, grain, neon, etc.). | “Soft fog rolls across wet cobblestones” |
| 7. Temporal Elements | Pacing (slow-mo, time-lapse), subtle evolution. | “Time-lapse day-to-night skyline” |
| 8. Audio (opt.) | Dialogue, ambient noise, SFX. | “Waves crashing, distant seagulls” |
| 9. Post-Prompt Helpers | Gemini as “expert prompter” or “second-pair-of-eyes”. | Ask Gemini to refine or QC the prompt. |
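One way to keep these building blocks straight is to treat them as fields that you fill in and then join into a single prompt. The sketch below is a rough illustration using the mini-examples from the table: the `VideoPrompt` class, its field names, and the sentence order are assumptions made for this example, not a format that Moby or the video model requires. Building block 9 is a review step rather than prompt text, so it has no field.

```python
from dataclasses import dataclass


@dataclass
class VideoPrompt:
    """Hypothetical container for building blocks 1-8 of the table above."""
    subject: str
    action: str
    scene: str
    camera: str = ""
    lighting_style: str = ""
    ambiance: str = ""
    temporal: str = ""
    audio: str = ""

    def render(self) -> str:
        # Subject, action, and scene form the opening sentence.
        opening = f"{self.subject} {self.action} {self.scene}."
        # Optional blocks become their own short sentences; empty ones are skipped.
        extras = [self.camera, self.lighting_style, self.ambiance, self.temporal]
        parts = [opening] + [f"{extra}." for extra in extras if extra]
        if self.audio:
            parts.append(f"Audio: {self.audio}.")
        return " ".join(parts)


clip = VideoPrompt(
    subject="A playful golden-retriever puppy",
    action="leaps to catch a frisbee",
    scene="at sunset on a misty beach",
    camera="Low-angle tracking shot",
    lighting_style="Golden-hour glow, cinematic 35 mm look",
    audio="waves crashing, distant seagulls",
)
print(clip.render())
```

Filling the fields one at a time makes it easy to spot a building block you have left out before you send the prompt to Moby.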
Quick Rule of Thumb
Think of SMART as the business objective and Veo 3’s nine elements as the creative blueprint. Use them together when you need both strategic insight and vivid storytelling (e.g., campaign videos, ad mock-ups, product storyboards).
Best Practices using Video (Veo 3)
Use cinematic language – terms like match-cut, montage, or split-diopter give the model clearer direction.
Be laser-specific and avoid filler words. Clear, direct phrasing reduces ambiguity.
Dialogue tip: Use a colon (:) after the speaker to prevent on-screen subtitles.
Generate multiple aspect ratios (16:9, 9:16, 1:1) for omnichannel performance.
One scene per prompt – break multi-step stories into separate clips for sharper results.
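The last two tips combine naturally: write one prompt per scene, then request each scene in every aspect ratio you need. The sketch below only builds the list of request texts for you to paste into Moby. `expand_requests` is a hypothetical helper, the sample scenes are made up, and the "Aspect ratio:" wording is an assumption about how you might phrase the request, not a documented Moby syntax.

```python
# Aspect ratios taken from the best practices list above.
ASPECT_RATIOS = ("16:9", "9:16", "1:1")


def expand_requests(scenes: list[str], ratios: tuple[str, ...] = ASPECT_RATIOS) -> list[str]:
    """Return one request per (scene, aspect ratio) pair, keeping scenes in order."""
    return [f"{scene} Aspect ratio: {ratio}." for scene in scenes for ratio in ratios]


# A two-step story split into separate single-scene prompts.
scenes = [
    "A barista pours latte art in a sunlit cafe.",
    "A customer takes the first sip and smiles.",
]
requests = expand_requests(scenes)
for request in requests:
    print(request)
```

Two scenes and three aspect ratios produce six separate requests, each describing a single scene.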
How this applies inside Triple Whale
SMART overlay: Once you’ve drafted the creative prompt, run it through SMART to ensure it’s measurable and time-bound before handing it to an Agent.
Iterate with Moby: Treat Moby as your in-house prompt QA—ask it to tighten language, flag missing data context, or suggest alternate styles.
Frequently Asked Questions
1. Which models does Moby use to generate images and videos?
Veo 3 for video and Nano Banana for images (both from Google).
2. Is Moby good at preserving the details of product photography?
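Yes. For best results, upload the product photo as a reference file and use the image prompt frameworks above, which instruct Moby to replicate label text, fonts, colors, and proportions exactly.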

