How do you write a Seedance 2.0 text-to-video prompt?

Write the subject and action first, then camera movement, setting, sound, and duration. For multi-shot clips, label each shot instead of packing every action into one sentence.

Can Seedance 2.0 generate audio from text?

Yes. Seedance 2.0 text-to-video supports audio generation, so prompts should include concrete sound cues such as footsteps, rain, traffic, room tone, or product clicks.

When should you use text-to-video instead of image-to-video?

Use text-to-video when composition is flexible and you are exploring ideas. Use image-to-video when identity, product shape, outfit, or scene design must stay fixed.

Back to blog

Seedance 2.0 Text-to-Video Guide: Prompts That Work

May 14, 2026By Bilal Azhar

Write better Seedance 2.0 text-to-video prompts with shot structure, camera language, audio cues, pacing, aspect ratios, and copy-paste examples.

Seedance 2.0 text-to-video works best with director-style prompts: subject, action, camera, setting, sound, and cuts. Use Seedance 2.0 on Morphed when you need 1080p output, and use Seedance 2.0 Fast for draft variants. If you are comparing Seedance against Sora, Veo, and Kling first, the text to video AI page has a side-by-side model table with per-second costs.

Seedance 2.0 text-to-video is the right workflow when you do not already have an image, product frame, or character reference. You write the scene from scratch and let the model create the visuals, motion, and audio.

The tradeoff is control. Text-to-video gives you freedom, but less identity stability than Seedance image-to-video or reference-to-video. The prompt needs to define motion clearly without trying to micromanage every pixel.

The biggest mistake is treating text-to-video like a search box. "A beautiful cinematic ad for a smartwatch" sounds clear to a human, but it does not tell Seedance what happens on screen. A better prompt tells the model what a camera operator would shoot: the watch sits on black glass, the second hand starts moving, cyan light catches the bezel, the camera slowly orbits at table height.

That is the difference between an idea and a shot.

Start With A Shot, Not A Mood

Before you write a Seedance text prompt, answer five questions:

Question	Example Answer
What is the subject?	a matte black smartwatch
What changes during the clip?	the second hand starts moving
Where is the camera?	table-height orbit
What is the environment?	black glass studio surface
What should we hear?	subtle mechanical tick

Once those answers are clear, the prompt almost writes itself. You can add style later, but the scene needs bones first.

What Is The Best Prompt Formula?

Use a single-shot formula for simple clips and a shot-list formula for sequences.

[Subject] [does one action] in [specific setting], [camera movement], [lighting or atmosphere], [sound cue].

Example:

A young chef slides a copper pan across a stainless counter, low side-tracking camera, warm kitchen light, pan scrape and soft sizzle.

For 8-15 second clips, use labeled shots:

Shot 1: A cyclist waits at a red light in light rain, close side profile, traffic hum.
Shot 2: The light turns green and the cyclist pushes off, low tracking shot beside the wheel.
Shot 3: Wide shot as the bike crosses a reflective street, rain and tire sound.

Why this works: each shot is a camera instruction, not just a sentence. Seedance gets a starting frame, a motion beat, and a payoff. If you wrote the same idea as one long paragraph, the model would have to decide where the cuts go.

How Should You Pace A Clip?

Pacing depends on duration. A 4-second clip should be one action. A 15-second clip can support a setup, movement, and payoff.

Duration	Best Prompt Type	Example
4s	One action	bottle cap clicks open
6s	One action plus camera move	slow push-in on a dancer turning
8s	Two beats	hand picks up product, camera reveals label
10s	Three-shot sequence	setup, action, payoff
15s	Multi-shot mini ad	problem, transformation, final frame

If the output feels chaotic, reduce either the number of shots or the number of actions per shot.

For text-to-video, duration is not just a quality setting. It is a storytelling budget. Four seconds can carry one action. Fifteen seconds can carry a mini-story. If you ask for a full ad, a costume change, a camera orbit, a product reveal, and a reaction shot in 5 seconds, the model will compress everything until the clip feels confused.

Camera Language That Works

Seedance responds better to physical camera directions than to generic style words.

Instead Of	Use
cinematic	slow push-in
dynamic	handheld follow shot
premium	controlled studio orbit
epic	wide crane pullback
dramatic	low-angle dolly with hard side light

Good camera moves:

slow push-in
locked-off wide
low side-tracking shot
handheld follow
clockwise orbit
top-down macro
crane pullback

Use one main camera move per shot. "Orbit, dolly, zoom, handheld, drone" in one shot usually produces confused motion.

Camera language is also how you control emotion. A locked-off wide shot feels observational. A handheld follow shot feels immediate. A slow push-in feels intimate. A low-angle dolly feels powerful. These words do more work than "cinematic" because they tell Seedance how the viewer is physically moving through the scene.

Sound Cues For Text-To-Video

Seedance can generate synchronized audio, so sound cues are not decoration. They help time events. Use sounds tied to visible actions.

Scene	Sound Cue
Rain street	wet footsteps, traffic hiss
Product reveal	soft click, cloth slide
Cafe scene	espresso hiss, cup tap
Fitness clip	shoe squeak, breath, gym ambience
Fashion shot	fabric rustle, camera shutter

Avoid "great music" or "emotional sound." If you need music timing, describe tempo, beat, or transition point.

Sound is where Seedance prompts can feel unusually alive. A lot of video prompts look good but feel silent. Even if you plan to replace the audio later, a sound cue can help the model understand timing: the cap click happens when the bottle opens, the splash happens when the shoe lands, the room tone drops when the reveal happens.

Prompt Examples By Aspect Ratio

16:9 Landscape

16:9. A silver train crosses a mountain bridge at sunrise, wide locked-off landscape frame, mist moving below, distant rail sound.

What it does: this is a calm establishing shot. The train gives horizontal motion, the mist gives atmospheric motion, and the locked frame keeps the landscape readable.

9:16 Vertical

9:16 vertical. A creator opens a package at a desk, camera close to hands, tape tear and cardboard tap, bright natural window light.

What it does: this is a creator-style product beat. The hands fill the frame, the sound is tied to a visible action, and the vertical crop is not fighting a wide composition.

1:1 Square

1:1 square. A glass perfume bottle rotates on black marble, top-down light sweep, subtle music-box chime, clean luxury product frame.

What it does: square product shots work best when the subject is centered and the movement is symmetrical. This prompt is built for feed posts and product thumbnails.

21:9 Ultrawide

21:9 ultrawide. A lone car drives along a coastal highway at blue hour, drone pullback, ocean wind and engine hum.

What it does: ultrawide is for space. The prompt uses a small subject inside a large environment, so the aspect ratio feels intentional instead of empty.

When Should You Switch To References?

Switch from text-to-video to references when the output must preserve a specific face, product, room, logo, outfit, or camera rhythm. Seedance 2.0 reference-to-video can use images, video clips, and audio clips, but the prompt needs to assign each file one role.

Use text-to-video for exploration:

mood tests
concept boards
generic social clips
landscape and b-roll ideas
rough ad beats

Use image/reference workflows for production:

exact products
repeat characters
branded objects
visual continuity
audio-synced edits

If you keep rerolling a text prompt because the product changes shape or the character looks different each time, that is not a prompting failure. It is a workflow signal. Move to image-to-video or reference-to-video and give Seedance the visual anchor it needs.

A Better First Draft Workflow

Use this when you are exploring from text:

Write the simplest possible one-shot version.
Generate it short, usually 4-6 seconds.
Keep the useful motion and remove the rest.
Add shot labels only after the single-shot version works.
Move to Standard quality only after the prompt is stable.

The goal of the first draft is not to create the final clip. It is to learn what Seedance understands from your wording. Once the motion language is working, then you can add polish.