
How to Make an AI Pet Spa Video (The Top-Down Pet Facial Trend)

Make the AI pet spa video: one pet photo becomes a soothing top-down clip of your dog or cat getting the full facial — towel turban, cucumber eyes, cream massage, foam brush, cool mist — with close-mic'd ASMR audio. Free to try.

Ian Brillantes · Founder & Senior Software Engineer | July 1, 2026 · 7 min read

Quick answer

To make an AI pet spa video, upload one clear photo of your pet to Starrd's Spa Day template. AI lays your pet on its back on a plush white towel with a tiny towel turban and films the whole facial from a locked top-down camera — cucumber slices over the eyes, a slow cream massage, a foam brush sweep, a cool mist, and a finishing cheek squish — with close-mic'd ASMR audio (cream squelch, brush bristles, a contented purr). 10 seconds, powered by Seedance 2.0 — free to try, no editing.

The Most Relaxed Anyone Has Ever Been

You've seen the videos: a cat flat on its back in a tiny towel turban, cucumber slices on its eyes, paws folded, receiving a facial with the serenity of a monk. The comments are always the same — "why is this cat calmer than me" — and the format loops endlessly because nothing happens except bliss, in the most satisfying way possible.

One photo makes it yours: your pet on the plush white towel, turban on, getting the full treatment — cucumbers placed one at a time, cream dotted and massaged in slow circles, a foam brush sweeping the cheeks, a cool mist, a towel pat-down, and one soft finishing cheek squish as it melts. All from a single locked top-down camera, all in close-mic'd ASMR.

What you get — a Spa Day clip made from one pet photo

Where This Trend Comes From

Two evergreen formats, one clip:

  • The pet-pampering genre. Top-down grooming and spa videos — dogs at the blow-dry bar, cats getting brushed into a trance — are a permanent fixture of pet TikTok, and AI pet-transformation clips in adjacent formats have pulled as many as 880 million views. The pet barbershop "fresh fade" proved the AI version of pets-getting-groomed travels just as well.
  • The AI-ASMR wave. The squish/silicone-head trend showed that AI video with close-mic'd tactile audio — squelches, taps, brushes — is its own retention machine. Native-audio models generate the sound in the same pass as the picture.

Spa Day sits exactly at the intersection: the pet-pampering subject, the ASMR sound design, and the one composition both genres share — the locked overhead shot.

The Fastest Way — the Spa Day Template

The Starrd Spa Day template has the whole ritual built: the turban, the cucumbers, the cream, the brush, the mist, and the camera rule. Upload one pet photo, tap once, done.

Spa Day

Your pet gets the full facial — cucumber eyes, cream massage, foam brush, cool mist — filmed top-down like the viral pet-pampering ASMR videos. Just add your pet's photo.


Building it yourself on a raw model? Here's what matters.

Step 1 — Upload One Clear Pet Photo

  • Face fully visible, well lit. The whole video looks straight down at the face — that's what has to stay recognizably yours.
  • One pet. One spa table, one client.
  • No filters, no costumes. The turban and towels come from the template; a clean photo anchors the likeness best.

Pro Tip

Fluffy, light-coated pets read beautifully against the white towels, but the format works for any pet — a deeply unbothered senior dog is every bit as funny-soothing as a Persian in a turban.

Step 2 — The Camera Rule (This Is the Format)

One locked top-down bird's-eye camera for the entire clip, like a phone mounted above a grooming table. The only move allowed is a very slow push in. No cuts, no side angles, no POV switches. This single constraint is what makes the clip read as a real pampering video — the moment the camera starts "directing," it reads as AI. If you write your own prompt, state the rule twice and put it in the negatives too.
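
If you assemble your prompt in code, here is a minimal Python sketch of that advice: the camera rule is written once, placed before the scene, repeated after it, and mirrored in the negatives. The names `build_prompt`, `CAMERA_RULE`, and `CAMERA_NEGATIVES` are made up for illustration and are not part of any Starrd or Seedance API. The function only builds text.

```python
from typing import Optional, Sequence

# Assumed wording, adapted from the camera rule described in this step.
CAMERA_RULE = (
    "STRICT top-down bird's-eye overhead for the ENTIRE video, locked directly "
    "above like a phone mounted over a grooming table. Only movement allowed: "
    "a very slow gentle push in."
)
CAMERA_NEGATIVES = ["camera angle changes", "side views", "cuts", "POV switches"]


def build_prompt(scene: str, extra_negatives: Optional[Sequence[str]] = None) -> str:
    """Return prompt text with the camera rule stated twice, plus negatives."""
    negatives = CAMERA_NEGATIVES + list(extra_negatives or [])
    return "\n".join([
        f"Camera: {CAMERA_RULE}",            # first statement, before the scene
        scene.strip(),
        f"Camera reminder: {CAMERA_RULE}",   # second statement, after the scene
        "Avoid: " + ", ".join(negatives) + ".",
    ])
```

Calling `build_prompt("A cat lies on its back on a plush white spa towel.", ["watermarks"])` returns four lines: the rule, the scene, the rule again, and an Avoid line that ends with your extra negatives.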

Step 3 — The Ritual, In Order

The beats are a real facial, miniaturized: settle → cucumbers → cream massage → brush → mist → pat dry → cheek squish. Each beat is one clean, slow action. The pet's only job is to melt — ears softening, a slow blink, a deep purr or sigh, a faint smile at the end.

AI Pet Spa — Top-Down Facial ASMR (10s)
A fluffy white Persian cat lies on its back on a plush white spa towel with a tiny white towel turban on its head, utterly relaxed, while gentle hands perform a facial. Bright clean home spa station: ceramic cream bowl, cucumber dish, soft brush, fine-mist bottle, pastel towels. Soft diffused daylight. Hyper-satisfying pet-pampering ASMR, photorealistic.
Camera: STRICT top-down bird's-eye overhead for the ENTIRE video, locked directly above like a phone mounted over a grooming table. Only movement allowed: a very slow gentle push in. No angle changes, no cuts, no side views.
[00:00-00:02] The hands place two fresh cucumber slices over the pet's eyes, one at a time. Its body softens with a slow easy breath.
[00:02-00:05] The hands dot creamy white facial cream on its cheeks and forehead, then massage in slow tiny circles. Ears relax outward; a deep rumbling purr.
[00:05-00:08] Cucumbers lifted away; a soft brush sweeps light foam across its cheeks and chin in slow, even strokes. Whiskers twitch; a blissful slow blink.
[00:08-00:10] A fine cool mist, a gentle towel pat-down, and one soft finishing cheek squish. The pet melts into the towel, eyes closed, faint smile.
Sound: close-mic ASMR only — towel rustle, cream squelch, fingertip taps, brush bristles, mist spritz, towel pats, a deep contented purr. Quiet spa room tone. No music. No narration. Generate audio.
Avoid: camera angle changes, side views, cuts, morphing, distortion, extra hands, watermarks.

Seedance 2.0 is the model for this — it keeps the pet's face consistent under the top-down angle and generates the squelch, bristles, and purr in the same pass as the picture. Swap the purr for a contented sigh if your client is a dog.
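
If you want to change the beats without hand-editing timestamps, the timed lines of the prompt above can be generated from a list of durations and actions. This is a rough sketch under stated assumptions: `Beat`, `stamp`, `render_beats`, and the 10-second check are invented for illustration. Only the `[MM:SS-MM:SS]` line format comes from the prompt itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Beat:
    seconds: int  # how long the beat lasts
    action: str   # one slow action, written as a sentence


def stamp(total_seconds: int) -> str:
    """Format a second count as MM:SS."""
    return f"{total_seconds // 60:02d}:{total_seconds % 60:02d}"


def render_beats(beats: List[Beat], clip_seconds: int = 10) -> str:
    """Render beats as '[00:00-00:02] ...' lines that exactly fill the clip."""
    if sum(b.seconds for b in beats) != clip_seconds:
        raise ValueError("beat durations must add up to the clip length")
    lines, start = [], 0
    for beat in beats:
        end = start + beat.seconds
        lines.append(f"[{stamp(start)}-{stamp(end)}] {beat.action}")
        start = end
    return "\n".join(lines)


# Shortened versions of the four beats used in the prompt above.
beats = [
    Beat(2, "The hands place two fresh cucumber slices over the pet's eyes."),
    Beat(3, "The hands massage facial cream in slow tiny circles."),
    Beat(3, "A soft brush sweeps light foam across the cheeks and chin."),
    Beat(2, "A fine cool mist, a towel pat-down, one soft cheek squish."),
]
timeline = render_beats(beats)
```

The length check is there because a beat list that runs past the clip is an easy mistake to make when adding a paw massage or a chin scratch. Trim another beat to make room instead of squeezing everything in.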

Step 4 — The Sound Is Half the Video

No music. The format's audio is the treatment itself: the cream squelch, the fingertip taps, the brush bristles, the mist, and — the detail that makes people replay it — the pet's own contented purr or sigh under everything. If you add a lofi track on top, you've made a montage; leave the close-mic sounds alone and it's ASMR.
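
For prompt builders, the sound rule can be kept in one place so the "no music" part never gets dropped when you change the client. The sketch below is illustrative only: `sound_line`, `FOLEY`, and `PET_SOUND` are hypothetical names, and falling back to a sigh for species other than cats and dogs is an assumption, not something the template is known to do.

```python
# Treatment sounds taken from the prompt's Sound line.
FOLEY = ["towel rustle", "cream squelch", "fingertip taps",
         "brush bristles", "mist spritz", "towel pats"]

# Pet sound per species; anything not listed falls back to a sigh (assumption).
PET_SOUND = {"cat": "a deep contented purr", "dog": "a contented sigh"}


def sound_line(species: str) -> str:
    """Build the Sound line: treatment sounds, the pet's sound, no music."""
    pet = PET_SOUND.get(species.lower(), "a contented sigh")
    cues = ", ".join(FOLEY + [pet])
    return (f"Sound: close-mic ASMR only ({cues}). Quiet spa room tone. "
            "No music. No narration. Generate audio.")
```

`sound_line("cat")` keeps the purr, and `sound_line("dog")` swaps in the sigh. Both end with the same no-music, no-narration instruction.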

Director's Notes — Remix It

  • Swap the treatment — a clay mask, a jelly under-eye patch, a tiny jade roller.
  • Add a beat — a paw massage, a chin scratch between steps.
  • Dress the set — rose petals around the towel, candles at the frame edges.
  • Change the client — every species keeps its own coat and gets its own contented sound.

Common Mistakes

Pro Tip

Don't move the camera. The locked overhead shot is the genre. One angle change and it stops being a found pampering video.

Pro Tip

Don't rush the beats. One slow action per beat. Cramming three treatments into two seconds reads as a highlight reel, not a spa.

Pro Tip

Keep the pet melted. A squirming or startled pet breaks the spell — the comedy and the calm both come from total, impossible relaxation.
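
If you write your own beats, a small checker can catch these three mistakes before you spend a generation on them. What follows is a crude sketch: the word lists, the two-second minimum, and the name `lint_beats` are all assumptions chosen for illustration, and keyword matching will miss plenty of phrasings.

```python
import re
from typing import List

# Assumed trigger phrases; extend them to match how you write prompts.
CAMERA_WORDS = ("cut to", "side view", "side angle", "pov")
STARTLE = re.compile(r"\b(squirm|startle|flinch|jump)")
BEAT = re.compile(r"\[(\d{2}):(\d{2})-(\d{2}):(\d{2})\]\s*(.*)")


def lint_beats(prompt: str, min_seconds: int = 2) -> List[str]:
    """Return warnings for timed beat lines that break the format's rules."""
    warnings = []
    for line in prompt.splitlines():
        match = BEAT.match(line.strip())
        if not match:
            continue  # only timed beat lines are checked
        m1, s1, m2, s2, text = match.groups()
        label = f"[{m1}:{s1}-{m2}:{s2}]"
        length = (int(m2) * 60 + int(s2)) - (int(m1) * 60 + int(s1))
        lowered = text.lower()
        if length < min_seconds:
            warnings.append(f"{label} beat is rushed ({length}s)")
        if any(word in lowered for word in CAMERA_WORDS):
            warnings.append(f"{label} beat moves the camera")
        if STARTLE.search(lowered):
            warnings.append(f"{label} pet is not relaxed")
    return warnings
```

A line such as `[00:00-00:01] Cut to a side view as the cat squirms.` trips all three checks, while the cucumber beat from the prompt earlier in this guide passes with no warnings.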

Frequently Asked Questions

What is the AI pet spa video trend? The AI version of the top-down pet-facial clips — your pet in a towel turban getting cucumbers, cream, brush, and mist from a single locked overhead camera, with close-mic'd ASMR sound.

How do I make one? Upload one clear pet photo to Starrd's Spa Day template and tap generate — the whole ritual is built in.

Why is the camera top-down the whole time? That's the format — one locked bird's-eye view like a phone over a grooming table. The template enforces it.

Does it work with dogs, cats, or other pets? Any pet — exact likeness kept, contented sound adapted to the species.

What does it sound like? Pure close-mic ASMR: squelch, taps, bristles, mist, and the pet's purr or sigh. No music.

What photo should I upload? One clear, well-lit, front-facing shot — the video looks straight down at the face.

Can I change the treatments? Yes, via Director's Notes — but the top-down camera, the calm, and the ASMR sound stay.

About the author

Ian Brillantes · Founder & Senior Software Engineer

Ian is the founder of Starrd and a senior, forward-deployed software engineer. He builds the Seedance 2.0 generation pipeline behind Starrd and writes the step-by-step how-to guides, turning the model internals he works on into practical walkthroughs anyone can follow.
