Skip to main content

Face Swap and Lip Sync Guide: Features, Limitations, and Tips

Tips, tricks, and best practices

Runbo (CEO of Magic Hour) avatar
Written by Runbo (CEO of Magic Hour)
Updated over 2 months ago

Overview

This guide covers Magic Hour's Face Swap and Lip Sync tools, helping you create professional-quality videos with realistic face replacements and synchronized audio. You'll learn what each tool can do, important limitations to know upfront, best practices for success, and how to get refunds if you're unsatisfied with your results.

Face Swap Video: What You Need to Know

What Face Swap Does

Face Swap replaces a person's face in a video with a different face (from a photo you upload). The AI automatically adjusts lighting, expressions, and head movements to create realistic, seamless results that blend naturally into your original video.

Key Features

  • Multiple face swaps: You can swap multiple different faces in a single video using individual face-to-face mapping. This is useful for group videos or ensemble casts.

  • Full-length video support: Paid plans support videos up to your plan's length limit without watermarks.

  • Automatic lighting adjustment: The AI adapts lighting to match your original video, creating consistent, professional results.

  • Frame-by-frame precision: The tool tracks facial features frame-by-frame for smooth, realistic swaps even during movement.

  • Multiple formats: Upload MP4, MOV, AVI, or WebM videos, or use YouTube URLs directly.

  • Commercial use: All face swaps can be used for commercial projects, marketing, and client work.

Important Limitations

Single Subject Per Swap: Each face swap operation replaces one specific face in your video. If your video has multiple people and you want different results for each, you'll need to upload different swap images for each person and map them individually.

Video Length Limits by Plan: Different plans support different maximum video lengths:

  • Basic (Free): Up to ~33 seconds (400 frames at 12 fps)

  • Creator: Up to ~60 seconds

  • Pro and higher: Full-length videos supported

If your video exceeds your plan's limit, you'll need to trim it or upgrade your plan.

Known issues to expect:

  • Flickering: Videos with fast movement, side profiles, or complex camera angles may show flickering. This happens because the AI struggles to consistently identify the face across all frames.

  • Side profiles perform worse: Face swaps work best on forward-facing footage. Side angles and profile shots often produce lower quality results.

  • Quality varies with input video: If your source video is low-resolution, poorly lit, or has the person blurred or partially visible, the swap quality will suffer.

Face Swap Best Practices

Choose the Right Source Photo: Use a high-quality, well-lit photo of the face you want to swap in. The photo should show a clear, front-facing view of the face with neutral or natural lighting. Avoid blurry images or heavily filtered photos—clarity matters.

Ensure Your Video Is Well-Lit: The better lit your source video, the better the results. Videos shot in daylight or with professional lighting produce much more realistic swaps than dimly lit footage.

Use Forward-Facing Footage: For the best results, use videos where the person's face is clearly visible and facing forward. Avoid side profiles, shots where the face is partially obscured, or videos with extreme head tilts.

Keep Video Movement Steady: Excessive camera movement or erratic head movement can confuse the AI and produce flickering results. Steady footage produces smoother, more consistent swaps.

If you're unsure whether your video will work well, test with a shorter clip first. This lets you see how the AI handles your specific footage before committing all your credits to a full-length render.

Lip Sync: What You Need to Know

What Lip Sync Does

Lip Sync takes a video and matches the person's mouth movements to new audio you provide. The AI analyzes the audio and adjusts the person's lip movements so they appear to be speaking or singing the new audio naturally.

Key Features

  • Multilingual support: Works with audio in any language, making it perfect for dubbing and localization.

  • Text-to-speech integration: Generate audio directly from text using our built-in text-to-speech tool, or upload your own audio file.

  • Long video support: Lip sync supports videos up to approximately 11.1 minutes long (20,000 frames at 30 fps).

  • Video format flexibility: Upload MP4, MOV, AVI, or WebM videos, or link a YouTube URL.

  • Commercial use: All lip-synced content can be used commercially without restrictions.

Important Limitations

Audio Timing Matters: Lip sync works best when your audio file starts immediately (at the beginning, not with silence). If your audio has a gap at the start, it can confuse the model and produce out-of-sync results.

Mouth Visibility Required: The tool needs a clear view of the person's mouth throughout the video. If the face is obscured, the video quality is poor, or the person is at an extreme angle, lip sync won't work well.

Common issues and their causes:

  • Out-of-sync mouths: The person's lips don't match the audio. This often happens with fast-paced audio, mumbling, or unclear speech patterns in your audio file.

  • Excessive mouth movement: If the person in your video is already talking or has exaggerated mouth movements, the AI may over-correct, making the sync look unnatural.

  • Blurry mouth area: Videos of people with beards or shadowing around the mouth area produce blurrier results, making lip sync less accurate.

Lip Sync Success Tips

Use Videos with Minimal Mouth Movement: For the best results, use videos where the person's mouth is relatively still or has minimal natural movement. A person with a neutral expression or a static pose syncs much better than someone who's already talking or making exaggerated expressions. This is the single biggest factor in lip sync success.

Ensure Your Audio Is Clear and Properly Timed: Use high-quality audio with clear speech or singing. Make sure your audio file starts immediately—no silence at the beginning. If you're using music, ensure the vocals are clear and distinct.

Avoid Videos with Beards or Heavy Shadows: Beards and dark shadows around the mouth area make it harder for the AI to detect exact lip movements. If possible, use videos of people without facial hair for cleaner results.

Keep Your Video Forward-Facing and Well-Lit: The person's face should be clearly visible, facing forward, and well-lit. Poor lighting or extreme angles reduce accuracy.

Consider Using Different Model Options: Magic Hour offers different model options (Lite, Standard, Pro). If one model produces poor results, try a different one. Some models handle certain mouth shapes or speech patterns better than others.

If lip sync isn't producing satisfactory results, consider trying our Talking Photo tool instead. Users often report better results with Talking Photo, which requires just an image and audio—no video needed. This eliminates many of the variables that cause lip sync issues.

Troubleshooting Common Issues

Issue

Likely Cause

Solution

Video upload fails or says "Unable to upload face images"

File format issue (often an uppercase file extension like .JPG instead of .jpg)

Rename your image to use lowercase extension (.jpg, .png, .mp4) and try again

Video swap takes a very long time to process

Long video length, high resolution, or busy servers

Trim your video, lower the resolution or FPS, or try again later. Pro plan users get priority processing

Flickering faces in the final swap video

Fast movement, side profiles, or complex camera angles confusing the AI

Use steadier footage with forward-facing angles, or trim sections that cause flickering

Swap produces distorted or blurry faces

Low-quality source video or unclear input image

Use a higher-resolution source video and a clear, well-lit input photo

Lip sync mouths are out of sync with audio

Fast audio, unclear speech, or excessive existing mouth movement in the video

Use clearer audio, ensure it starts immediately (no silence at beginning), or choose videos with minimal existing mouth movement

Lip sync doesn't work on a static face

Expected behavior—if the person isn't talking or moving their mouth, there's nothing to sync

Use videos where the person's mouth is visible and can move. For static images, use Talking Photo instead

Photo is getting compressed during face swap

Plan limitation—Creator plan is limited to 1024px max

Upgrade to Pro or higher for higher resolutions, or accept the compression for social media use

Resolution and Output Quality by Plan

Your plan tier affects the maximum output resolution and processing priority:

  • Basic (Free): Up to 512×512 resolution, watermarked, limited daily generations

  • Creator: Up to 1024×1024 resolution, no watermark, longer videos

  • Pro: Up to 828×1472 (or higher depending on aspect ratio), priority processing, full-length video support

  • Business: Up to 4K resolution, highest priority processing, best quality for commercial use

Plan Your Upgrade: If you're consistently getting poor results with low resolutions or slow processing times, upgrading to Pro or higher will often solve those issues. Higher-resolution inputs produce much better AI results.

What's Next

  • Explore similar tools: Talking Photo (better results for animations from static images)

  • Learn more about Face Swap Photo for swapping faces in still images

  • Check out Video-to-Video to stylize and transform existing videos

Getting Help

If you encounter issues or need support:

  • Email support: Contact [email protected] with your project ID and a description of the issue

  • Include: Project ID, input files (if possible), specific error messages, and what you've already tried

  • Community help: Join our Discord community to share tips and see how others solve similar problems

Pro tip: When reaching out for help, include a screenshot of your generation and describe what specific aspect didn't meet expectations. Detailed descriptions help our team assist you faster and determine whether a refund or troubleshooting is the best path forward.

Did this answer your question?