Skip to main content
This guide walks you through the full process: image audio video

Before You Start

You will need the following:
  • Image (uploaded or generated in Hedra)
  • Audio (uploaded, recorded, or AI-generated)
  • Access to video generation inside Hedra Studio

See: Create Images
Tip: Front-facing portraits produce the most natural speaking results.

See: Create Voice
Tip: Clear audio improves lip-sync accuracy.

Select an Avatar Video Model
  1. Go to the Avatar section
  2. Choose an avatar video model (Hedra Omnia, Avatar, Kling) from the pop-up menu below the prompt box
  3. Select aspect ratio, resolution and batch size
  4. Confirm image and audio are both attached loaded

Generate the Video
  1. In the prompt box, describe your video
  2. Click to Generate
  3. Wait for processing to complete
  4. Preview the result
  5. On the video, select to:
    View Details, Share, Download or Delete if needed

Best Practices

  • Use high-resolution images
  • Avoid extreme angles or heavy shadows
  • Keep audio clean and evenly paced
  • Start with short clips when testing

Troubleshooting

The lips don’t match the audio
  • Check for background noise
  • Ensure the audio file is clear and not distorted

The face looks unnatural
  • Use a well-lit, front-facing image
  • Avoid heavy filters or stylization

The video failed to generate
  • Confirm both image and audio are attached
  • Try regenerating

Next Steps

  • Enhance motion with motion control tools
  • Experiment with different voice styles
  • Create multi-scene videos using Composer