DeepSound-V1 — Video-to-Audio Synthesis

Code: https://github.com/lym0302/DeepSound-V1

NOTE: It takes longer to process high-resolution videos (>384 px on the shorter side). Doing so does not improve results.

This is a step-by-step v2a process and may take a long time. If Post Processing is set to 'rm', the generated video may be None.

Mode
Post Processing