Code: https://github.com/lym0302/DeepSound-V1
NOTE: High-resolution videos (>384 px on the shorter side) take longer to process,
and the extra resolution does not improve results.
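To follow the resolution note above, a video can be downscaled before it is uploaded. The sketch below is not part of DeepSound-V1; the helper names `target_size` and `ffmpeg_downscale_cmd` are hypothetical, and it assumes you already know the source width and height and have the `ffmpeg` CLI installed. It only computes the target size and builds the command line; running it is left to the caller.

```python
def target_size(width: int, height: int, short_side: int = 384) -> tuple[int, int]:
    """Return (width, height) scaled so the shorter side is at most `short_side`."""
    shorter = min(width, height)
    if shorter <= short_side:
        return width, height  # already small enough; leave untouched
    scale = short_side / shorter
    # Round to even numbers, which H.264 with 4:2:0 chroma subsampling requires.
    new_w = max(2, round(width * scale / 2) * 2)
    new_h = max(2, round(height * scale / 2) * 2)
    return new_w, new_h


def ffmpeg_downscale_cmd(src: str, dst: str, width: int, height: int,
                         short_side: int = 384) -> list[str]:
    """Build (but do not run) an ffmpeg command that resizes `src` into `dst`."""
    new_w, new_h = target_size(width, height, short_side)
    return [
        "ffmpeg", "-y",
        "-i", src,
        "-vf", f"scale={new_w}:{new_h}",  # ffmpeg's scale filter
        "-c:a", "copy",                   # keep the original audio stream as-is
        dst,
    ]


if __name__ == "__main__":
    # Example: a 1920x1080 clip becomes 682x384.
    print(" ".join(ffmpeg_downscale_cmd("input.mp4", "input_384.mp4", 1920, 1080)))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)`. Videos whose shorter side is already 384 px or less keep their original dimensions.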
This is a step-by-step video-to-audio (V2A) process and may take a long time.
If Post Processing is set to 'rm', the generated video may be None.