Commit 5d03cd4

charliewwdev and claude committed
V4: add 6 postprocess modules + enhanced audio pipeline
- beat_sync: librosa-based beat detection with 3 shot-alignment modes
- deflicker: histogram/optical_flow temporal flicker removal + cross-shot harmonize
- lip_sync: multi-backend (musetalk/wav2lip/sadtalker/hallo2/basic) with face detection cascade
- mtv_sync: Demucs source separation + per-frame audio-video sync mapping
- style_harmonize: cross-shot color/brightness/texture consistency (LAB/k-means/VGG)
- video_extend: loop_blend, reverse_bounce, RIFE interpolation, VACE continuation
- audio: voice profiles, generate_narration_with_voice(), storyboard voice loading

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
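Note on beat_sync: the message mentions three shot-alignment modes but does not name them. As a rough illustration of what one such mode could look like, the sketch below snaps each shot cut to the nearest detected beat. The function name `snap_to_nearest_beat` and the hard-coded beat times are hypothetical and are not taken from the commit; in the real module the beat times would presumably come from the librosa-based detector.

```python
from bisect import bisect_left


def snap_to_nearest_beat(cut_times, beat_times):
    """Move each cut time (seconds) to the closest beat time.

    Hypothetical helper, not the commit's API. `beat_times` must be sorted
    in ascending order. On a tie, the earlier beat wins.
    """
    if not beat_times:
        # Nothing to align to: leave the cuts where they are.
        return list(cut_times)

    snapped = []
    for t in cut_times:
        # Index of the first beat at or after t.
        i = bisect_left(beat_times, t)
        # Only the beat just before t and the beat at/after t can be nearest.
        candidates = beat_times[max(i - 1, 0): i + 1]
        snapped.append(min(candidates, key=lambda b: abs(b - t)))
    return snapped


# Beats of a 120 BPM track (one beat every 0.5 s), hard-coded for illustration.
beats = [0.0, 0.5, 1.0, 1.5, 2.0]
cuts = [0.62, 1.3, 1.9]
print(snap_to_nearest_beat(cuts, beats))  # [0.5, 1.5, 2.0]
```

Binary search keeps the lookup at O(log n) per cut, which matters little for a handful of shots but avoids scanning every beat of a long track.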
1 parent 5f42448 commit 5d03cd4

8 files changed, +6109 -31 lines changed
animatediff/postprocess/__init__.py

Lines changed: 30 additions & 2 deletions
@@ -4,12 +4,40 @@
 Modules:
 - interpolation: RIFE frame interpolation for smoother motion
 - upscale: Real-ESRGAN anime super-resolution
-- audio: F5-TTS voice generation + BGM
+- audio: F5-TTS voice generation + BGM + per-character voice cloning
 - compositor: shot assembly, transitions, audio mixing
+- deflicker: temporal flicker removal within and across shots
+- beat_sync: beat detection and shot-to-music alignment (requires librosa)
+- mtv_sync: multi-track audio separation + frame-level video sync
+- video_extend: extend short video clips to longer duration
+- style_harmonize: cross-shot visual consistency
+- lip_sync: multi-backend lip synchronization (MuseTalk, Wav2Lip, SadTalker, Hallo2)
 """
 
 from animatediff.postprocess.interpolation import FrameInterpolator
 from animatediff.postprocess.upscale import VideoUpscaler
-from animatediff.postprocess.audio import AudioGenerator, VoiceProfile
+from animatediff.postprocess.audio import (
+    AudioGenerator,
+    VoiceProfile,
+    generate_narration_with_voice,
+    normalize_audio,
+    add_reverb,
+    crossfade_audio,
+    concatenate_audio,
+    get_audio_duration,
+    prepare_reference_audio,
+)
 from animatediff.postprocess.compositor import VideoCompositor
 from animatediff.postprocess.lipsync import LipSyncProcessor, apply_lipsync_to_shots
+from animatediff.postprocess.lip_sync import LipSyncer, FaceDetector
+from animatediff.postprocess.deflicker import VideoDeflicker
+from animatediff.postprocess.beat_sync import BeatAnalyzer, ShotBeatAligner, BeatInfo
+from animatediff.postprocess.mtv_sync import (
+    AudioStreamAnalyzer,
+    VideoAudioSync,
+    AudioStreams,
+    TimeSegment,
+    SyncMap,
+)
+from animatediff.postprocess.video_extend import VideoExtender
+from animatediff.postprocess.style_harmonize import StyleHarmonizer, StyleDescriptor