Commit 3c85a67

Add 5Hz LM planner (CoT-only mode)
Wraps acestep-5Hz-lm-1.7B as a Qwen3 causal LM that runs before the DiT. Generates until </think>, parses the YAML metadata block, and returns structured BPM, key/scale, time signature, language, and a rewritten caption to use as DiT conditioning instead of the raw user input.
1 parent 43bddb3 commit 3c85a67
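
The parsing step described in the commit message could look roughly like the sketch below. It is an illustration only, not the code added in `lm_planner`: the `PlannerMetadata` struct, the `parse_planner_output` function, and the key names (`bpm`, `keyscale`, `timesignature`, `language`, `caption`) are all assumptions, and it handles only flat `key: value` lines rather than full YAML.

```rust
/// Hypothetical container for the planner output. Field names are
/// assumptions for illustration, not the types defined in this commit.
#[derive(Debug, Default, PartialEq)]
struct PlannerMetadata {
    bpm: Option<u32>,
    key_scale: Option<String>,
    time_signature: Option<String>,
    language: Option<String>,
    caption: Option<String>,
}

/// Parse flat `key: value` lines from the text that precedes `</think>`.
/// Multi-line or nested YAML values are not handled by this sketch.
fn parse_planner_output(generated: &str) -> PlannerMetadata {
    // Keep only the text before the closing tag (or everything if absent).
    let body = generated.split("</think>").next().unwrap_or("");
    // Drop an opening <think> tag if the model emitted one.
    let body = body.trim().trim_start_matches("<think>");

    let mut meta = PlannerMetadata::default();
    for line in body.lines() {
        // Split on the first colon only, so values may contain colons.
        if let Some((key, value)) = line.split_once(':') {
            let value = value
                .trim()
                .trim_matches(|c: char| c == '"' || c == '\'');
            if value.is_empty() {
                continue;
            }
            match key.trim() {
                // A non-numeric BPM is silently dropped to None.
                "bpm" => meta.bpm = value.parse().ok(),
                "keyscale" => meta.key_scale = Some(value.to_string()),
                "timesignature" => meta.time_signature = Some(value.to_string()),
                "language" => meta.language = Some(value.to_string()),
                "caption" => meta.caption = Some(value.to_string()),
                // Unknown keys are ignored.
                _ => {}
            }
        }
    }
    meta
}
```

Keeping every field optional means a malformed or truncated generation still yields a usable result, and the caller can fall back to the raw user caption when `caption` is `None`.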

2 files changed, +426 -0 lines changed

src/model.rs

Lines changed: 2 additions & 0 deletions
@@ -5,8 +5,10 @@
 //! - [`encoder`] — condition encoder (lyric encoder, timbre encoder, text projector)
 //! - [`tokenizer`] — audio tokenizer (FSQ) and detokenizer
 //! - [`generation`] — top-level generation model combining all components
+//! - [`lm_planner`] — 5Hz LM planner (CoT-only), expands raw caption → structured metadata
 
 pub mod encoder;
 pub mod generation;
+pub mod lm_planner;
 pub mod tokenizer;
 pub mod transformer;
