Commit f08872a

Merge pull request #1814 from hexisyztem/main
[BUG FIX] Use float64 to avoid precision errors; drop the CPU computation so it no longer drags down performance
2 parents f2ddcbe + 0d990d6 commit f08872a

1 file changed: +2 -2 lines changed

cosyvoice/hifigan/generator.py

Lines changed: 2 additions & 2 deletions
@@ -713,8 +713,8 @@ def decode(self, x: torch.Tensor, s: torch.Tensor = torch.zeros(1, 1, 0), finali
     @torch.inference_mode()
     def inference(self, speech_feat: torch.Tensor, finalize: bool = True) -> torch.Tensor:
         # mel->f0 NOTE f0_predictor precision is crucial for causal inference, move self.f0_predictor to cpu if necessary
-        self.f0_predictor.to('cpu')
-        f0 = self.f0_predictor(speech_feat.cpu(), finalize=finalize).to(speech_feat)
+        self.f0_predictor.to(torch.float64)
+        f0 = self.f0_predictor(speech_feat.to(torch.float64), finalize=finalize).to(speech_feat)
         # f0->source
         s = self.f0_upsamp(f0[:, None]).transpose(1, 2)  # bs,n,t
         s, _, _ = self.m_source(s)
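
The cast-up, compute, cast-back pattern in the changed lines can be tried in isolation. The following is a minimal sketch, not the CosyVoice implementation: `toy_predictor` is a hypothetical `nn.Conv1d` standing in for `self.f0_predictor` (whose definition is not part of this diff), the tensor shape is an assumption, and `torch.no_grad()` is used here in place of the `@torch.inference_mode()` decorator from the original method.

```python
import torch
from torch import nn

# Hypothetical stand-in for self.f0_predictor; the real module is not shown in the diff.
toy_predictor = nn.Conv1d(in_channels=80, out_channels=1, kernel_size=3, padding=1)


def predict_f0(speech_feat: torch.Tensor) -> torch.Tensor:
    # Cast the module's parameters to float64 in place. The module stays on
    # whatever device it already lives on; repeated calls are effectively no-ops.
    toy_predictor.to(torch.float64)
    with torch.no_grad():
        # Run the predictor in float64 on the input's own device.
        f0 = toy_predictor(speech_feat.to(torch.float64))
    # Tensor.to(other) casts back to the dtype and device of `other`.
    return f0.to(speech_feat)


speech_feat = torch.randn(1, 80, 50)  # assumed layout: (batch, mel bins, frames), float32
f0 = predict_f0(speech_feat)
print(f0.dtype, f0.shape)
```

Note that `Module.to(torch.float64)` changes the module persistently, so the predictor keeps float64 parameters after the call returns; only the returned tensor is cast back to the caller's dtype.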
