paddlespeech.t2s.datasets.get_feats module
- class paddlespeech.t2s.datasets.get_feats.Energy(n_fft: int = 2048, hop_length: int = 300, win_length: Optional[int] = None, window: str = 'hann', center: bool = True, pad_mode: str = 'reflect')
Bases: object
Methods:
get_energy
- class paddlespeech.t2s.datasets.get_feats.LinearSpectrogram(n_fft: int = 1024, win_length: Optional[int] = None, hop_length: int = 256, window: str = 'hann', center: bool = True)
Bases: object
Methods:
get_linear_spectrogram
- class paddlespeech.t2s.datasets.get_feats.LogMelFBank(sr: int = 24000, n_fft: int = 2048, hop_length: int = 300, win_length: Optional[int] = None, window: str = 'hann', n_mels: int = 80, fmin: int = 80, fmax: int = 7600, norm: Optional[Union[typing_extensions.Literal[slaney], float]] = 'slaney', htk: bool = False, power: float = 1.0)
Bases: object
Methods:
get_log_mel_fbank
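
The listing above gives only constructor signatures, so the following is a minimal NumPy-only sketch of what the shared parameters (n_fft, hop_length, win_length, window='hann', center=True, pad_mode='reflect') typically control in a short-time Fourier transform. It is not the library's implementation. The helper names stft_magnitude and frame_energy are hypothetical, and defining frame energy as the L2 norm of each frame's magnitude spectrum is an assumption made for illustration.

```python
import numpy as np


def stft_magnitude(wav, n_fft=2048, hop_length=300, win_length=None):
    """Magnitude STFT (hypothetical helper, not the PaddleSpeech API).

    Assumes a periodic Hann window, center=True and reflect padding.
    """
    win_length = n_fft if win_length is None else win_length
    # Periodic Hann window, zero-padded symmetrically up to n_fft.
    window = 0.5 - 0.5 * np.cos(2 * np.pi * np.arange(win_length) / win_length)
    left = (n_fft - win_length) // 2
    window = np.pad(window, (left, n_fft - win_length - left))
    # center=True with reflect padding: add n_fft // 2 samples on each side
    # so that frame i is centered on sample i * hop_length.
    padded = np.pad(wav, n_fft // 2, mode="reflect")
    n_frames = 1 + (len(padded) - n_fft) // hop_length
    frames = np.stack([
        padded[i * hop_length:i * hop_length + n_fft] for i in range(n_frames)
    ])
    # Result shape: (n_frames, n_fft // 2 + 1)
    return np.abs(np.fft.rfft(frames * window, axis=1))


def frame_energy(wav, **stft_kwargs):
    """Per-frame energy, assumed here to be the L2 norm over frequency bins."""
    mag = stft_magnitude(wav, **stft_kwargs)
    return np.sqrt(np.sum(mag ** 2, axis=1))


# One second of a 440 Hz tone sampled at 24 kHz (the sr default listed above).
sr = 24000
t = np.arange(sr) / sr
wav = 0.5 * np.sin(2 * np.pi * 440.0 * t)

# Defaults taken from the LinearSpectrogram and Energy signatures above.
spec = stft_magnitude(wav, n_fft=1024, hop_length=256)
energy = frame_energy(wav, n_fft=2048, hop_length=300)
print(spec.shape, energy.shape)
```

With centering, the number of frames is 1 + len(wav) // hop_length, which is why the two feature streams only line up frame by frame when they share the same hop_length. A log-mel filter bank would additionally project the magnitude spectrum onto n_mels triangular filters between fmin and fmax and take a logarithm; that step is omitted here.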