Introduction
Quick Start
Speech-to-Text
Text-to-Speech
Released Models
Demos
API Reference
AddBabble
AddBabble.forward()
AddNoise
AddNoise.forward()
AddReverb
AddReverb.forward()
DropChunk
DropChunk.forward()
DropFreq
DropFreq.forward()
EnvCorrupt
EnvCorrupt.forward()
Resample
Resample.forward()
SpeedPerturb
SpeedPerturb.forward()
TimeDomainSpecAugment
TimeDomainSpecAugment.forward()
build_augment_pipeline()
waveform_augment()
batch_feature_normalize()
batch_pad_right()
feature_normalize()
pad_right_2d()
pad_right_to()
waveform_collate_fn()
CSVDataset
CSVDataset.convert_to_record()
CSVDataset.load_data_csv()
CSVDataset.load_speaker_to_label()
meta_info
meta_info.duration
meta_info.label
meta_info.start
meta_info.stop
meta_info.utt_id
meta_info.wav
JSONDataset
meta_info.record_id
InputNormalization
InputNormalization.save()
InputNormalization.spk_dict_count
InputNormalization.spk_dict_mean
InputNormalization.spk_dict_std
InputNormalization.to()
blackman_window()
compute_amplitude()
convolve1d()
dB_to_amplitude()
normalize()
notch_filter()
rescale()
reverberate()