语音活动检测

如果您想训练一个语音活动检测(VAD)模型来自动化呼叫中心交互、改进语音激活助手系统或其他语音检测应用场景，可以使用此模板。创建可能包含语音的音频片段分段，然后将每个片段分类为是否包含语音(适用于语音激活助手系统场景的特定唤醒词)，或者仅仅是噪音。

交互式模板预览

标注配置

<View>
  <Header value="Listen to the audio:"></Header>
  <Audio name="audio" value="$url" zoom="true"></Audio>    
  <Header value="Highlight segments with sound louder than baseline:"></Header>
  <Labels name="label" toName="audio" choice="multiple">
    <Label value="Above Baseline Sound" background="green" alias="possible-speech"></Label>
  </Labels>
  <Header value="Select a segment and classify it:"></Header>
  <Choices name="voice" toName="audio" choice="multiple" showInline="true" perRegion="true">
    <Choice value="Wake Word" alias="wake-word"></Choice>
    <Choice value="Speech" alias="plain-speech"></Choice>
    <Choice value="Noise" alias="not-speech"></Choice>
  </Choices>
</View>

关于标注配置

所有标注配置必须包裹在View标签内。

您可以添加一个header来为标注者提供说明：

<Header value="Listen to the audio:"></Header>

使用 Audio 对象标签来指定要处理的音频文件位置：

<Audio name="audio" value="$url"></Audio>

使用Labels控制标签，让标注人员能够分割音频并识别可能存在语音的片段。

<Labels name="label" toName="audio" choice="multiple">
  <Label value="Above Baseline Sound" background="green" alias="possible-speech"></Label>
</Labels>

choice="multiple" 参数允许为同一音频片段添加重叠标签。alias 参数可让您为导出标注中的标签指定一个与标注者所见不同的名称。

使用Choices控制标签提示标注者对每个音频片段中的声音类型进行分类：

<Choices name="voice" toName="audio" choice="multiple" showInline="true" perRegion="true">
  <Choice value="Wake Word" alias="wake-word"></Choice>
  <Choice value="Speech" alias="plain-speech"></Choice>
  <Choice value="Noise" alias="not-speech"></Choice>
</Choices>

choice="multiple" 参数允许标注者为特定片段同时选择"语音"和"唤醒词"作为选项。perRegion 参数表示每个分类都应用于特定的音频片段。

专为各种规模的团队设计版本比较

分享您的Label Studio配置！

通过在Awesome Label Studio Configs仓库分享您独特的Label Studio配置，激发社区灵感！

立即贡献！