基于分段的自动语音识别

聆听音频文件并进行分段，然后用自然语言转录每个分段的内容，利用分段执行语音识别。

交互式模板预览

标注配置

<View>
  <Labels name="labels" toName="audio">
    <Label value="Speech" />
    <Label value="Noise" />
  </Labels>
  <Audio name="audio" value="$audio"/>
  <TextArea name="transcription" toName="audio"
            rows="2" editable="true"
            perRegion="true" required="true" />
</View>

关于标注配置

所有标注配置必须包含在View标签内。

使用Labels控制标签，让标注者能够高亮显示音频中代表不同类型噪音的部分：

<Labels name="labels" toName="audio">
    <Label value="Speech" />
    <Label value="Noise" />
</Labels>

使用Audio对象标签来显示可标注的音频波形：

<Audio name="audio" value="$audio"/>

使用TextArea控件标签提示标注者为每段音频提供文字转录：

<TextArea name="transcription" toName="audio"
          rows="2" editable="true"
          perRegion="true" required="true" />

editable="true"参数指定转录文本可编辑，required="true"将转录文本设为标注者的必填字段。若未为音频片段的每个分段提供转录文本（通过perRegion="true"参数设置），则无法提交标注结果。

增强此模板

为特定音频片段添加上下文

如果您希望提示标注者为特定音频片段添加上下文信息，例如选择给定音频片段中说话者的口音或假定性别，您可以在标注配置中添加以下内容：

<View visibleWhen="region-selected">
  <Header value="Select the assumed gender of the speaker:" />
  <Choices name="gender" toName="audio"
           perRegion="true" required="true">
    <Choice value="Man" />
    <Choice value="Woman" />
  </Choices>
</View>

View标签中的visibleWhen参数表示该选项仅在选中特定音频片段时可见。Header标签为标注者提供操作说明。Choices标签包含perRegion参数，用于将所选选项仅应用于选中的音频片段。

专为各种规模的团队设计版本比较

分享您的Label Studio配置！

通过在Awesome Label Studio Configs仓库分享您独特的Label Studio配置，激发社区灵感！

立即贡献！