Audio Segment Filter

Prev Next

Filters out segments that does not contain desired amount of audio data.

Parameters

name
description
default
ThresholdPercentage A threshold to filter segments. (Technically this is the percentage for segment energy to median energy ratio) 33

Inputs

Audio

 Accepts audio from a single channel.

Events

name description known nodes that generate this event
Start of Segment Signals the start of a speech segment. Audio Segmenter
End of Segment Signals the end of a speech segment. Audio Segmenter

Outputs

Audio

  Audio fragments are sent to output

Events

name description
Start of Segment Raised once at the beginning of each audio fragment.
End of Segment Raised once at the end of each audio fragment.

Remarks

  • This node does performs its calculations when flush is called. So until all the segments are provided and the flush method is called this node does not output anything. Hence it is only suitable for batch usage.

  • Similar to AudioSegmenter Node, this node also toasts the audio output write actions for each segment in between start-of-segment and end-of-segment events. As an example for an audio data with 3 audio segments, the flow is as follows (the order is well defined) :

segment-1: send "Start of Segment" event
segment-1: write the audio data of this segment to output
segment-1: send "End of Segment" event
segment-2: send "Start of Segment" event
segment-2: write the audio data of this segment to output
segment-2: send "End of Segment" event
segment-3: send "Start of Segment" event
segment-3: write the audio data of this segment to output
segment-3: send "End of Segment" event

Project Structure

A sample project can be built as such:

image.png

The aim of this project is to only make use of segments with high enough energy ratios.

Supported flow types

Batch

Release History

v1.0.0
  • Introduced Node.