Let's start with obtaining the resource files. For labeled audio, use the script we provide to cut the audio, and for unlabeled audio, use the silero-vad toolkit to cut it. The dataset is licensed ...