Slide 27
Slide 27 text
ex) Whisper
文字起こしに前処理いれる
from pyannote.audio import Audio, Pipeline
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization-3.1")
diarization = pipeline(audio_file.name)
audio = Audio(sample_rate=16000, mono=True)
for segment, _, speaker in diarization.itertracks(yield_label=True):
# 音声ファイルから話者のセグメントを切り出す
waveform, sample_rate = audio.crop(no_silence_audio_file.name, segment)
・・・