Audio Diarization Python

A Python tool for transcription and speaker diarization

StellaScript is a Python application for generating speaker-aware transcriptions from live or pre-recorded audio. It integrates several machine learning models for its core functions: speech-to-text ...

GitHub

baturalpkars/audio-diarization-cs

This repository contains a pure C# pipeline for offline speaker diarization and face–speaker alignment. It mirrors the NeMo diarization flow and the Python alignment logic used in the original project ...

IEEE

Pyannote.Audio: Neural Building Blocks for Speaker Diarization

Abstract: We introduce pyannote.audio, an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it provides a set of trainable end-to-end neural ...

IEEE

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion

Abstract: Speaker diarization consists of assigning speech signals to people engaged in a dialogue. An audio-visual spatiotemporal diarization model is proposed. The model is well suited for ...

Microsoft

Train Short, Infer Long: Speech-LLM Enables Zero-Shot Streamable Joint ASR and Diarization on Long Audio

Joint automatic speech recognition (ASR) and speaker diarization aim to answer the question”who spoke what”in multi-speaker scenarios. In this paper, we present an end-to-end speech large language ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results