Speech to text
This is the process of converting spoken words into written text. It is also often called speech recognition.
This is the process of listening to and analyzing audio recordings. It is at the heart of a variety of modern AI technologies, including virtual assistants, automatic speech recognition, and text-to-speech applications.
Speaker diarization is the task of determining who spoke when: identifying the start and end time of each speaker's segments in an audio file, together with the identity of the speaker.
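A diarization result can be thought of as a list of time segments, each tagged with a speaker. The following is a minimal sketch of that idea, not the output format of any particular tool: the names `SpeakerSegment`, `merge_adjacent`, and `speaking_time`, and the 0.5-second gap, are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeakerSegment:
    """One diarization segment (hypothetical structure)."""
    start: float   # segment start, in seconds
    end: float     # segment end, in seconds
    speaker: str   # speaker label, e.g. "A" or "spk_0"


def merge_adjacent(segments, max_gap=0.5):
    """Merge consecutive segments from the same speaker when the
    silence between them is at most max_gap seconds (assumed value)."""
    merged = []
    for seg in sorted(segments, key=lambda s: s.start):
        prev = merged[-1] if merged else None
        if prev and prev.speaker == seg.speaker and seg.start - prev.end <= max_gap:
            merged[-1] = SpeakerSegment(prev.start, max(prev.end, seg.end), prev.speaker)
        else:
            merged.append(seg)
    return merged


def speaking_time(segments):
    """Total seconds of speech per speaker label."""
    totals = {}
    for seg in segments:
        totals[seg.speaker] = totals.get(seg.speaker, 0.0) + (seg.end - seg.start)
    return totals


segments = [
    SpeakerSegment(0.0, 2.0, "A"),
    SpeakerSegment(2.25, 4.0, "A"),
    SpeakerSegment(4.5, 6.0, "B"),
]
merged = merge_adjacent(segments)
totals = speaking_time(merged)
```

Here the two segments from speaker "A" are merged because they are only 0.25 seconds apart, leaving one segment per speaker.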
Transcription per region (Speaker Classification)
This is the process of automatically recognizing who is speaking from the speaker-specific information carried in the speech waveform. It can be used to verify the identity claimed by a person accessing a system, which enables voice-based access control for various services.
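As a rough illustration of verifying a claimed identity, the sketch below compares a fixed-length voice feature vector from the incoming audio against one stored at enrollment. Everything here is an assumption for illustration: the text does not say how speaker-specific information is extracted, so the vectors are taken as given, and cosine similarity, the 0.8 threshold, and the names `cosine_similarity` and `verify_speaker` are hypothetical choices.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


def verify_speaker(enrolled, claimed_id, probe, threshold=0.8):
    """Accept the claimed identity only if the probe vector is close
    enough to the vector stored for that identity at enrollment.

    enrolled:   dict mapping identity -> enrolled feature vector
    claimed_id: identity the person claims to be
    probe:      feature vector from the incoming audio
    threshold:  assumed cut-off; in practice it would be tuned on data
    """
    reference = enrolled.get(claimed_id)
    if reference is None:
        return False  # unknown identity: reject
    return cosine_similarity(reference, probe) >= threshold


# Toy vectors standing in for real voice features.
enrolled = {"alice": [1.0, 0.0, 0.0]}
accepted = verify_speaker(enrolled, "alice", [0.9, 0.1, 0.0])
rejected = verify_speaker(enrolled, "alice", [0.0, 1.0, 0.0])
```

Raising the threshold makes the system stricter (fewer impostors accepted, more genuine speakers rejected), and lowering it does the opposite.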
Annotations of timestamps in audio.