Improving Diarisation in Speech-to-Text Applications

In police or legal conversation transcriptions, recognising the people speaking is important and an algorithmically difficult problem. An existing solution is to be extended and improved by additional approaches such as encoder-decoder-attractors or multimodal transcription. Students can actively participate in the definition of the work.

  • Information

    • Semester or Master’s thesis for 1-2 people
    • 40% theory, 60% realisation
    • Prerequisites: Signal processing, Python

    German is commonly spoken within the company. Basic proficiency is helpful and appreciated.

Have we sparked your interest?

I am interested in the study Improving Diarisation in Speech-to-Text Applications and would like to find out more.