Search in the Catalogues and Directories

Hits 1 – 4 of 4

1
Speaker Diarization: Current Limitations and New Directions
Knox, Mary Tai. - : eScholarship, University of California, 2013
BASE
2
Speaker Diarization: Current Limitations and New Directions
Knox, Mary Tai. - : eScholarship, University of California, 2013
In: Knox, Mary Tai. (2013). Speaker Diarization: Current Limitations and New Directions. UC Berkeley: Electrical Engineering & Computer Sciences. Retrieved from: http://www.escholarship.org/uc/item/03v5b9wd (2013)
BASE
3
The ICSI RT-09 speaker diarization system
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 2, 371-381
BLLDB
OLC Linguistik
4
The ICSI RT-09 Speaker Diarization System
In: http://infoscience.epfl.ch/record/175320 (2012)
Abstract: The speaker diarization system developed at the International Computer Science Institute (ICSI) has played a prominent role in the speaker diarization community, and many researchers in the rich transcription community have adopted methods and techniques developed for the ICSI speaker diarization engine. Although there have been many related publications over the years, previous articles only presented changes and improvements rather than a description of the full system. Attempting to replicate the ICSI speaker diarization system as a complete entity would require an extensive literature review, and might ultimately fail due to component description version mismatches. This paper therefore presents the first full conceptual description of the ICSI speaker diarization system as presented to the National Institute of Standards and Technology Rich Transcription 2009 (NIST RT-09) evaluation, which consists of online and offline subsystems, multi-stream and single-stream implementations, and audio and audio-visual approaches. Some of the components, such as the online system, have not been previously described. The paper also includes all necessary preprocessing steps, such as Wiener filtering, speech activity detection and beamforming.
URL: http://infoscience.epfl.ch/record/175320
https://doi.org/10.1109/TASL.2011.2158419
BASE