DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...23
Hits 1 – 20 of 460

1
Machine learning for speaker recognition
Mak, M. W.; Chien, Jen-tzung. - Cambridge : Cambridge University Press, 2020
BLLDB
UB Frankfurt Linguistik
Show details
2
Sprachanalyse : does forensic phonetics reveal the criminal? = Voice analysis
Braun, Stefan K.. - Frankfurt am Main : neowiss - Europäischer Wissenschaftsverlag; MCDP International UG, 2020
BLLDB
Institut für Empirische Sprachwissenschaft
UB Frankfurt Linguistik
Show details
3
Towards Understanding Voice Discrimination Abilities of Humans and Machines
Park, Soo Jin. - : eScholarship, University of California, 2019
In: Park, Soo Jin. (2019). Towards Understanding Voice Discrimination Abilities of Humans and Machines. UCLA: Electrical and Computer Engineering 0333. Retrieved from: http://www.escholarship.org/uc/item/22d942x3 (2019)
BASE
Show details
4
Der VokalJäger : eine phonetisch-algorithmische Methode zur Vokaluntersuchung : exemplarisch angewendet auf historische Tondokumente der Frankfurter Stadtmundart
Keil, Carsten. - New York : Georg Olms Verlag, 2017
BLLDB
UB Frankfurt Linguistik
Show details
5
Linguistically-constrained formant-based i-vectors for automatic speaker recognition
Abstract: This is the author’s version of a work that was accepted for publication in Speech Communication. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Speech Communication, VOL 76 (2016) DOI 10.1016/j.specom.2015.11.002 ; This paper presents a large-scale study of the discriminative abilities of formant frequencies for automatic speaker recognition. Exploiting both the static and dynamic information in formant frequencies, we present linguistically-constrained formant-based i-vector systems providing well calibrated likelihood ratios per comparison of the occurrences of the same isolated linguistic units in two given utterances. As a first result, the reported analysis on the discriminative and calibration properties of the different linguistic units provide useful insights, for instance, to forensic phonetic practitioners. Furthermore, it is shown that the set of units which are more discriminative for every speaker vary from speaker to speaker. Secondly, linguistically-constrained systems are combined at score-level through average and logistic regression speaker-independent fusion rules exploiting the different speaker-distinguishing information spread among the different linguistic units. Testing on the English-only trials of the core condition of the NIST 2006 SRE (24,000 voice comparisons of 5 minutes telephone conversations from 517 speakers -219 male and 298 female-), we report equal error rates of 9.57 and 12.89% for male and female speakers respectively, using only formant frequencies as speaker discriminative information. Additionally, when the formant-based system is fused with a cepstral i-vector system, we obtain relative improvements of ∼6% in EER (from 6.54 to 6.13%) and ∼15% in minDCF (from 0.0327 to 0.0279), compared to the cepstral system alone. ; This work has been supported by the Spanish Ministry of Economy and Competitiveness (project CMC-V2: Caracterizacion, Modelado y Compensacion de Variabilidad en la Señal de Voz, TEC2012-37585-C02-01). Also, the authors would like to thank SRI for providing the Decipher phonetic transcriptions of the NIST 2004, 2005 and 2006 SREs that have allowed to carry out this work.
Keyword: Automatic speaker recognition; Formant dynamics; Formant frequencies; Linguistically-constrained systems; Telecomunicaciones
URL: https://doi.org/10.1016/j.specom.2015.11.002
http://hdl.handle.net/10486/675247
BASE
Hide details
6
Elektronische Sprachsignalverarbeitung 2015 : Tagungsband der 26. Konferenz, Eichstätt, 25. - 27. März 2015
Wirsching, Günther (Hrsg.). - Dresden : TUDpress, 2015
BLLDB
UB Frankfurt Linguistik
Show details
7
Improving the self-adaptive voice activity detector for speaker verification using map adaptation and asymmetric tapers
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 18 (2015) 2, 195-203
BLLDB
Show details
8
Automatic-Type Calibration of Traditionally Derived Likelihood Ratios: Forensic Analysis of Australian English/o/Formant Trajectories
In: Proceedings of Interspeech 2008 incorporating SST 2008 (2015)
BASE
Show details
9
Cepstral trajectories in linguistic units for text-independent speaker recognition
BASE
Show details
10
Severe apnoea detection using speaker recognition techniques
Fernández Pozo, Rubén; Blanco, José Luis; Hernández, Luis Alberto. - : Institute for Systems and Technologies of Information, Control and Communication, 2015
BASE
Show details
11
Implementation of forensic voice comparison within the new paradigm for the evaluation of forensic evidence
Enzinger, Ewald, Electrical Engineering & Telecommunications, Faculty of Engineering, UNSW. - : University of New South Wales. Electrical Engineering & Telecommunications, 2015
BASE
Show details
12
Statistical language and speech processing : second International Conference, SLSP 2014, Grenoble, France, October 14-16, 2014 ; proceedings
Besacier, Laurent (Hrsg.). - Cham [u.a.] : Springer, 2014
BLLDB
UB Frankfurt Linguistik
Show details
13
Text, speech, and dialogue : 17th international conference, TSD 2014, Brno, Czech Republic, September 8 - 12, 2014. ; proceedings
Sojka, Petr (Hrsg.). - Cham [u.a.] : Springer, 2014
BLLDB
UB Frankfurt Linguistik
Show details
14
Evaluating Automatic Speaker Recognition systems: An overview of the NIST Speaker Recognition Evaluations (1996-2014)
In: http://atvs.ii.uam.es/files/loquens_jgr_published.pdf (2014)
BASE
Show details
15
Evaluating automatic speaker recognition systems: an overview of the nist speaker recognition evaluations (1996-2014)
BASE
Show details
16
Advances in Nonlinear Speech Processing : 6th International Conference, NOLISP 2013, Mons, Belgium, June 19-21, 2013, Proceedings
Drugman, Thomas; Dutoit, Thierry. - Berlin, Heidelberg : Springer Berlin Heidelberg, 2013
UB Frankfurt Linguistik
Show details
17
Advances in nonlinear speech processing : 6th international conference ; proceedings
Solé-Casals, Jordi; Carson-Berndsen, Julie; Daoudi, Khalid. - Heidelberg [u.a.] : Springer, 2013
BLLDB
UB Frankfurt Linguistik
Show details
18
Elektronische Sprachsignalverarbeitung 2013 : Tagungsband der 24. Konferenz Bielefeld, 26. - 28.3.2013
Wagner, Petra (Hrsg.). - Dresden : TUDpress, 2013
BLLDB
UB Frankfurt Linguistik
Show details
19
Eigenvoice modelling for cross likelihood ratio based speaker clustering: a Bayesian approach
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 27 (2013) 4, 1011-1027
BLLDB
Show details
20
Will smart surveillance systems listen, understand and speak Slovene?
In: Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave, Vol 1, Iss 2, Pp 165-180 (2013) (2013)
BASE
Show details

Page: 1 2 3 4 5...23

Catalogues
52
0
213
0
0
8
0
Bibliographies
441
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
17
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern