1 |
The "Fat Face" illusion: A robust adaptation for processing pairs of faces
|
|
|
|
In: ISSN: 0042-6989 ; EISSN: 0042-6989 ; Vision Research ; https://hal.archives-ouvertes.fr/hal-03579276 ; Vision Research, Elsevier, 2022, 195, pp.108015. ⟨10.1016/j.visres.2022.108015⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Biological constraints on configural odour mixture perception
|
|
|
|
In: ISSN: 0022-0949 ; EISSN: 1477-9145 ; Journal of Experimental Biology ; https://hal-cnrs.archives-ouvertes.fr/hal-03610253 ; Journal of Experimental Biology, The Company of Biologists, 2022, 225 (6), pp.jeb242274. ⟨10.1242/jeb.242274⟩ ; https://journals.biologists.com/jeb/article-abstract/225/6/jeb242274/274695/Biological-constraints-on-configural-odour-mixture (2022)
|
|
BASE
|
|
Show details
|
|
3 |
When the Easy Becomes Difficult: Factors Affecting the Acquisition of the English /iː/-/ɪ/ Contrast
|
|
|
|
In: Frontiers in Communication ; 6 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Face recognition improvements in adults and children with face recognition difficulties
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Phonetics and phonology of Tashlhiyt geminates: An overview
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03511107 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Évaluation de la perception des sons de parole chez les populations pédiatriques : réflexion sur les épreuves existantes
|
|
|
|
In: ISSN: 0298-6477 ; EISSN: 2117-7155 ; Glossa ; https://hal.archives-ouvertes.fr/hal-03646757 ; Glossa, UNADREO - Union NAtionale pour le Développement de la Recherche en Orthophonie, 2022, 132, pp.1-27 ; https://www.glossa.fr/index.php/glossa/article/view/1043 (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Speaking Style Variability in Speaker Discrimination by Humans and Machines
|
|
|
|
Abstract:
A speaker's voice constantly varies in everyday situations, such as when talking to a friend, reading aloud, talking to pets, or narrating a happy incident. These changes in speaking style affect human and machine abilities to distinguish speakers based on their voice. This dissertation studies the effects of speaking style variability on speaker discrimination performance by humans and machines.We compare human speaker discrimination performance for read speech versus casual conversations. Listeners perform better when stimuli are style-matched, particularly in read speech -- read speech trials. They perform the worst in style-mismatched conditions. Moderate style variability affects the "same speaker" task more than the "different speaker" task. The speakers who are "easy" or "hard" to "tell together" are not the same as those who are "easy" or "hard" to "tell apart." Analysis of acoustic variability suggests that listeners find it easier to "tell speakers together" when they rely on speaker-specific idiosyncrasies and that they "tell speakers apart" based on their relative positions within a shared acoustic space.The effects of style variability on automatic speaker verification (ASV) systems are systematically analyzed using the UCLA Speaker Variability database, which comprises multiple speaking styles per speaker. The performance is better when enrollment and test utterances are of the same style, but it substantially degrades when styles are mismatched. We hypothesize that between-frame entropy can capture style-related spectral and temporal variations. We propose an entropy-based variable frame rate (VFR) technique to address style variability in two different approaches: data augmentation and self-attentive conditioning. Both approaches improve performance in style-mismatch scenarios and are comparable in performance.Furthermore, humans and machines seem to employ different approaches to speaker discrimination. In an attempt to improve ASV performance in the presence of style variability, insights learnt from the human speaker perception experiments are used to design a training loss function, referred to as "CllrCE loss". CllrCE loss focuses on both speaker-specific idiosyncrasies and relative acoustic distances between the speakers to train the ASV system. This loss function improves ASV performance in case of style variability, especially in the case of moderate style variations from conversational speech.
|
|
Keyword:
Acoustic space analysis; Computer engineering; Electrical engineering; Human speaker perception; Self-attention conditioning; Speaker verification; Speaking style; Variable frame rate
|
|
URL: https://escholarship.org/uc/item/3zh346jm
|
|
BASE
|
|
Hide details
|
|
8 |
Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events.
|
|
|
|
In: Nature communications, vol 13, iss 1 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Variation interculturelle de la perception du spectre masculin-féminin : indexation de la voix genrée en France et aux Etats-Unis
|
|
|
|
In: Devenir non-binaire en français contemporain ; https://hal.archives-ouvertes.fr/hal-03573714 ; Vinay Swamy; Louisa Mackenzie. Devenir non-binaire en français contemporain, Le Manuscrit, 2022, 9782304052428 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Finding the best way to put media bias research into practice via an annotation app ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Isolating the locus of informational interference during speech-in-noise perception: the role of temporal predictability ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Leibniz Dream: Children's comprehension of conjunctive expressions in Hungarian ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Linguistic intergroup bias and persistence of stereotype-affirming memory ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Linguistic intergroup bias and persistence of stereotype-affirming memory - Addendum 04.26.2022 ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Testing the pitch-luminance mapping in humans and in a group of Guinea baboons. A replication of Ludwig et al. (2011) study. ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Isolating the locus of informational interference during speech-in-noise perception: energetic masking vs. intelligibility ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Perception of Vowels Following Obstruents by Native English Speakers and Native Japanese Speakers
|
|
片山 圭巳. - : 熊本大学大学院人文社会科学研究部(文学系), 2022
|
|
BASE
|
|
Show details
|
|
19 |
Psycholinguistic factors of formation associative perception of the brand ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Psycholinguistic factors of formation associative perception of the brand ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|