1 |
Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants
|
|
|
|
In: ISSN: 1662-4548 ; EISSN: 1662-453X ; Frontiers in Neuroscience ; https://hal.archives-ouvertes.fr/hal-03627441 ; Frontiers in Neuroscience, Frontiers, 2022, 16 (779062), ⟨10.3389/fnins.2022.779062⟩ ; https://www.frontiersin.org/articles/10.3389/fnins.2022.779062/full (2022)
|
|
BASE
|
|
Show details
|
|
2 |
A fine-grained recognition of Named Entities in ELTeC collection using cascades
|
|
|
|
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
|
|
BASE
|
|
Show details
|
|
3 |
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Emotional Speech Recognition Using Deep Neural Networks
|
|
|
|
In: ISSN: 1424-8220 ; Sensors ; https://hal.archives-ouvertes.fr/hal-03632853 ; Sensors, MDPI, 2022, 22 (4), pp.1414. ⟨10.3390/s22041414⟩ (2022)
|
|
BASE
|
|
Show details
|
|
5 |
The Impact of Removing Head Movements on Audio-visual Speech Enhancement
|
|
|
|
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.inria.fr/hal-03551610 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, May 2022, Singapore, Singapore. pp.1-5 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Face recognition improvements in adults and children with face recognition difficulties
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Face masks versus sunglasses: Limited effects of time and individual differences in the ability to judge facial identity and social traits
|
|
|
|
BASE
|
|
Show details
|
|
8 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
9 |
BBC-Oxford British Sign Language Dataset
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03516444 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Fine-tuning pre-trained models for Automatic Speech Recognition: experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03647315 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Evaluation of Speaker Anonymization on Emotional Speech ; Analyse de l'anonymisation du locuteur sur de la parole émotionnelle
|
|
|
|
In: JEP2022 - Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-03636737 ; JEP2022 - Journées d'Études sur la Parole, Jun 2022, Île de Noirmoutier, France (2022)
|
|
BASE
|
|
Show details
|
|
12 |
Can machines learn to see without visual databases?
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03526569 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
13 |
К вопросу о сущности основных конституционных обязанностей человека и гражданина ... : On the question of the essence of the basic constitutional duties of a person and a citizen ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Contextual time-continuous emotion recognition based on multimodal data ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Unsupervised quantification of entity consistency between photos and text in real-world news ...
|
|
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
|
|
Abstract:
Das World Wide Web und die sozialen Medien übernehmen im heutigen Informationszeitalter eine wichtige Rolle für die Vermittlung von Nachrichten und Informationen. In der Regel werden verschiedene Modalitäten im Sinne der Informationskodierung wie beispielsweise Fotos und Text verwendet, um Nachrichten effektiver zu vermitteln oder Aufmerksamkeit zu erregen. Kommunikations- und Sprachwissenschaftler erforschen das komplexe Zusammenspiel zwischen Modalitäten seit Jahrzehnten und haben unter Anderem untersucht, wie durch die Kombination der Modalitäten zusätzliche Informationen oder eine neue Bedeutungsebene entstehen können. Die Anzahl gemeinsamer Konzepte oder Entitäten (beispielsweise Personen, Orte und Ereignisse) zwischen Fotos und Text stellen einen wichtigen Aspekt für die Bewertung der Gesamtaussage und Bedeutung eines multimodalen Artikels dar. Automatisierte Ansätze zur Quantifizierung von Bild-Text-Beziehungen können für zahlreiche Anwendungen eingesetzt werden. Sie ermöglichen beispielsweise eine ... : In today’s information age, the World Wide Web and social media are important sources for news and information. Different modalities (in the sense of information encoding) such as photos and text are typically used to communicate news more effectively or to attract attention. Communication scientists, linguists, and semioticians have studied the complex interplay between modalities for decades and investigated, e.g., how their combination can carry additional information or add a new level of meaning. The number of shared concepts or entities (e.g., persons, locations, and events) between photos and text is an important aspect to evaluate the overall message and meaning of an article. Computational models for the quantification of image-text relations can enable many applications. For example, they allow for more efficient exploration of news, facilitate semantic search and multimedia retrieval in large (web) archives, or assist human assessors in evaluating news for credibility. To date, only a few ...
|
|
Keyword:
Bild-Text-Beziehungen; Bildindexierung; Computer vision; Date estimation; Deep Learning; Deep learning; Dewey Decimal Classification000 | Allgemeines, Wissenschaft000 | Informatik, Wissen, Systeme004 | Informatik; Event classification; Eventklassifikation; Face recognition; Geolocation estimation; Image indexing; Image-text relations; Maschinelles Sehen; Multimedia retrieval; Multimedia Retrieval; Nachrichtenanalyse; Natürliche Sprachverarbeitung; Natural language processing; News analytics; Personenerkennung; Schätzung des Aufnahmejahres; Schätzung des Aufnahmeortes
|
|
URL: https://dx.doi.org/10.15488/11719 https://www.repo.uni-hannover.de/handle/123456789/11812
|
|
BASE
|
|
Hide details
|
|
16 |
Currencies of recognition: What rewards and recognition do Canadian distributed medical education preceptors value? ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Monolinguals and Bilinguals’ Visual Recognition Memory of Socially Relevant Stimuli at 8-10 Months. ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Dvoice : An open source dataset for Automatic Speech Recognition on African Languages and Dialects ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Dvoice : An open source dataset for Automatic Speech Recognition on African Languages and Dialects ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Linked Open Tafsir - Rekonstruktion der Entstehungsdynamik(en) des Korans mithilfe der Netzwerkmodellierung früher islamischer Überlieferungen ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|