Hits 1 – 20 of 218

1	Universal Segmentations 1.0 (UniSegments 1.0)
	Žabokrtský, Zdeněk; Bafna, Nyati; Bodnár, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2022
	BASE

2	Investigating alignment interpretability for low-resource NMT
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: ISSN: 0922-6567 ; EISSN: 1573-0573 ; Machine Translation ; https://hal.archives-ouvertes.fr/hal-03139744 ; Machine Translation, Springer Verlag, 2021, ⟨10.1007/s10590-020-09254-w⟩ (2021)

3	Is there a bilingual disadvantage for word segmentation? A computational modeling approach
	Fibla, Laia; Sebastian-Galles, Nuria; Cristia, Alejandrina
	In: ISSN: 0305-0009 ; EISSN: 1469-7602 ; Journal of Child Language ; https://hal.archives-ouvertes.fr/hal-03498905 ; Journal of Child Language, Cambridge University Press (CUP), 2021, pp.1-28. ⟨10.1017/S0305000921000568⟩ (2021)

4	SM to: Is there a bilingual disadvantage for word segmentation? A computational modeling approach ...
	Fibla, Laia. - : Open Science Framework, 2021

5	Early Tashelhiyt Berber word segmentation: the role of the Possible Word Constraint ...
	Elouatiq, Abdellah. - : Open Science Framework, 2021

6	Discovering structure in speech recordings: Unsupervised learning of word and phoneme like units for automatic speech recognition
	Walter, Oliver. - 2021
	In: Fraunhofer IAIS (2021)

7	Handling cross and out-of-domain samples in Thai word segmentation
	Sarwar, Raheem; Phatthiyaphaibun, Wannaphong; Nutanong, Sarana...
	In: 1003 ; 1016 (2021)

8	Measuring (online) word segmentation in adults and children
	Iris Broedelet; Paul Boersma; Judith Rispens
	In: Dutch Journal of Applied Linguistics, Vol 10 (2021)

9	Investigating Language Impact in Bilingual Approaches for Computational Language Documentation
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020) ; SLTU-CCURL workshop, LREC 2020 ; https://hal.archives-ouvertes.fr/hal-02895907 ; SLTU-CCURL workshop, LREC 2020, May 2020, Marseille, France (2020)

10	F0 Slope and Mean: Cues to Speech Segmentation in French
	Cordero Rull, Maria del Mar; Meunier, Fanny; Grimault, Nicolas...
	In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-03042331 ; Interspeech 2020, Oct 2020, Shanghai, China. pp.1610-1614, ⟨10.21437/Interspeech.2020-2509⟩ (2020)

11	The learnability consequences of Zipfian distributions: Word Segmentation is Facilitated in More Predictable Distributions ...
	Lavi-Rotbain, Ori; Arnon, Inbal. - : PsychArchives, 2020

12	Data for: The learnability consequences of Zipfian distributions: Word Segmentation is Facilitated in More Predictable Distributions ...
	Lavi-Rotbain, Ori; Arnon, Inbal. - : PsychArchives, 2020

13	The learnability consequences of Zipfian distributions: Word Segmentation is Facilitated in More Predictable Distributions ...
	Lavi-Rotbain, Ori; Arnon, Inbal. - : PsychArchives, 2020

14	Automatic word count estimation from daylong child-centered recordings in various language environments using language-independent syllabification of speech
	Soderstrom, M; Karadayi, J; Casillas, M; Riebling, E; Räsänen, O; Cristia, A; Metze, F; Seshadri, S; Rosemberg, C; Bunce, J; Bergelson, E. - : Elsevier BV, 2020
	Abstract: © 2019 The Authors. Automatic word count estimation (WCE) from audio recordings can be used to quantify the amount of verbal communication in a recording environment. One key application of WCE is to measure language input heard by infants and toddlers in their natural environments, as captured by daylong recordings from microphones worn by the infants. Although WCE is nearly trivial for high-quality signals in high-resource languages, daylong recordings are substantially more challenging due to the unconstrained acoustic environments and the presence of near- and far-field speech. Moreover, many use cases of interest involve languages for which reliable ASR systems or even well-defined lexicons are not available. A good WCE system should also perform similarly for low- and high-resource languages in order to enable unbiased comparisons across different cultures and environments. Unfortunately, the current state-of-the-art solution, the LENA system, is based on proprietary software and has only been optimized for American English, limiting its applicability. In this paper, we build on existing work on WCE and present the steps we have taken towards a freely available system for WCE that can be adapted to different languages or dialects with a limited amount of orthographically transcribed speech data. Our system is based on language-independent syllabification of speech, followed by a language-dependent mapping from syllable counts (and a number of other acoustic features) to the corresponding word count estimates. We evaluate our system on samples from daylong infant recordings from six different corpora consisting of several languages and socioeconomic environments, all manually annotated with the same protocol to allow direct comparison. We compare a number of alternative techniques for the two key components in our system: speech activity detection and automatic syllabification of speech.
	As a result, we show that our system can reach relatively consistent WCE accuracy across multiple corpora and languages (with some limitations). In addition, the system outperforms LENA on three of the four corpora consisting of different varieties of English. We also demonstrate how an automatic neural network-based syllabifier, when trained on multiple languages, generalizes well to novel languages beyond the training data, outperforming two previously proposed unsupervised syllabifiers as a feature extractor for WCE.
	Keyword: Acoustics; Automatic syllabification; Computer Science; Daylong recordings; Interdisciplinary Applications; Language acquisition; LENA(TM); Noise robustness; RELIABILITY; Science & Technology; SEGMENTATION; SYSTEM; Technology; Word count estimation
	URL: https://hdl.handle.net/10161/19710

15	Infants Segment Words from Songs—An EEG Study
	Snijders; Benders; Fikkert
	In: Brain Sciences ; Volume 10 ; Issue 1 (2020)

16	Not all words are equally acquired: transitional probabilities and instructions affect the electrophysiological correlates of statistical learning
	Soares, Ana Paula; Gutiérrez-Domínguez, Francisco-Javier; Vasconcelos, Margarida Fátima Gomes. - : Frontiers Media, 2020

17	Controlling Utterance Length in NMT-based Word Segmentation with Attention
	Godard, Pierre; Besacier, Laurent; Yvon, François
	In: International Workshop on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02343206 ; International Workshop on Spoken Language Translation, Nov 2019, Hong-Kong, China (2019)

18	Segmentability Differences Between Child-Directed and Adult-Directed Speech: A Systematic Test With an Ecologically Valid Corpus
	Cristia, Alejandrina; Dupoux, Emmanuel; Bernstein Ratner, Nan...
	In: EISSN: 2470-2986 ; Open Mind ; https://hal.archives-ouvertes.fr/hal-02274050 ; Open Mind, MIT Press, 2019, 3, pp.13-22. ⟨10.1162/opmi_a_00022⟩ (2019)

19	Unsupervised word discovery for computational language documentation ; Découverte non-supervisée de mots pour outiller la linguistique de terrain
	Godard, Pierre. - : HAL CCSD, 2019
	In: https://tel.archives-ouvertes.fr/tel-02286425 ; Artificial Intelligence [cs.AI]. Université Paris Saclay (COmUE), 2019. English. ⟨NNT : 2019SACLS062⟩ (2019)

20	MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language
	Karol Nowakowski; Michal Ptaszynski; Fumito Masui
	In: Information ; Volume 10 ; Issue 10 (2019)
