1 |
BBC-Oxford British Sign Language Dataset
In: https://hal.archives-ouvertes.fr/hal-03516444 (2022)
BASE
2 |
Sign Language Video Retrieval with Free-Form Textual Queries ...
BASE
4 |
Aligning Subtitles in Sign Language Videos
In: International Conference on Computer Vision (ICCV), Oct 2021, Montreal, Canada (2021) ; https://hal.archives-ouvertes.fr/hal-03515983
BASE
5 |
Sign Segmentation with Changepoint-Modulated Pseudo-Labelling
In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Jun 2021, Nashville, TN, United States. ⟨10.1109/CVPRW53098.2021.00379⟩ (2021) ; https://hal.archives-ouvertes.fr/hal-03513415
BASE
6 |
Read and Attend: Temporal Localisation in Sign Language Videos
In: 2021 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), Jun 2021, Nashville, TN, United States. ⟨10.1109/CVPR46437.2021.01658⟩ (2021) ; https://hal.archives-ouvertes.fr/hal-03513396
BASE
7 |
Sign language segmentation with temporal convolutional networks
In: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2021, Toronto, ON, Canada. ⟨10.1109/ICASSP39728.2021.9413817⟩ (2021) ; https://hal.archives-ouvertes.fr/hal-03513405
BASE
8 |
Read and Attend: Temporal Localisation in Sign Language Videos ...
Abstract:
The objective of this work is to annotate sign instances across a broad vocabulary in continuous sign language. We train a Transformer model to ingest a continuous signing stream and output a sequence of written tokens on a large-scale collection of signing footage with weakly-aligned subtitles. We show that through this training it acquires the ability to attend to a large vocabulary of sign instances in the input sequence, enabling their localisation. Our contributions are as follows: (1) we demonstrate the ability to leverage large quantities of continuous signing videos with weakly-aligned subtitles to localise signs in continuous sign language; (2) we employ the learned attention to automatically generate hundreds of thousands of annotations for a large sign vocabulary; (3) we collect a set of 37K manually verified sign instances across a vocabulary of 950 sign classes to support our study of sign language recognition; (4) by training on the newly annotated data from our method, we outperform the prior ...
Appears in: 2021 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021). 14 pages ...
Keyword:
Computer Vision and Pattern Recognition (cs.CV); FOS: Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2103.16481 https://arxiv.org/abs/2103.16481
BASE
9 |
Sign Segmentation with Changepoint-Modulated Pseudo-Labelling ...
BASE
12 |
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
In: European Conference on Computer Vision (ECCV) 2020, Aug 2020, Glasgow, United Kingdom. ⟨10.1007/978-3-030-58621-8_3⟩ (2020) ; https://hal.archives-ouvertes.fr/hal-03516489
BASE
13 |
Watch, read and lookup: learning to spot signs from multiple supervisors
In: Asian Conference on Computer Vision (ACCV) 2020, Nov 2020, Kyoto, Japan. ⟨10.1007/978-3-030-69544-6_18⟩ (2020) ; https://hal.archives-ouvertes.fr/hal-03516457
BASE
14 |
Watch, read and lookup: learning to spot signs from multiple supervisors ...
BASE
15 |
Sign language segmentation with temporal convolutional networks ...
BASE
16 |
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues ...
BASE
17 |
Disentangled Speech Embeddings using Cross-modal Self-supervision ...
BASE