
Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
Utilising visual attention cues for vehicle detection and tracking
In: Hu, Feiyan orcid:0000-0001-7451-6438, Gurram Munirathnam, Venkatesh orcid:0000-0002-4393-9267, O'Connor, Noel E. orcid:0000-0002-4033-9135, Smeaton, Alan F. orcid:0000-0003-1028-8389 and Little, Suzanne orcid:0000-0003-3281-3471 (2021) Utilising visual attention cues for vehicle detection and tracking. In: 25th International Conference on Pattern Recognition (ICPR2020), 10-15 Jan 2021, Milan, Italy (Online).
BASE
2
Dublin City University participation in the VTT track at TRECVid 2017
In: Afli, Haithem orcid:0000-0002-7449-4707, Hu, Feiyan orcid:0000-0001-7451-6438, Du, Jinhua orcid:0000-0002-3267-4881, Cosgrove, Daniel, McGuinness, Kevin orcid:0000-0003-1336-6477, O'Connor, Noel E. orcid:0000-0002-4033-9135, Arazo Sánchez, Eric, Zhou, Jiang orcid:0000-0002-3067-8512 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2017) Dublin City University participation in the VTT track at TRECVid 2017. In: TRECVid workshop, 13-15 Nov 2017, Gaithersburg, Md., USA.
Abstract: Dublin City University participated in the video-to-text caption generation task in TRECVid and this paper describes the three approaches we took for our four submitted runs. The first approach is based on extracting regularly-spaced keyframes from a video, generating a text caption for each keyframe and then combining the keyframe captions into a single caption. The second approach is based on detecting image crops from those keyframes using a saliency map to include as much of the attractive part of the image as possible, generating a caption for each crop in each keyframe, and combining the captions into one. The third approach is an end-to-end system, a true deep learning submission based on MS-COCO, an externally available set of training captions. The paper presents a description and the official results of each of the approaches.
Keyword: Artificial intelligence; Computational linguistics; Digital video; Image processing; Video captions
URL: http://doras.dcu.ie/22155/
BASE
3
Formulating queries for collecting training examples in visual concept classification
In: Albatal, Rami orcid:0000-0002-9269-8578, McGuinness, Kevin orcid:0000-0003-1336-6477, Hu, Feiyan orcid:0000-0001-7451-6438 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2014) Formulating queries for collecting training examples in visual concept classification. In: Proceedings of the Third Workshop on Vision and Language, 23 Aug 2014, Dublin, Ireland. ISBN 978-1-873769-28-1
BASE

Hits by source: Open access documents 3; Catalogues 0; Bibliographies 0; Linked Open Data catalogues 0; Online resources 0