1 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
BASE
|
|
2 |
Unsupervised quantification of entity consistency between photos and text in real-world news ...
|
|
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
|
|
BASE
|
|
3 |
Supporting an effective review of telecollaboration for second language learning by visualising the participation and engagement at Dublin City University
|
|
|
|
In: Lee, Hyowon orcid:0000-0003-4395-7702 , Scriney, Michael orcid:0000-0001-6813-2630 , Dey-Plissonneau, Aparajita and Smeaton, Alan orcid:0000-0003-1028-8389 (2021) Supporting an effective review of telecollaboration for second language learning by visualising the participation and engagement at Dublin City University. In: Virtual Exchange in Higher Education: Charting the Irish Experience, 17 Sept 2021, Online via MS Teams. (2021)
|
|
BASE
|
|
4 |
Sign and Search: Sign Search Functionality for Sign Language Lexica ...
|
|
|
|
BASE
|
|
5 |
Unsupervised Cross-Modal Audio Representation Learning from Unstructured Multilingual Text ...
|
|
|
|
BASE
|
|
6 |
Recommending Themes for Ad Creative Design via Visual-Linguistic Representations ...
|
|
|
|
BASE
|
|
7 |
Fuzzy Logic Based Integration of Web Contextual Linguistic Structures for Enriching Conceptual Visual Representations ...
|
|
|
|
BASE
|
|
8 |
MusicTM-Dataset for Joint Representation Learning among Sheet Music, Lyrics, and Musical Audio ...
|
|
|
|
BASE
|
|
9 |
Utilization of multimodal interaction signals for automatic summarisation of academic presentations
|
|
Curtis, Keith. - : Dublin City University. School of Computing, 2018
|
|
In: Curtis, Keith (2018) Utilization of multimodal interaction signals for automatic summarisation of academic presentations. PhD thesis, Dublin City University. (2018)
|
|
BASE
|
|
10 |
Multimodal Machine Translation with Reinforcement Learning ...
|
|
|
|
BASE
|
|
11 |
ImproteK: introducing scenarios into human-computer music improvisation
|
|
|
|
In: ACM Computers in Entertainment ; https://hal.archives-ouvertes.fr/hal-01380163 ; ACM Computers in Entertainment, 2017, ⟨10.1145/3022635⟩ (2017)
|
|
BASE
|
|
12 |
Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690581 ; Multimedia Tools and Applications, Springer Verlag, 2017, 76 (21), pp.22547 - 22567. ⟨10.1007/s11042-017-4730-x⟩ (2017)
|
|
BASE
|
|
13 |
Enabling Embodied Analogies in Intelligent Music Systems ...
|
|
|
|
BASE
|
|
14 |
Narrative Smoothing: Dynamic Conversational Network for the Analysis of TV Series Plots
|
|
|
|
In: DyNo: 2nd International Workshop on Dynamics in Networks, in conjunction with the 2016 IEEE/ACM International Conference ASONAM ; https://hal.archives-ouvertes.fr/hal-01276708 ; DyNo: 2nd International Workshop on Dynamics in Networks, in conjunction with the 2016 IEEE/ACM International Conference ASONAM, Aug 2016, San Francisco, United States. pp.1111-1118, ⟨10.1109/ASONAM.2016.7752379⟩ (2016)
|
|
BASE
|
|
16 |
Hierarchical topic structuring: from dense segmentation to topically focused fragments via burst analysis
|
|
|
|
In: Recent Advances on Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01186443 ; Recent Advances on Natural Language Processing, 2015, Hissar, Bulgaria (2015)
|
|
BASE
|
|
17 |
Temporal re-scoring vs. temporal descriptors for semantic indexing of videos
|
|
|
|
In: 13th International Workshop on Content-Based Multimedia Indexing (CBMI) ; https://hal.archives-ouvertes.fr/hal-01230719 ; 13th International Workshop on Content-Based Multimedia Indexing (CBMI), Jun 2015, Prague, Czech Republic. pp.1-4, ⟨10.1109/CBMI.2015.7153626⟩ (2015)
|
|
BASE
|
|
18 |
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology ...
|
|
|
|
BASE
|
|
19 |
Novel perspectives and approaches to video summarization
|
|
Guan, Genliang. - : The University of Sydney, Faculty of Engineering and Information Technologies, School of Information Technologies, 2015
|
|
Abstract:
The increasing volume of videos requires efficient and effective techniques to index and structure them. Video summarization is one such technique: it extracts the essential information from a video so that tasks such as comprehension by users and video content analysis can be conducted more effectively and efficiently. The research presented in this thesis investigates three novel perspectives on the video summarization problem and provides an approach for each. Our first perspective is to employ local keypoints to perform keyframe selection. Two criteria, namely Coverage and Redundancy, are introduced to guide the keyframe selection process in order to identify the keyframes that represent the maximum video content and share the minimum redundancy. To deal efficiently with long videos, a top-down strategy is proposed, which splits the summarization problem into two sub-problems: scene identification and scene summarization. Our second perspective is to formulate the task of video summarization as a problem of sparse dictionary reconstruction. Our method utilizes the true sparse constraint, the L0 norm, instead of the relaxed L2,1 norm constraint, such that keyframes are directly selected as a sparse dictionary that can reconstruct the video frames. In addition, a Percentage Of Reconstruction (POR) criterion is proposed to intuitively guide users in selecting an appropriate summary length. Furthermore, an L2,0 constrained sparse dictionary selection model is proposed to further verify the effectiveness of sparse dictionary reconstruction for video summarization. Lastly, we investigate the multi-modal perspective of multimedia content summarization and enrichment. There are abundant images and videos on the Web, so it is highly desirable to organize such resources effectively for textual content enrichment. With the support of web-scale images, our proposed system, namely StoryImaging, is capable of enriching arbitrary textual stories with visual content.
|
|
Keyword:
image processing; multimedia; video retrieval; video summarization
|
|
URL: http://hdl.handle.net/2123/13550
|
|
BASE
|
|
20 |
Planning Human-Computer Improvisation
|
|
|
|
In: International Computer Music Conference ; https://hal.archives-ouvertes.fr/hal-01053834 ; International Computer Music Conference, Sep 2014, Athens, Greece ; http://icmc14-smc14.net (2014)
|
|
BASE
|
|
|
|