Hits 1 – 5 of 5

1	Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering ...
	Dang, Long Hoang; Le, Thao Minh; Le, Vuong. - : arXiv, 2021
	BASE

2	Object-Centric Representation Learning for Video Question Answering ...
	Dang, Long Hoang; Le, Thao Minh; Le, Vuong; Tran, Truyen. - : arXiv, 2021
	Abstract: Video question answering (Video QA) presents a powerful testbed for human-like intelligent behaviors. The task demands new capabilities to integrate video processing, language understanding, binding abstract linguistic concepts to concrete visual artifacts, and deliberative reasoning over spacetime. Neural networks offer a promising approach to reach this potential through learning from examples rather than handcrafting features and rules. However, neural networks are predominantly feature-based: they map data to unstructured vectorial representations and thus can fall into the trap of exploiting shortcuts through surface statistics instead of the true systematic reasoning seen in symbolic systems. To tackle this issue, we advocate for object-centric representation as a basis for constructing spatio-temporal structures from videos, essentially bridging the semantic gap between low-level pattern recognition and high-level symbolic algebra. To this end, we propose a new query-guided representation framework to ... : Accepted by IJCNN 2021 ...
	Keyword: Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences
	URL: https://arxiv.org/abs/2104.05166 https://dx.doi.org/10.48550/arxiv.2104.05166
	BASE

3	Hierarchical Conditional Relation Networks for Multimodal Video Question Answering ...
	Le, Thao Minh; Le, Vuong; Venkatesh, Svetha. - : arXiv, 2020
	BASE

4	Dynamic Language Binding in Relational Visual Reasoning ...
	Le, Thao Minh; Le, Vuong; Venkatesh, Svetha. - : arXiv, 2020
	BASE

5	Hierarchical Conditional Relation Networks for Video Question Answering ...
	Le, Thao Minh; Le, Vuong; Venkatesh, Svetha. - : arXiv, 2020
	BASE
