Page: 1 2 3 4 5 6 7 8 9... 870
81 |
AnyFace: Free-style Text-to-Face Synthesis and Manipulation ...
84 |
Local-Global Context Aware Transformer for Language-Guided Video Segmentation ...
85 |
Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships ...
88 |
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning ...
90 |
AGQA 2.0: An Updated Benchmark for Compositional Spatio-Temporal Reasoning ...
91 |
Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation ...
92 |
Domain Adaptation Meets Zero-Shot Learning: An Annotation-Efficient Approach to Multi-Modality Medical Image Segmentation ...
93 |
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning ...
94 |
Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training ...
95 |
Optimized latent-code selection for explainable conditional text-to-image GANs ...
96 |
Aesthetic Text Logo Synthesis via Content-aware Layout Inferring ...
99 |
The CLEAR Benchmark: Continual LEArning on Real-World Imagery ...
100 |
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality ...