Vision-Language Models for Vision Tasks: A Survey Vision-Language Pretraining Methods

Debiasing vision-language models for vision tasks: a survey

In recent years, foundation Vision-Language Models (VLMs), such as CLIP [1], which empower zero-shot transfer to a wide variety of domains without fine-tuning, have led to a significant shift in ...

EurekAlert!

VLP: A survey on vision-language pre-training

Making machines respond in ways similar to humans has been a relentless goal of AI researchers. To enable machines to perceive and think, researchers propose a series of related tasks, such as face ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Debiasing vision-language models for vision tasks: a survey

VLP: A survey on vision-language pre-training

Trending now