Multimodal Embedding Space
2AI tools in the Multimodal Embedding Space category
Microsoft KOSMOS-2
new capabilities of perceiving object descriptions (e.g., bounding boxes) and grounding text to the visual world [[HF demo]](https://huggingface.co/spaces/ydshieh/Kosmos-2) [[arxiv]](https://arxiv.org/abs/2306.14824)
...moreSkillMultimodal Embedding Space
1 dir
facebookresearch/ImageBind
A
ImageBind One Embedding Space to Bind Them All
SkillMultimodal Embedding Space
9K1 dir