allenai/Molmo2-8B#
Note
This is a remotely-sourced model from the molmo2 plugin, maintained by the community. It is not part of FiftyOne core and may have special installation requirements. Please review the plugin documentation and license before use.
Molmo2 is a family of open vision-language models developed by the Allen Institute for AI (Ai2) that support image, video and multi-image understanding and grounding..
Details
Model name:
allenai/Molmo2-8BModel source: https://huggingface.co/allenai/Molmo2-8B
Model author: Allen Institute for AI (Ai2)
Model license: Apache-2.0
Exposes embeddings? no
Tags:
torch, keypoints, zero-shot, video
Requirements
Packages:
huggingface-hub, transformers==4.57.1, torch, torchvision, molmo_utils, decord2, einops, accelerateCPU support
yes
GPU support
yes
Example usage
1import fiftyone as fo
2import fiftyone.zoo as foz
3
4foz.register_zoo_model_source("https://github.com/harpreetsahota204/molmo2")
5
6dataset = foz.load_zoo_dataset(
7 "coco-2017",
8 split="validation",
9 dataset_name=fo.get_default_dataset_name(),
10 max_samples=50,
11 shuffle=True,
12)
13
14model = foz.load_zoo_model("allenai/Molmo2-8B")
15
16dataset.apply_model(model, label_field="predictions")
17
18session = fo.launch_app(dataset)