facebook/VGGT-1B
Note
This is a remotely-sourced model from the vggt plugin, maintained by the community. It is not part of FiftyOne core and may have special installation requirements. Please review the plugin documentation and license before use.
Visual Geometry Grounded Transformer (VGGT) is a feed-forward neural network that directly infers all key 3D attributes of a scene.
Details
Model name: facebook/VGGT-1B
Model source: huggingface.co/facebook/vggt-1b
Model author: Meta AI
Model license: CC BY-NC 4.0
Exposes embeddings? no
Tags: depth, threed, keypoints
Requirements
Packages: vggt@git+https://github.com/facebookresearch/vggt.git, torch, torchvision, open3d
CPU support: yes
GPU support: yes
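Because this model depends on packages that are not installed with FiftyOne, it can help to confirm they are importable before loading the model. The following is a minimal sketch, not part of the plugin: the helper `find_missing_modules` is a hypothetical name, and it assumes each package's import name matches the package name listed above.

```python
import importlib.util

# Assumption: import names match the package names listed under Requirements.
REQUIRED_MODULES = ("vggt", "torch", "torchvision", "open3d")


def find_missing_modules(names=REQUIRED_MODULES):
    """Return the names in `names` that cannot be located for import."""
    missing = []
    for name in names:
        try:
            spec = importlib.util.find_spec(name)
        except (ImportError, ValueError):
            # Treat lookup errors the same as "not installed".
            spec = None
        if spec is None:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

The check only locates each module without importing it, so it stays fast even when heavy packages such as torch are installed.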
Example usage
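import fiftyone as fo
import fiftyone.zoo as foz

# Register the community plugin repository as a remote zoo model source
foz.register_zoo_model_source("https://github.com/harpreetsahota204/vggt")

# Load 50 randomly chosen samples from the COCO-2017 validation split
dataset = foz.load_zoo_dataset(
    "coco-2017",
    split="validation",
    dataset_name=fo.get_default_dataset_name(),
    max_samples=50,
    shuffle=True,
)

# Load the model from the registered source
model = foz.load_zoo_model("facebook/VGGT-1B")

# Run inference and store the outputs under the "predictions" field
dataset.apply_model(model, label_field="predictions")

# Browse the results in the FiftyOne App
session = fo.launch_app(dataset)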