google/gemma-4-31B-it
Note
This is a remotely-sourced model from the gemma4 plugin, maintained by the community. It is not part of FiftyOne core and may have special installation requirements. Please review the plugin documentation and license before use.
Gemma 4 31B is a dense multimodal model with 30.7B parameters that accepts text and image inputs. Supported image operations: detect, point, classify, and vqa. Video is not supported.
Details
Model name: google/gemma-4-31B-it
Model source: https://huggingface.co/google/gemma-4-31B-it
Model author: Google DeepMind
Model license: gemma (https://ai.google.dev/gemma/terms)
Exposes embeddings? no
Tags:
detection, classification, keypoints, vqa, torch, zero-shot, image, VLM
Requirements
Packages:
huggingface-hub, transformers>=4.52.0, torch, torchvision, accelerate, torchcodec
CPU support
yes
GPU support
yes
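Both CPU and GPU are listed as supported, but the 30.7B parameter count stated in the description means the weights alone are large. The sketch below is a rough weights-only estimate, not a measurement. The dtype names and bytes-per-parameter values are standard sizes chosen for illustration, and whether the plugin loads the model in any particular precision or supports quantization is an assumption to check against the plugin documentation. Activations, image features, and runtime overhead are not included.

```python
# Rough weights-only memory estimate for a dense model.
# Assumption: every parameter is stored at the given width; real
# checkpoints may mix precisions, and inference needs extra memory.

PARAMS = 30.7e9  # parameter count stated in the model description

# Bytes per parameter for common storage formats (illustrative choices;
# not confirmed as options exposed by the gemma4 plugin).
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1, "int4": 0.5}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return approximate weight storage in GB (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype:>8}: ~{weight_memory_gb(PARAMS, dtype):.1f} GB")
```

At 2 bytes per parameter the weights come to roughly 61 GB, so this estimate suggests that running on a single consumer GPU would need either a lower-precision format or weights spread across devices.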
Example usage
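import fiftyone as fo
import fiftyone.zoo as foz

# Register the community gemma4 plugin as a remote model source
foz.register_zoo_model_source("https://github.com/Burhan-Q/gemma4")

# Load 50 randomly chosen samples from the COCO 2017 validation split
dataset = foz.load_zoo_dataset(
    "coco-2017",
    split="validation",
    dataset_name=fo.get_default_dataset_name(),
    max_samples=50,
    shuffle=True,
)

# Load the model from the registered source
model = foz.load_zoo_model("google/gemma-4-31B-it")

# Run inference and store the results in the "predictions" field
dataset.apply_model(model, label_field="predictions")

# Explore the results in the FiftyOne App
session = fo.launch_app(dataset)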