Model Zoo#

Welcome to the FiftyOne Model Zoo! 🚀

Here you’ll discover state-of-the-art computer vision models, pre-trained on various datasets and ready to use with your FiftyOne datasets.

The FiftyOne Model Zoo provides access to a curated collection of models from popular frameworks like PyTorch and TensorFlow, enabling you to quickly apply cutting-edge computer vision techniques to your data.


alexnet-imagenet-torch

Classic neural network that recognizes images and helped launch the deep learning revolution

Classification,Embeddings,Logits,Imagenet,PyTorch,Alexnet,Official

centernet-hg104-1024-coco-tf2

Finds objects in high-resolution photos by pinpointing their centers with exceptional accuracy and speed

Detection,Coco,TensorFlow-2,Centernet

centernet-hg104-512-coco-tf2

Efficient object finder optimized for medium-resolution images to run faster on regular computers

Detection,Coco,TensorFlow-2,Centernet

centernet-mobilenet-v2-fpn-512-coco-tf2

Lightweight object detector that runs smoothly on phones and other portable devices

Detection,Coco,TensorFlow-2,Centernet,Mobilenet

centernet-resnet101-v1-fpn-512-coco-tf2

Advanced object finder with deeper processing for more accurate results in challenging scenes

Detection,Coco,TensorFlow-2,Centernet,Resnet

centernet-resnet50-v1-fpn-512-coco-tf2

Balanced object detector that works well for most everyday computer vision tasks and applications

Detection,Coco,TensorFlow-2,Centernet,Resnet

centernet-resnet50-v2-512-coco-tf2

Updated version with improved training stability for more consistent object detection across different images

Detection,Coco,TensorFlow-2,Centernet,Resnet

classification-transformer-torch

Vision transformer for image classification and custom fine-tuning on specialized datasets

Classification,Logits,Embeddings,PyTorch,Transformers,Official

clip-vit-base32-torch

Understands both images and text together, enabling search and classification using natural language descriptions

Classification,Logits,Embeddings,PyTorch,Clip,Zero-shot,Transformer,Official

convnext-base-224-torch

Base modern CNN with transformer elements for robust visual understanding

Classification,Imagenet,PyTorch,Transformers,Convnext,Official

convnext-large-224-torch

Large modern CNN demonstrating competitive performance with vision transformers

Classification,Imagenet,PyTorch,Transformers,Convnext

convnext-small-224-torch

Small modernized CNN delivering strong accuracy through architectural innovations

Classification,Imagenet,PyTorch,Transformers,Convnext,Official

convnext-tiny-224-torch

Tiny modern CNN bridging traditional convolutions with transformer-inspired improvements

Classification,Imagenet,PyTorch,Transformers,Convnext

convnext-xlarge-224-torch

Extra-large modern CNN maximizing architectural improvements for top accuracy

Classification,Imagenet,PyTorch,Transformers,Convnext,Official

deeplabv3-cityscapes-tf

Creates detailed pixel-by-pixel labels for urban scenes, helping autonomous vehicles understand their surroundings

Segmentation,Cityscapes,TensorFlow,Deeplabv3,Legacy

deeplabv3-mnv2-cityscapes-tf

Efficient street scene labeler designed to run on phones and edge devices with limited resources

Segmentation,Cityscapes,TensorFlow,Deeplabv3,Legacy

deeplabv3-resnet101-coco-torch

Labels everyday objects in images pixel by pixel for general scene understanding and analysis

Segmentation,Coco,PyTorch,Resnet,Deeplabv3,Official

deeplabv3-resnet50-coco-torch

Faster version that quickly identifies and labels objects in images for real-time applications

Segmentation,Coco,PyTorch,Resnet,Deeplabv3,Official

densenet121-imagenet-torch

Compact yet powerful classifier that delivers strong results while using minimal computational resources

Classification,Embeddings,Logits,Imagenet,PyTorch,Densenet,Official

densenet161-imagenet-torch

Dense network that achieves high accuracy for image classification and adapts well to new tasks

Classification,Embeddings,Logits,Imagenet,PyTorch,Densenet

densenet169-imagenet-torch

Deeper variant offering improved accuracy while remaining efficient enough for practical deployment

Classification,Embeddings,Logits,Imagenet,PyTorch,Densenet,Official

densenet201-imagenet-torch

Extra-deep model providing the most detailed features for complex image understanding tasks

Classification,Embeddings,Logits,Imagenet,PyTorch,Densenet,Official

depth-estimation-transformer-torch

Hugging Face Transformers model for monocular depth estimation

Depth,PyTorch,Transformers

detection-transformer-torch

Modern object detector that finds items in images without needing complex post-processing steps

Detection,Logits,Embeddings,PyTorch,Transformers,Official

dfine-large-coco-torch

D-FINE Large from "D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement" trained on COCO. Achieves 54.0% AP at 124 FPS on T4 GPU.

Detection,Coco,PyTorch,Transformers,Detr,Official

dfine-medium-coco-torch

D-FINE Medium from "D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement" trained on COCO. Mid-size real-time object detector.

Detection,Coco,PyTorch,Transformers,Detr,Official

dfine-nano-coco-torch

D-FINE Nano from "D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement" trained on COCO. Ultra-lightweight real-time object detector.

Detection,Coco,PyTorch,Transformers,Detr,Official

dfine-small-coco-torch

D-FINE Small from "D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement" trained on COCO. Balanced real-time object detector.

Detection,Coco,PyTorch,Transformers,Detr,Official

dfine-xlarge-coco-torch

D-FINE XLarge from "D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement" trained on COCO. Achieves 55.8% AP at 78 FPS on T4 GPU.

Detection,Coco,PyTorch,Transformers,Detr,Official

dinov2-vitb14-reg-torch

Enhanced image search model that resists noise and errors for more reliable similarity matching

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vitb14-torch

Creates searchable image fingerprints for finding similar pictures and organizing large photo collections

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vitg14-reg-torch

Highest-capacity dinov2 search model with maximum stability for finding images across massive diverse datasets

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vitg14-torch

Powerful image search engine that handles enormous photo collections with rich detail extraction

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vitl14-reg-torch

Large stable model for finding and grouping similar images across big databases reliably

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vitl14-torch

Large model that creates detailed image fingerprints for advanced search and automatic grouping

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vits14-reg-torch

Compact stable model for image search that runs efficiently on phones and edge devices

Embeddings,PyTorch,Dinov2,Transformer,Official

dinov2-vits14-torch

Small model enabling image search and similarity matching directly on mobile devices

Embeddings,PyTorch,Dinov2,Transformer,Official

efficientdet-d0-512-coco-tf2

Tiny object detector optimized for phones and embedded systems working with smaller images

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d0-coco-tf1

Legacy-compatible tiny object detector for older systems still running TensorFlow 1 frameworks

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d1-640-coco-tf2

Versatile object finder for medium-sized images supporting many different computer vision applications

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d1-coco-tf1

Legacy version of versatile object finder maintaining compatibility with TensorFlow 1

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d2-768-coco-tf2

Balanced object detector offering good speed and accuracy for general-purpose image analysis

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d2-coco-tf1

Legacy-compatible balanced detector for established pipelines still using TensorFlow 1

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d3-896-coco-tf2

Accurate object finder for larger images with better detection of big objects

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d3-coco-tf1

Legacy object detector maintaining compatibility for systems using TensorFlow 1

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d4-1024-coco-tf2

High-accuracy object detector for detailed images delivering precise results in complex scenes

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d4-coco-tf1

Legacy high-accuracy object detector ensuring backward compatibility with TensorFlow 1

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d5-1280-coco-tf2

Precision-focused object finder for very large images prioritizing accuracy over speed

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d5-coco-tf1

Legacy object detector with the highest accuracy in its family using TensorFlow 1

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d6-1280-coco-tf2

Deep object detector for large images achieving state-of-the-art accuracy on challenging content

Detection,Coco,TensorFlow-2,Efficientdet

efficientdet-d6-coco-tf1

Legacy deep detector maintaining top accuracy for mature TensorFlow 1 production stacks

Detection,Coco,TensorFlow-1,Efficientdet,Legacy

efficientdet-d7-1536-coco-tf2

Maximum accuracy object finder for extra-large images pushing detection quality to the limit

Detection,Coco,TensorFlow-2,Efficientdet

efficientnet-b0-imagenet-torch

Efficient image classifier optimized for mobile devices with excellent accuracy-efficiency tradeoff

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b1-imagenet-torch

Scaled efficient classifier with improved accuracy for slightly larger computational budgets

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b2-imagenet-torch

Balanced efficient model providing stronger performance while maintaining reasonable resource usage

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b3-imagenet-torch

Mid-scale efficient classifier delivering high accuracy for versatile deployment scenarios

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b4-imagenet-torch

Large efficient model with enhanced features for transfer learning applications

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b5-imagenet-torch

High-capacity efficient classifier prioritizing accuracy with available compute resources

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b6-imagenet-torch

Extended efficient model approaching state-of-the-art accuracy on challenging datasets

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

efficientnet-b7-imagenet-torch

Maximum efficient classifier pushing performance boundaries while preserving efficiency principles

Classification,Imagenet,PyTorch,Transformers,Efficientnet,Official

faster-rcnn-inception-resnet-atrous-v2-coco-tf

High-accuracy object finder that sees wider context for better detection in complex scenes

Detection,Coco,TensorFlow,Faster-rcnn,Inception,Resnet

faster-rcnn-inception-resnet-atrous-v2-lowproposals-coco-tf

Speed-optimized detector that runs faster by examining fewer regions while maintaining good accuracy

Detection,Coco,TensorFlow,Faster-rcnn,Inception,Resnet,Legacy

faster-rcnn-inception-v2-coco-tf

Compact object detector achieving real-time speeds for responsive computer vision applications

Detection,Coco,TensorFlow,Faster-rcnn,Inception

faster-rcnn-nas-coco-tf

Smart detector using NAS-designed architecture for improved object finding across diverse images

Detection,Coco,TensorFlow,Faster-rcnn

faster-rcnn-nas-lowproposals-coco-tf

Fast NAS-designed detector that speeds up processing for time-sensitive applications and deployments

Detection,Coco,TensorFlow,Faster-rcnn,Legacy

faster-rcnn-resnet101-coco-tf

Deep object detector balancing accuracy and speed for reliable performance across varied scenes

Detection,Coco,TensorFlow,Faster-rcnn,Resnet,Legacy

faster-rcnn-resnet101-lowproposals-coco-tf

Accelerated deep detector that processes fewer regions for faster results with minimal accuracy loss

Detection,Coco,TensorFlow,Faster-rcnn,Resnet,Legacy

faster-rcnn-resnet50-coco-tf

Versatile object detector suitable for everyday vision tasks in research and production environments

Detection,Coco,TensorFlow,Faster-rcnn,Resnet

faster-rcnn-resnet50-fpn-coco-torch

Multi-scale object finder that accurately detects both small and large items in images

Detection,Coco,PyTorch,Faster-rcnn,Resnet,Official

faster-rcnn-resnet50-lowproposals-coco-tf

Speed-focused detector optimized for running on embedded devices and resource-limited hardware

Detection,Coco,TensorFlow,Faster-rcnn,Resnet,Legacy

fcn-resnet101-coco-torch

Creates detailed pixel-level labels for images, identifying and outlining twenty-one different object categories

Segmentation,Coco,PyTorch,Fcn,Resnet,Official

fcn-resnet50-coco-torch

Fast image labeler that quickly identifies and outlines objects for interactive editing and annotation

Segmentation,Coco,PyTorch,Fcn,Resnet,Official

googlenet-imagenet-torch

Classic image classifier providing reliable categorization and features for various computer vision projects.

Classification,Embeddings,Logits,Imagenet,PyTorch,Googlenet,Official

group-vit-segmentation-transformer-torch

Hugging Face Transformers model for zero-shot semantic segmentation

Segmentation,Embeddings,PyTorch,Transformers,Zero-shot,Official

inception-resnet-v2-imagenet-tf1

High-accuracy image classifier with advanced architecture for precise categorization and feature extraction

Classification,Embeddings,Logits,Imagenet,TensorFlow-1,Inception,Resnet

inception-v3-imagenet-torch

Efficient image classifier delivering accurate results with useful features for transfer learning applications

Classification,Embeddings,Logits,Imagenet,PyTorch,Inception,Official

inception-v4-imagenet-tf1

Enhanced image classifier with deeper architecture improving accuracy for demanding vision tasks

Classification,Embeddings,Logits,Imagenet,TensorFlow-1,Inception,Legacy

keypoint-rcnn-resnet50-fpn-coco-torch

Finds people in images and maps their body joints for pose estimation and motion analysis

Keypoints,Coco,PyTorch,Keypoint-rcnn,Resnet,Official

mask-rcnn-inception-resnet-v2-atrous-coco-tf

Creates precise object outlines and boxes for detailed scene understanding in high-resolution images

Instances,Coco,TensorFlow,Mask-rcnn,Inception,Resnet

mask-rcnn-inception-v2-coco-tf

Fast object outliner generating masks and boxes with lower computing requirements for real-time use

Instances,Coco,TensorFlow,Mask-rcnn,Inception

mask-rcnn-resnet101-atrous-coco-tf

Enhanced object outliner providing detailed masks with better handling of large objects in scenes

Instances,Coco,TensorFlow,Mask-rcnn,Resnet,Legacy

mask-rcnn-resnet50-atrous-coco-tf

General-purpose object outliner creating masks and boxes suitable for most vision analysis tasks

Instances,Coco,TensorFlow,Mask-rcnn,Resnet,Legacy

mask-rcnn-resnet50-fpn-coco-torch

Multi-scale object outliner using advanced architecture for accurate segmentation across different object sizes

Instances,Coco,PyTorch,Mask-rcnn,Resnet,Official

med-sam-2-video-torch

Medical segmentation tool that outlines organs and structures in medical videos and 3D scans

Segment-anything,PyTorch,Zero-shot,Video,Med-sam,Transformer,Official

medsiglip-448-zero-torch

Medical SigLIP for zero-shot image classification and embeddings

Classification,PyTorch,Official,Medical,Zero-shot

mnasnet0.5-imagenet-torch

Ultra-lightweight image classifier designed by AI for running directly on phones and IoT devices

Classification,Embeddings,Logits,Imagenet,PyTorch,Mnasnet,Official

mnasnet1.0-imagenet-torch

Mobile-optimized classifier balancing size and accuracy for efficient on-device image recognition

Classification,Embeddings,Logits,Imagenet,PyTorch,Mnasnet,Official

mobilenet-v2-imagenet-tf1

Efficient mobile classifier using specialized architecture for fast image recognition on phones

Classification,Embeddings,Logits,Imagenet,TensorFlow-1,Mobilenet

mobilenet-v2-imagenet-torch

Mobile-friendly image classifier optimized for quick training and deployment on resource-limited devices

Classification,Embeddings,Logits,Imagenet,PyTorch,Mobilenet,Official

monet-zero-torch

CLIP‑based vision‑language model for zero‑shot dermatology image classification.

Classification,PyTorch,Official,Medical

omdet-turbo-swin-tiny-torch

Real-time detector that finds any object you describe in words, perfect for live video analysis

Detection,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official

open-clip-torch

Connects images with text descriptions enabling search by words and automatic content filtering systems

Classification,Logits,Embeddings,PyTorch,Clip,Zero-shot,Transformer

owlvit-base-patch16-torch

Finds any object you name in pictures using 16x16 image patches without needing specific training for those items

Detection,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official

owlvit-base-patch32-torch

Finds any object you name in pictures using efficient 32x32 image patches without needing specific training

Detection,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official

owlvit-large-patch14-torch

Large OWL-ViT zero-shot object detector with ViT-L/14 backbone. Achieves higher accuracy than base models, especially for smaller objects.

Detection,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official

pubmed-clip-vit-base-patch32

Zero-shot medical image classifier trained on biomedical image–text pairs.

Classification,PyTorch,Official,Medical,Zero-shot

resnet-v1-50-imagenet-tf1

Classic image recognition model providing reliable categorization and visual features for many applications

Classification,Embeddings,Logits,Imagenet,TensorFlow-1,Resnet,Legacy

resnet-v2-50-imagenet-tf1

Improved image classifier with smoother training process and better features for adapting to new tasks

Classification,Embeddings,Logits,Imagenet,TensorFlow-1,Resnet,Legacy

resnet101-imagenet-torch

Deep image recognition model delivering high accuracy for demanding classification and analysis tasks

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnet,Official

resnet152-imagenet-torch

Very deep classifier providing the richest visual features for precision-critical image understanding applications

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnet,Official

resnet18-imagenet-torch

Lightweight image classifier designed for fast recognition on phones and other resource-limited devices

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnet,Official

resnet34-imagenet-torch

Balanced image classifier offering good accuracy and speed for everyday computer vision needs

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnet,Official

resnet50-imagenet-torch

Most popular image recognition backbone widely used as starting point for custom vision projects

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnet,Official

resnext101-32x8d-imagenet-torch

Powerful image classifier with enhanced capacity for handling complex visual recognition challenges effectively

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnext,Official

resnext50-32x4d-imagenet-torch

Efficient advanced classifier delivering strong accuracy with reasonable computing requirements for practical deployments

Classification,Embeddings,Logits,Imagenet,PyTorch,Resnext

retinanet-resnet50-fpn-coco-torch

Fast object detector that quickly finds and boxes eighty common items in any image

Detection,Coco,PyTorch,Retinanet,Resnet

rfcn-resnet101-coco-tf

Efficient object finder producing accurate boxes for eighty object types with optimized processing speed

Detection,Coco,TensorFlow,Rfcn,Resnet,Legacy

rtdetr-l-coco-torch

Modern real-time object detector that finds items without complex post-processing for responsive applications

Detection,Coco,PyTorch,Transformer,Rtdetr,Official

rtdetr-v2-m-coco-torch

Balanced real-time object detector offering improved accuracy for production use

Detection,Coco,PyTorch,Transformers,Rtdetr,Official

rtdetr-v2-s-coco-torch

Lightweight real-time object detector optimized for speed on edge devices

Detection,Coco,PyTorch,Transformers,Rtdetr

rtdetr-x-coco-torch

High-capacity object detector delivering very precise results at speeds suitable for production use

Detection,Coco,PyTorch,Transformer,Rtdetr,Official

segformer-b0-ade20k-torch

Efficient transformer-based semantic segmentation model for scene parsing with 150 classes

Segmentation,PyTorch,Segformer,Official

segformer-b1-ade20k-torch

Balanced SegFormer model providing good accuracy-efficiency tradeoff for scene understanding

Segmentation,PyTorch,Segformer,Official

segformer-b2-ade20k-torch

Medium-sized SegFormer delivering enhanced segmentation quality for complex scenes

Segmentation,PyTorch,Segformer,Official

segformer-b3-ade20k-torch

Larger SegFormer model with improved accuracy for detailed semantic segmentation

Segmentation,PyTorch,Segformer,Official

segformer-b4-ade20k-torch

High-capacity SegFormer achieving excellent results on challenging segmentation tasks

Segmentation,PyTorch,Segformer,Official

segformer-b5-ade20k-torch

Largest SegFormer model delivering the best semantic segmentation performance in its family

Segmentation,PyTorch,Segformer,Official

segment-anything-2-hiera-base-plus-image-torch

Accurate image segmentation model for editing, labeling, and creative work with still pictures

Segment-anything,PyTorch,Zero-shot,Transformer,Official

segment-anything-2-hiera-base-plus-video-torch

Video segmentation model that tracks and outlines objects throughout clips for editing and analysis

Segment-anything,PyTorch,Zero-shot,Video,Transformer,Official

segment-anything-2-hiera-large-image-torch

High-quality image segmenter producing detailed masks for demanding professional editing and annotation tasks

Segment-anything,PyTorch,Zero-shot,Transformer,Official

segment-anything-2-hiera-large-video-torch

Advanced video segmenter providing fine object tracking throughout full videos for post-production work

Segment-anything,PyTorch,Zero-shot,Video,Transformer

segment-anything-2-hiera-small-image-torch

Fast image segmentation model that runs efficiently on laptops and edge computing devices

Segment-anything,PyTorch,Zero-shot,Transformer,Official

segment-anything-2-hiera-small-video-torch

Quick video segmentation model delivering rapid object tracking on standard graphics cards

Segment-anything,PyTorch,Zero-shot,Video,Transformer,Official

segment-anything-2-hiera-tiny-image-torch

Smallest image segmentation model offering instant results for mobile apps and embedded systems

Segment-anything,PyTorch,Zero-shot

segment-anything-2-hiera-tiny-video-torch

Tiny video segmenter enabling real-time object tracking on phones and compact devices

Segment-anything,PyTorch,Zero-shot,Video,Transformer,Official

segment-anything-2.1-hiera-base-plus-image-torch

Updated image segmenter with improved mask accuracy for everyday editing and dataset creation

Segment-anything,PyTorch,Zero-shot,Transformer,Official

segment-anything-2.1-hiera-base-plus-video-torch

Enhanced video segmenter with better tracking quality for video analysis and scene understanding

Segment-anything,PyTorch,Zero-shot,Video,Transformer,Official

segment-anything-2.1-hiera-large-image-torch

Large updated model offering even finer masks for high-resolution professional image workflows

Segment-anything,PyTorch,Zero-shot,Transformer,Official

segment-anything-2.1-hiera-large-video-torch

Large video model producing exceptionally detailed masks throughout long videos for intensive production

Segment-anything,PyTorch,Zero-shot,Video,Transformer,Official

segment-anything-2.1-hiera-small-image-torch

Balanced updated segmenter combining speed and accuracy for edge device image processing

Segment-anything,PyTorch,Zero-shot,Transformer,Official

segment-anything-2.1-hiera-small-video-torch

Improved video segmenter maintaining quick performance on compact hardware while enhancing mask quality

Segment-anything,PyTorch,Zero-shot,Video,Transformer,Official

segment-anything-2.1-hiera-tiny-image-torch

Enhanced mobile image segmenter for apps, augmented reality filters, and on-device processing

Segment-anything,PyTorch,Zero-shot,Transformer

segment-anything-2.1-hiera-tiny-video-torch

Upgraded mobile video segmenter for live effects on phones, wearables, and smart cameras

Segment-anything,PyTorch,Zero-shot,Video,Transformer

segment-anything-vitb-torch

Interactive segmentation tool that instantly outlines any object you point to or describe

Segment-anything,Sa-1b,PyTorch,Zero-shot,Transformer,Official

segment-anything-vith-torch

Highest quality segmentation model creating extremely detailed masks for research and large-scale annotation projects

Segment-anything,Sa-1b,PyTorch,Zero-shot,Transformer,Official

segment-anything-vitl-torch

Large segmentation model producing finer object outlines for professional editing and labeling workflows

Segment-anything,Sa-1b,PyTorch,Zero-shot,Transformer

segmentation-transformer-torch

Hugging Face Transformers model for semantic segmentation

Segmentation,PyTorch,Transformers,Official

shufflenetv2-0.5x-imagenet-torch

Ultra-small image classifier for tiny devices with very limited power and memory

Classification,Embeddings,Logits,Imagenet,PyTorch,Shufflenet,Official

shufflenetv2-1.0x-imagenet-torch

Mobile image classifier that works efficiently on phones with modest computing resources

Classification,Embeddings,Logits,Imagenet,PyTorch,Shufflenet,Official

siglip-base-patch16-224-torch

Hugging Face Transformers model for zero-shot image classification

Classification,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official

squeezenet-1@1.1.1-imagenet-torch

Tiny image classifier that fits in just five megabytes for embedded devices

Classification,Imagenet,PyTorch,Squeezenet

squeezenet-imagenet-torch@1.0

Ultra-compact image classifier perfect for severely resource-constrained hardware and applications

Classification,Imagenet,PyTorch,Squeezenet,Official

ssd-inception-v2-coco-tf

Real-time object finder that quickly identifies eighty common items in any image

Detection,Coco,TensorFlow,Ssd,Inception

ssd-mobilenet-v1-coco-tf

Mobile object detector that runs smoothly on phones and edge computing processors

Detection,Coco,TensorFlow,Ssd,Mobilenet,Legacy

ssd-mobilenet-v1-fpn-640-coco17

Enhanced mobile detector that better finds small objects in larger resolution images

Detection,Coco,TensorFlow-2,Ssd,Mobilenet

ssd-mobilenet-v1-fpn-coco-tf

Mobile object detector that runs smoothly on phones and edge computing processors

Detection,Coco,TensorFlow,Ssd,Mobilenet,Legacy

ssd-mobilenet-v2-320-coco17

Fast object finder optimized for quick GPU processing of smaller input images

Detection,Coco,TensorFlow-2,Ssd,Mobilenet

ssd-resnet50-fpn-coco-tf

Accurate object detector combining strong backbone with multi-scale detection for better results

Detection,Coco,TensorFlow,Ssd,Resnet,Legacy

swin-v2-base-torch

Base hierarchical transformer delivering strong results across vision tasks

Classification,Imagenet,PyTorch,Transformers,Swin-transformer,Official

swin-v2-large-torch

Large hierarchical transformer with enhanced capacity for demanding applications

Classification,Imagenet,PyTorch,Transformers,Swin-transformer,Official

swin-v2-small-torch

Small hierarchical transformer balancing efficiency and performance for practical use

Classification,Imagenet,PyTorch,Transformers,Swin-transformer,Official

swin-v2-tiny-torch

Tiny hierarchical transformer for efficient visual recognition on edge devices

Classification,Imagenet,PyTorch,Transformers,Swin-transformer,Official

vgg11-bn-imagenet-torch

Classic image classifier with stable training useful for various computer vision projects

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg11-imagenet-torch

Simple baseline image classifier valuable for research experimentation and learning purposes

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg13-bn-imagenet-torch

Deeper classic classifier providing stable training process and solid accuracy results overall

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg13-imagenet-torch

Straightforward image classifier valued for easy experimentation and model compression studies

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg16-bn-imagenet-torch

Popular feature extractor widely used for detection, style transfer, and medical imaging

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg16-imagenet-tf1

TensorFlow version of the classic image classifier supporting legacy production systems

Classification,Embeddings,Logits,Imagenet,TensorFlow-1,Vgg,Legacy

vgg16-imagenet-torch

PyTorch version of the popular classifier ready for modern deep learning workflows

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg19-bn-imagenet-torch

Deep classic model providing rich features for style transfer and interpretability analysis

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vgg19-imagenet-torch

Deep image classifier delivering detailed features for creative applications and research projects

Classification,Embeddings,Logits,Imagenet,PyTorch,Vgg,Official

vit-base-patch16-224-imagenet-torch

Modern image classifier that recognizes objects and provides useful features for various computer vision tasks

Classification,Logits,Embeddings,PyTorch,Transformers,Official

wide-resnet101-2-imagenet-torch

Extra-wide deep classifier for high-precision image recognition and advanced transfer learning tasks

Classification,Embeddings,Logits,Imagenet,PyTorch,Wide-resnet,Official

wide-resnet50-2-imagenet-torch

Wide classifier offering stronger accuracy and better features for adapting to new tasks

Classification,Embeddings,Logits,Imagenet,PyTorch,Wide-resnet,Official

yolo-nas-torch

AI-designed detector family offering three model variants for diverse deployment scenarios

Detection,PyTorch,Yolo,Official

yolo-v2-coco-tf1

Classic real-time detector finding eighty object types quickly for video analysis applications

Detection,Coco,TensorFlow-1,Yolo,Legacy

yolo11l-coco-torch

Real-time object detector balancing high accuracy with fast processing speeds effectively

Detection,Coco,PyTorch,Yolo,Official

yolo11l-seg-coco-torch

Model creating detailed object outlines for precise image editing and analysis

Instances,Coco,PyTorch,Yolo,Official

yolo11m-coco-torch

Object detector offering good balance between speed and accuracy for most applications

Detection,Coco,PyTorch,Yolo,Official

yolo11m-seg-coco-torch

Model generating object masks efficiently for everyday segmentation tasks

Instances,Coco,PyTorch,Yolo,Official

yolo11n-coco-torch

Object detector designed specifically for phones and other edge computing devices

Detection,Coco,PyTorch,Yolo,Official

yolo11n-seg-coco-torch

Edge model producing object outlines directly on phones and edge devices

Instances,Coco,PyTorch,Yolo,Official

yolo11s-coco-torch

Fast object detector ideal for systems with limited graphics processing power

Detection,Coco,PyTorch,Yolo,Official

yolo11s-seg-coco-torch

Model creating object masks quickly for real-time segmentation applications

Instances,Coco,PyTorch,Yolo,Official

yolo11x-coco-torch

Object detector prioritizing accuracy over processing speed for critical applications

Detection,Coco,PyTorch,Yolo,Official

yolo11x-seg-coco-torch

Model delivering high-quality object outlines for professional workflows

Instances,Coco,PyTorch,Yolo,Official

yoloe11l-seg-torch

Real-time model creating both object outlines and boxes for any described item

Instances,PyTorch,Yolo,Zero-shot,Official

yoloe11m-seg-torch

Model producing masks and boxes for objects described in natural language

Instances,PyTorch,Yolo,Zero-shot,Official

yoloe11s-seg-torch

Segments specified classes, generating object outlines and boxes for real-time applications

Instances,PyTorch,Yolo,Zero-shot,Official

yoloev8l-seg-torch

Model outlining and boxing any object you describe without specific training

Instances,PyTorch,Yolo,Zero-shot,Official

yoloev8m-seg-torch

Model creating masks for any object type you name in text

Instances,PyTorch,Yolo,Zero-shot,Official

yoloev8s-seg-torch

Compact model producing outlines for objects described in words on edge devices

Instances,PyTorch,Yolo,Zero-shot,Official

yolov10l-coco-torch

Object detector with special optimizations for even faster inference on modern hardware

Detection,Coco,PyTorch,Yolo,Official

yolov10m-coco-torch

Balanced detector providing good accuracy and speed for general-purpose object detection tasks

Detection,Coco,PyTorch,Yolo,Official

yolov10n-coco-torch

Edge-optimized detector for devices with minimal computing resources available

Detection,Coco,PyTorch,Yolo,Official

yolov10s-coco-torch

Fast lightweight detector suitable for systems with limited GPU capabilities and memory

Detection,Coco,PyTorch,Yolo,Official

yolov10x-coco-torch

High-accuracy detector for demanding object detection applications and research

Detection,Coco,PyTorch,Yolo,Official

yolov5l-coco-torch

Real-time detector producing accurate results quickly for demanding vision applications

Detection,Coco,PyTorch,Yolo,Official

yolov5m-coco-torch

Real-time detector balancing good accuracy with fast processing speeds

Detection,Coco,PyTorch,Yolo,Official

yolov5n-coco-torch

Lightweight detector for edge devices needing basic object detection capabilities

Detection,Coco,PyTorch,Yolo,Official

yolov5s-coco-torch

Real-time detector delivering good results with minimal computational requirements

Detection,Coco,PyTorch,Yolo,Official

yolov5x-coco-torch

High-accuracy detector offering top precision for applications where quality is critical

Detection,Coco,PyTorch,Yolo,Official

yolov8l-coco-torch

Real-time detector with advanced architecture for improved object finding in complex scenes

Detection,Coco,PyTorch,Yolo,Official

yolov8l-obb-dotav1-torch

Specialized detector that finds rotated objects in aerial and satellite imagery accurately

Detection,PyTorch,Yolo,Polylines,Obb,Official

yolov8l-oiv7-torch

General-purpose detector trained on diverse images recognizing over six hundred object categories

Detection,Oiv7,PyTorch,Yolo,Official

yolov8l-seg-coco-torch

Creates precise object outlines for detailed image editing and analysis tasks

Instances,Coco,PyTorch,Yolo,Official

yolov8l-world-torch

Finds and boxes any object you describe using natural language prompts

Detection,PyTorch,Yolo,Zero-shot,Official

yolov8m-coco-torch

Detector balancing speed and accuracy for everyday object detection needs

Detection,Coco,PyTorch,Yolo,Official

yolov8m-obb-dotav1-torch

Finds rotated bounding boxes in aerial images for mapping and surveillance applications

Detection,PyTorch,Yolo,Polylines,Obb,Official

yolov8m-oiv7-torch

Versatile detector recognizing hundreds of different object types across varied image domains

Detection,Oiv7,PyTorch,Yolo,Official

yolov8m-seg-coco-torch

Generates object masks with good balance of speed and quality

Instances,Coco,PyTorch,Yolo,Official

yolov8m-world-torch

Detector understanding text descriptions to find matching objects in images

Detection,PyTorch,Yolo,Zero-shot,Official

yolov8n-coco-torch

Edge-optimized detector recognizing common objects on resource-limited devices effectively

Detection,Coco,PyTorch,Yolo,Official

yolov8n-obb-dotav1-torch

Lightweight detector for finding rotated objects in aerial imagery on edge hardware

Detection,PyTorch,Yolo,Polylines,Obb,Official

yolov8n-oiv7-torch

Edge-friendly detector recognizing hundreds of object categories on resource-limited devices effectively

Detection,Oiv7,PyTorch,Yolo,Official

yolov8n-seg-coco-torch

Edge-optimized model producing object outlines on devices with limited resources.

Instances,Coco,PyTorch,Yolo,Official

yolov8s-coco-torch

Detector offering fast performance on mid-range graphics cards and processors

Detection,Coco,PyTorch,Yolo,Official

yolov8s-obb-dotav1-torch

Efficiently finds rotated objects in aerial photos for mapping and analysis tasks

Detection,PyTorch,Yolo,Polylines,Obb,Official

yolov8s-oiv7-torch

Compact detector recognizing diverse object types across many different image categories

Detection,Oiv7,PyTorch,Yolo,Official

yolov8s-seg-coco-torch

Fast model creating object masks for real-time image segmentation needs

Instances,Coco,PyTorch,Yolo,Official

yolov8s-world-torch

Lightweight detector finding objects based on text descriptions for edge applications

Detection,PyTorch,Yolo,Zero-shot,Official

yolov8x-coco-torch

High-accuracy detector for critical applications where precision matters most

Detection,Coco,PyTorch,Yolo,Official

yolov8x-obb-dotav1-torch

High-precision detector for rotated objects in aerial and satellite imagery analysis

Detection,PyTorch,Yolo,Polylines,Obb,Official

yolov8x-oiv7-torch

Accurate general detector recognizing over six hundred different object types

Detection,Oiv7,PyTorch,Yolo,Official

yolov8x-seg-coco-torch

High-accuracy model generating detailed object outlines for demanding professional applications

Instances,Coco,PyTorch,Yolo,Official

yolov8x-world-torch

Open-vocabulary detector with high accuracy for text-based object finding

Detection,PyTorch,Yolo,Zero-shot,Official

yolov9c-coco-torch

Detector enhanced with transformer technology for improved object finding capabilities

Detection,Coco,PyTorch,Yolo,Official

yolov9c-seg-coco-torch

Compact model producing both masks and boxes with transformer-enhanced accuracy

Instances,Coco,PyTorch,Yolo,Official

yolov9e-coco-torch

Advanced detector with transformer backbone delivering superior accuracy for complex scenes

Detection,Coco,PyTorch,Yolo,Official

yolov9e-seg-coco-torch

Advanced model creating precise object outlines using enhanced transformer architecture

Instances,Coco,PyTorch,Yolo,Official

zero-shot-classification-transformer-torch

Finds any object you name in images without requiring training on those specific items

Classification,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official

zero-shot-detection-transformer-torch

Hugging Face Transformers model for zero-shot object detection

Detection,Logits,Embeddings,PyTorch,Transformers,Zero-shot,Official