GitHub Repo

Find Duplicates#

Find and remove duplicate or near-duplicate images from a FiftyOne dataset using brain similarity.

Install#

curl -sL skil.sh | sh -s -- voxel51/fiftyone-skills

When prompted, select fiftyone-find-duplicates from the menu.

Requirements#

Usage#

Load a dataset in FiftyOne, then ask your AI assistant:

"Find duplicate images in my dataset"
"Remove near-duplicates and keep only unique samples"
"Show me which images are visually similar"

The skill computes similarity embeddings, identifies duplicates above a configurable threshold, and lets you review them in the App before deleting.

Example#

import fiftyone as fo
import fiftyone.zoo as foz

dataset = foz.load_zoo_dataset("quickstart")

Then ask your assistant:

"Find near-duplicate images in the quickstart dataset"

See also#