Summary: What You’ve Learned#

You’ve completed the Annotation Getting Started Guide. Here’s what you can now do.

Quickstart Track#

You learned the basics of multimodal annotation:

You built a complete data-centric annotation pipeline for multimodal data:

Step 2: Setup Splits: Cloned quickstart-groups to annotation_tutorial. Created test (frozen), val (iteration), golden (QA), and pool splits at the group level to prevent data leakage across modalities.
Step 3: Smart Selection: Used ZCore diversity scoring on camera images to select high-coverage scenes. Better than random.
Step 4: 2D Annotation: Labeled detections on left camera images with KITTI schema enforcement. Only samples with actual labels get marked as annotated.
Step 5: 3D Annotation: Annotated cuboids on point clouds using transform controls. Used camera projections to verify 3D→2D alignment.
Step 6: Train + Evaluate: Trained YOLOv8 on camera images, evaluated on val set, tagged FP/FN failures for targeting.
Step 7: Iteration: Ran Golden QA check, then selected next batch using hybrid strategy: 30% coverage + 70% targeted.

Group-level splits are non-negotiable. Without them, the same scene leaks between train and test.
Label smarter, not harder. Diversity sampling + failure targeting beats random selection.
30% coverage budget matters. Only chasing failures creates a model that fails on normal cases.
Cross-modal consistency. 2D and 3D labels should agree on the same objects.
QA before training. Golden QA checks catch annotation drift early.
Understand your failures. FP/FN analysis tells you what to label next.

In-app annotation is good for:

Use external tools (CVAT, Label Studio, Labelbox) when:

FiftyOne integrates with external annotation tools. See Integrations for details.

Questions or suggestions? Reach us at support@voxel51.com or join our Discord.