Spatial Grounding
Enable per-entity bounding box annotations for spatial grounding. Requires .bbox sidecar files alongside images.
Auto-Detect Bounding Boxes
Automatically generate .bbox sidecar files using Florence-2. Images that already have .bbox files will be skipped.
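The skip behavior described above can be pictured with a minimal sketch. It assumes the sidecar shares the image's base name (photo.jpg pairs with photo.bbox) and that images use a handful of common extensions. Neither detail is confirmed here, and the function name images_missing_bbox is hypothetical.

```python
from pathlib import Path

# Assumed set of image extensions; the real tool may accept others.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}


def images_missing_bbox(folder):
    """Return image paths in `folder` that have no .bbox sidecar yet.

    Assumption: the sidecar has the same stem as the image,
    e.g. photo.jpg -> photo.bbox.
    """
    pending = []
    for path in sorted(Path(folder).iterdir()):
        if path.suffix.lower() not in IMAGE_EXTENSIONS:
            continue  # not an image
        if path.with_suffix(".bbox").exists():
            continue  # already annotated, so skip it
        pending.append(path)
    return pending
```

Only the paths returned by such a check would need to be sent to the detector.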
Auto-detect requires a local backend.
Hugging Face model ID of the Florence-2 model to use.
Comma-separated labels for guided detection. Leave empty for automatic captioning and grounding.
Number of images per inference batch.
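As a rough illustration of how the label and batch-size settings might be consumed, the sketch below splits a comma-separated label string and groups image paths into fixed-size batches. The helper names parse_labels and batched are hypothetical and not part of the tool. An empty label list stands for the "leave empty" case, where automatic captioning and grounding would be used instead.

```python
def parse_labels(text):
    """Split a comma-separated string into clean labels, dropping blanks."""
    return [label.strip() for label in text.split(",") if label.strip()]


def batched(items, batch_size):
    """Yield consecutive slices of `items`, each holding at most `batch_size` entries."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


labels = parse_labels("person, dog, ,bicycle")
guided = bool(labels)  # False would mean: fall back to automatic captioning and grounding

for batch in batched(["a.jpg", "b.jpg", "c.jpg"], batch_size=2):
    pass  # each batch would go through one inference call
```

A larger batch size generally trades memory for throughput, so the right value depends on the local hardware.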