Enter a unique name for your dataset
Regularisation datasets help preserve the base model's identity. You must also include at least one normal image or video dataset.
Add a primary image or video dataset first. Once one is queued, you can enable regularisation for supporting datasets.
Configure required settings
Cache Dataset Options
Create separate dedicated datasets for caching embeddings. This allows you to reuse cached data across multiple training datasets.
Configure Text Embeddings Cache
This dedicated dataset will store cached text embeddings that can be reused across multiple training sessions.
Tip: All image datasets in your configuration will automatically use this shared text embeddings cache.
Safe to Share: Multiple datasets can safely use the same text embeddings cache directory. Identical captions are deduplicated, which reduces disk usage.
Configure VAE Latent Cache Backend
The image_embeds dataset provides a storage backend for VAE latents. Each image dataset can point to its own cache directory within this backend.
Select Storage Backend
Local Storage
Store VAE cache on local filesystem
S3-Compatible
Store VAE cache in S3-compatible storage
S3-Compatible Storage Configuration
Note: Multiple image datasets can share the same image_embeds backend by pointing to different cache directories.
Warning: Each dataset must use a unique VAE cache directory. Sharing the same cache location across datasets will cause data corruption.
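The uniqueness rule in the warning above can be checked before training starts. The following is a minimal sketch, not the tool's own validation: the function name is hypothetical, and the dictionary keys "id" and "cache_dir_vae" are assumed names for a dataset identifier and its VAE cache location.

```python
from collections import defaultdict


def find_shared_vae_cache_dirs(datasets):
    """Return {cache_dir: [dataset ids]} for each VAE cache directory used more than once."""
    users = defaultdict(list)
    for ds in datasets:
        # "cache_dir_vae" and "id" are assumed key names for this sketch.
        cache_dir = ds.get("cache_dir_vae")
        if cache_dir is None:
            continue
        # Strip trailing slashes so "cache/a" and "cache/a/" compare equal.
        users[cache_dir.rstrip("/")].append(ds["id"])
    return {path: ids for path, ids in users.items() if len(ids) > 1}


datasets = [
    {"id": "portraits", "cache_dir_vae": "cache/vae/portraits"},
    {"id": "landscapes", "cache_dir_vae": "cache/vae/landscapes"},
    {"id": "sketches", "cache_dir_vae": "cache/vae/portraits/"},
]
conflicts = find_shared_vae_cache_dirs(datasets)
print(conflicts)  # {'cache/vae/portraits': ['portraits', 'sketches']}
```

An empty result means every dataset has its own cache directory. Any entry in the result names the datasets that would overwrite each other's latents.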
Configure image resolution settings
Using default resolution: 1024px, pixel_area
Configure image cropping behavior
Choose where crops are taken from in the image
Choose how aspect ratios are handled during cropping
Define aspect ratio buckets (e.g., 0.75 = 3:4, 1.0 = 1:1, 1.33 = 4:3)
No aspect buckets defined yet
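The bucket values in the examples above (0.75 = 3:4, 1.33 = 4:3) are consistent with width divided by height, rounded to two decimals. A small sketch under that assumption follows; the helper name aspect_bucket is hypothetical and not part of the tool.

```python
def aspect_bucket(width, height, ndigits=2):
    """Convert a width:height ratio to a decimal bucket value (assumes width / height)."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return round(width / height, ndigits)


buckets = [aspect_bucket(3, 4), aspect_bucket(1, 1), aspect_bucket(4, 3)]
print(buckets)  # [0.75, 1.0, 1.33]
```

Under this convention, values below 1.0 describe portrait images and values above 1.0 describe landscape images.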
Configure caption settings
Hugging Face datasets use the huggingface caption strategy (fixed)
Webshart datasets use the webshart caption strategy (fixed)
CSV datasets use the csv caption strategy (fixed)
How captions should be loaded for each image
This prompt will be used as the sole caption for all images in the dataset
Enter the trigger word or phrase to prepend to all captions
Enable "Prepend Instance Prompt" above to use this field
Auto-generate conditioning data for ControlNet
ControlNet training detected! Would you like to automatically generate conditioning data during training?