Storage Backend
Local Paths
AWS S3 Configuration
CSV Configuration
Hugging Face Dataset
Stream data instead of downloading
Comma-separated list of columns, joined to form the caption
Column for lyrics (used by lyric encoder)
Webshart Dataset
Pass the metadata repo id; Webshart follows shard subfolders like data/.
Cache Directory
Local path for storing VAE latent cache
Embed Options
Keep data backend cache between runs
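
The caption field under Hugging Face Dataset takes a comma-separated list of column names whose values are joined into one caption. The snippet below is a minimal sketch of that idea, not the tool's actual implementation: the function name `build_caption`, the `", "` separator, and the choice to skip empty or missing columns are all assumptions made for illustration.

```python
def build_caption(row, caption_columns, separator=", "):
    """Join the named columns of one row into a single caption string.

    `caption_columns` is the comma-separated string entered in the form.
    The separator and the skipping of empty values are assumptions.
    """
    # Split the user-supplied string into clean column names.
    names = [c.strip() for c in caption_columns.split(",") if c.strip()]

    parts = []
    for name in names:
        value = row.get(name)
        if value is None:
            continue  # column missing from this row
        text = str(value).strip()
        if text:
            parts.append(text)
    return separator.join(parts)


# Hypothetical dataset row; the column names are invented for this example.
row = {"title": "Night Drive", "genre": "synthwave", "mood": "", "lyrics": "City lights fade"}
caption = build_caption(row, "title, genre, mood")
print(caption)
```

The lyrics column is left out of the caption on purpose here, since the form gives it its own field.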