{% from 'partials/hero_cta.html' import hero_cta %} {% if tab_name == 'publishing' %} {% elif tab_name == 'hardware' %} {% elif tab_name == 'model' %} {% elif tab_name == 'basic' %} {% elif tab_name == 'training' %} {% elif tab_name == 'validation' %} {% endif %} {% if prompt_libraries is defined %} {% endif %} {% if tab_name == 'publishing' %} {% elif tab_name == 'hardware' %} {% elif tab_name == 'model' %} {% elif tab_name == 'basic' %} {% elif tab_name == 'training' %} {% elif tab_name == 'validation' %} {% endif %}
Connect your HuggingFace account using an access token. Tokens are stored securely in your local HuggingFace cache (~/.cache/huggingface/token).
Configure your Hub repository ID, visibility (public/private), and whether to push checkpoints automatically during training.
Enable "Push to Hub" to automatically upload checkpoints as they're saved. Great for long training runs where you want continuous backups.
Enable Fully Sharded Data Parallel (FSDP) or DeepSpeed to split your model across multiple GPUs, reducing memory per device and enabling larger batch sizes.
Scale beyond a single machine. Note: Multi-node via WebUI is experimental — for production use, follow DISTRIBUTED.md and use the command-line approach.
Tune the number of data loading workers and PyTorch threads to optimize CPU utilization during preprocessing and training.
Get training quickly with these essential settings. This simplified view covers the core options you need to start your first training run. For advanced configuration, dismiss this panel to access the full form.
Give your training run a name so you can track it in logging platforms and find your checkpoints later.
Connect your training images. Create dataset configurations in the Datasets tab first, then select them here.
Configure how long and how fast your model learns. These settings control the training process itself—get them right and your model improves steadily; get them wrong and you waste time or damage quality.
Choose how long to train. You can specify either epochs (passes through your dataset) or a fixed number of steps. For small datasets (under 100 images), 1-3 epochs is typical. For larger datasets, you might use fewer epochs or set a step limit.
Checkpoints are snapshots of your model during training. They let you resume if training is interrupted, compare quality at different stages, and pick the best version if you overtrain.
The learning rate controls how much the model changes with each training step. Too high and training becomes unstable or "fries" the model. Too low and progress is painfully slow. This is often the most impactful hyperparameter.
The optimizer is the algorithm that actually updates model weights based on gradients. Different optimizers have different memory requirements and behaviors.
Flow-matching models (like Flux) use a noise schedule that can be shifted to change what the model focuses on learning. This only applies to flow-based architectures—ignore for standard diffusion models.
Validation generates sample images during training so you can see how your model is learning. These previews help you catch problems early—like overfitting or wrong concepts—before wasting hours on a bad run.
Control how often validation images are generated. More frequent validation gives you better visibility into training progress but adds overhead. Balance between insight and training speed.
These prompts will be used to generate preview images during training. Choose prompts that test what you're training—if you're training a character, use prompts featuring that character. If training a style, use prompts that should show that style.
Instead of a single prompt, use a library of prompts to test your model from multiple angles. Each validation run cycles through your library, giving you diverse samples to evaluate training progress.
Control how validation images are generated. Consistent seeds help you compare training progress across checkpoints; random seeds give variety but make comparison harder.
Configure the essential model settings quickly. For advanced options, dismiss this panel to access the full form.
Lower ranks (4-32) are best for style transfer and simple concepts. Train faster with less data.
Higher ranks (64-128) can learn more complex concepts but require more training steps, more data, and higher batch sizes to converge properly.
If changing rank without adjusting other settings: lower the learning rate for higher ranks, raise it for lower ranks.
{{ tab_config.description }}
{% endif %}{{ section.description }}
{% endif %} {% if tab_name == 'validation' and section.id == 'prompt_management' %} {% endif %} {% if section.get('template') %}{{ subsection.description }}
{% endif %} {% if subsection.get('template') %}