{% extends "_base.html" %} {% block content %}
Workflow · fine-tune

Fine-tune a LoRA

Apply a LoRA adapter on top of any OptIQ quant. Runs MLX-natively; takes advantage of OptIQ's sensitivity data via rank-scaling.

{# Step 1: pick base #}

Base model

{# Step 2: dataset #}

Dataset

Path to a directory containing train.jsonl (and optionally valid.jsonl). mlx-lm accepts the standard shapes: {"text": ...}, {"prompt": ..., "completion": ...}, or {"messages": [...]}.

{# Step 3: hyperparams #}

Hyperparameters

DPO uses the same adapted model with adapter scale temporarily zeroed for the reference forward pass — no second model load.
{# Step 4: live training #}

Training

{# Loss chart — minimal SVG sparkline #}

  
{# Step 5: push #}

Done

LoRA adapter saved to:



    

Push to Hugging Face

Train another
{% endblock %}