Template clustering
Bishop §9What
Cluster every log line into a small set of templates with
wildcards <*> for variable parts.
"User u-123 logged in" and "User u-456 logged in"
collapse to one template.
Algorithm Drain3 tree-based online clustering, refined by a small EM merge pass for near-duplicates.
Why it matters Templates are the atomic unit downstream. Over-merge and distinct events vanish; over-split and review is exhausting.
In Studio
Each row in log_templates becomes a candidate
:Event_N class with a sample log line attached.