{% extends "base.html" %} {% block title %}Start Interpretation · Sparsifire{% endblock %} {% block content %}
ollama serve
This process uses the Auto-Interpretability method described by O'Neill et al. (2024).
It gives the LLM a contrastive set of documents for each latent feature, the top-K activating documents versus randomly sampled non-activating documents, and asks it to distill the feature's semantic meaning.
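As a rough illustration of the contrastive setup described above, the following Python sketch selects the top-K activating documents and a random sample of non-activating ones for a single feature, then assembles a prompt. The function name `build_contrastive_prompt`, its parameters, the prompt wording, and the choice to treat "non-activating" as an activation of exactly zero are all assumptions made for this example; they are not taken from O'Neill et al. (2024) or from Sparsifire's actual code.

```python
import random


def build_contrastive_prompt(documents, activations, k=5, n_random=5, seed=0):
    """Build a contrastive prompt for one latent feature (hypothetical helper).

    documents   -- list of document strings
    activations -- list of floats, one activation per document for this feature
    """
    # Rank document indices by activation, highest first.
    ranked = sorted(range(len(documents)), key=lambda i: activations[i], reverse=True)
    # Keep at most k documents, and only those that actually activate.
    top_idx = [i for i in ranked[:k] if activations[i] > 0]

    # Assumption: "non-activating" means the activation is exactly zero.
    inactive = [i for i in range(len(documents)) if activations[i] == 0]
    rng = random.Random(seed)  # seeded so the sample is reproducible
    neg_idx = rng.sample(inactive, min(n_random, len(inactive)))

    lines = ["Documents that strongly activate the feature:"]
    lines += [f"- {documents[i]}" for i in top_idx]
    lines.append("Documents that do not activate the feature:")
    lines += [f"- {documents[i]}" for i in neg_idx]
    lines.append("In one sentence, describe what the activating documents share "
                 "that the non-activating documents lack.")
    return "\n".join(lines), top_idx, neg_idx
```

The returned prompt string could then be sent to whichever LLM the local Ollama server is hosting; that call is left out here because the model name and request settings depend on the deployment.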