MILAN

Natural Language Descriptions of Deep Visual Features
Some neurons in deep networks specialize in recognizing highly specific perceptual, structural, or semantic Features.md of inputs
identifying neurons that respond to individual concept categories
richer characterization of neuron-level computation
mutual-information-guided linguistic annotation of neurons
generate open-ended, compositional, natural language descriptions of individual neurons in deep networks
generates a description by searching for a natural language string that maximizes pointwise mutual information with the image regions in which the neuron is active
MILANNOTATIONS.md
fine-grained descriptions that capture categorical, relational, and logical structure in learned Features.md
characterizing the distribution and importance of neurons selective for attribute, category, and relational information in vision models.
auditing, surfacing neurons sensitive to protected categories like race and gender in models trained on datasets intended to obscure these Features.md
editing, improving robustness in an image classifier by deleting neurons sensitive to text Features.md spuriously correlated with class labels

Subhaditya's KB