MILAN
- Natural Language Descriptions of Deep Visual Features
- Some neurons in deep networks specialize in recognizing highly specific perceptual, structural, or semantic Features.md of inputs
- identifying neurons that respond to individual concept categories
- richer characterization of neuron-level computation
- mutual-information-guided linguistic annotation of neurons
- generate open-ended, compositional, natural language descriptions of individual neurons in deep networks
- generates a description by searching for a natural language string that maximizes pointwise mutual information with the image regions in which the neuron is active
- MILANNOTATIONS.md
- fine-grained descriptions that capture categorical, relational, and logical structure in learned Features.md
- characterizing the distribution and importance of neurons selective for attribute, category, and relational information in vision models.
- auditing, surfacing neurons sensitive to protected categories like race and gender in models trained on datasets intended to obscure these Features.md
- editing, improving robustness in an image classifier by deleting neurons sensitive to text Features.md spuriously correlated with class labels