How does Mamba organize knowledge?
These scatter plots show where each of the 54 concept probes lands in the model's representation space, at the final layer of training (step 5000). Each dot is one concept; colors indicate domain.
Does curriculum ordering produce different geometric structure in an SSM?
Both models use PCA to project 768 dimensions down to 2. The shuffled model is projected into the sequenced model's eigenvector space, then Procrustes-rotated, so the two panels are directly comparable. Tight clusters of same-color dots indicate domain organization; scattered mixing indicates none.
Sequenced
Shuffled
Concept inventory (click to highlight)
Each dot is one word — a probe concept the model has seen during training. We feed that word to the trained model and record its internal activation at each layer.
Colors are domains. Teal = physical world (mass, velocity, friction). Yellow = institutional (contract, authority, property). Red = moral (fairness, harm, duty).
Clusters indicate organization. If same-color dots group together, the model has learned that those concepts are related. If colors are mixed randomly, the model treats them as unrelated.
Analogy: imagine plotting people by where they live, using GPS. A model with good geographic structure would cluster neighbors together. A model without it would scatter them randomly across the map.
GPT (attention) builds clusters. Mamba (SSM) does not — regardless of training order. Both conditions show negative silhouette throughout training, meaning domains are actively anti-clustered.