EXPLANATIONS

Method 1 works: all MAX_ACTIVATING_TOKENS involve the token “nob” attending to itself, indicating the neuron fires on the “nob” (nobility) token. [nob] => [nob]

INFERENCE

IFRAME INTEGRATION

NEGATIVE LOGITS
olis-17.875
.Option-15.938
-15.813
_amen-15.625
-15.438
-15.438
_partisan-15.250
_Pradesh-15.250
than-15.125
ój-15.125
POSITIVE LOGITS
14.000
14.000
_Dutch13.875
chw13.813
_lut13.750
ivan13.625
13.625
_illeg13.500
_noble13.188
典型13.125
ACTIVATION TIMES 0.0157%
Activation histogram is not available.
Logits histogram is not available.
TOPACTIVATIONS
SUBSAMPLING80%
SUBSAMPLING60%
SUBSAMPLING40%
NONACTIVATING