Seeing is believing: Brain-inspired modular training for mechanistic interpretability
We introduce Brain-Inspired Modular Training (BIMT), a method for making neural networks
more modular and interpretable. Inspired by brains, BIMT embeds neurons in a geometric …
more modular and interpretable. Inspired by brains, BIMT embeds neurons in a geometric …
Emergent modularity in pre-trained transformers
This work examines the presence of modularity in pre-trained Transformers, a feature
commonly found in human brains and thought to be vital for general intelligence. In analogy …
commonly found in human brains and thought to be vital for general intelligence. In analogy …
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
Incorporating modular designs into neural networks demonstrates superior out-of-
generalization, learning efficiency, etc. Existing modular neural networks are generally …
generalization, learning efficiency, etc. Existing modular neural networks are generally …
Understanding the dynamics of dnns using graph modularity
There are good arguments to support the claim that deep neural networks (DNNs) capture
better feature representations than the previous hand-crafted feature engineering, which …
better feature representations than the previous hand-crafted feature engineering, which …
Unlocking Emergent Modularity in Large Language Models
Abstract Modular Neural Networks (MNNs) demonstrate various advantages over monolithic
models. Existing MNNs are generally explicit: their modular architectures are pre-defined …
models. Existing MNNs are generally explicit: their modular architectures are pre-defined …
Training Neural Networks for Modularity aids Interpretability
An approach to improve network interpretability is via clusterability, ie, splitting a model into
disjoint clusters that can be studied independently. We find pretrained models to be highly …
disjoint clusters that can be studied independently. We find pretrained models to be highly …
Much Easier Said Than Done: Falsifying the Causal Relevance of Linear Decoding Methods
Linear classifier probes are frequently utilized to better understand how neural networks
function. Researchers have approached the problem of determining unit importance in …
function. Researchers have approached the problem of determining unit importance in …
Neural Sculpting: Uncovering hierarchically modular task structure in neural networks through pruning and network analysis
Natural target functions and tasks typically exhibit hierarchical modularity--they can be
broken down into simpler sub-functions that are organized in a hierarchy. Such sub …
broken down into simpler sub-functions that are organized in a hierarchy. Such sub …
[PDF][PDF] Contributions to a methodology for the building of modular neural networks
DA Castillo Bolado - 2023 - accedacris.ulpgc.es
Contributions to a Methodology for the Building of Modular Neural Networks by David
Castillo Bolado Modularity is a powerful concept that has been long leveraged by humanity …
Castillo Bolado Modularity is a powerful concept that has been long leveraged by humanity …
[PDF][PDF] Multimedia Research (MR)
Artificial Neural Networks (ANNs) are a type of machine learning algorithms that are used to
solve problems such as medical diagnosis. In recent times, the amount of data that is …
solve problems such as medical diagnosis. In recent times, the amount of data that is …