Seeing is believing: Brain-inspired modular training for mechanistic interpretability

Z Liu, E Gan, M Tegmark - Entropy, 2023 - mdpi.com
We introduce Brain-Inspired Modular Training (BIMT), a method for making neural networks
more modular and interpretable. Inspired by brains, BIMT embeds neurons in a geometric …

Emergent modularity in pre-trained transformers

Z Zhang, Z Zeng, Y Lin, C Xiao, X Wang, X Han… - arXiv preprint arXiv …, 2023 - arxiv.org
This work examines the presence of modularity in pre-trained Transformers, a feature
commonly found in human brains and thought to be vital for general intelligence. In analogy …

Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?

Z Qiu, Z Huang, J Fu - arXiv preprint arXiv:2310.10908, 2023 - arxiv.org
Incorporating modular designs into neural networks demonstrates superior out-of-
generalization, learning efficiency, etc. Existing modular neural networks are generally …

Understanding the dynamics of dnns using graph modularity

Y Lu, W Yang, Y Zhang, Z Chen, J Chen… - … on Computer Vision, 2022 - Springer
There are good arguments to support the claim that deep neural networks (DNNs) capture
better feature representations than the previous hand-crafted feature engineering, which …

Unlocking Emergent Modularity in Large Language Models

Z Qiu, Z Huang, J Fu - Proceedings of the 2024 Conference of the …, 2024 - aclanthology.org
Abstract Modular Neural Networks (MNNs) demonstrate various advantages over monolithic
models. Existing MNNs are generally explicit: their modular architectures are pre-defined …

Training Neural Networks for Modularity aids Interpretability

S Golechha, D Cope, N Schoots - arXiv preprint arXiv:2409.15747, 2024 - arxiv.org
An approach to improve network interpretability is via clusterability, ie, splitting a model into
disjoint clusters that can be studied independently. We find pretrained models to be highly …

Much Easier Said Than Done: Falsifying the Causal Relevance of Linear Decoding Methods

L Hayne, A Suresh, H Jain, R Kumar… - arXiv preprint arXiv …, 2022 - arxiv.org
Linear classifier probes are frequently utilized to better understand how neural networks
function. Researchers have approached the problem of determining unit importance in …

Neural Sculpting: Uncovering hierarchically modular task structure in neural networks through pruning and network analysis

SM Patil, L Michael, C Dovrolis - arXiv preprint arXiv:2305.18402, 2023 - arxiv.org
Natural target functions and tasks typically exhibit hierarchical modularity--they can be
broken down into simpler sub-functions that are organized in a hierarchy. Such sub …

[PDF][PDF] Contributions to a methodology for the building of modular neural networks

DA Castillo Bolado - 2023 - accedacris.ulpgc.es
Contributions to a Methodology for the Building of Modular Neural Networks by David
Castillo Bolado Modularity is a powerful concept that has been long leveraged by humanity …

[PDF][PDF] Multimedia Research (MR)

AF Egba, OR Okonkwo, BN Iduh - academia.edu
Artificial Neural Networks (ANNs) are a type of machine learning algorithms that are used to
solve problems such as medical diagnosis. In recent times, the amount of data that is …