Imlement tests for abstract structures such as in Curcuits thread

**duration**: scalable, can be both 175 and 350 hours
**mentor**: @oserikov , Ryan Teehan 
**difficulty**: medium
**requirements**: 
1. PyTorch
2. sklearn
3. experience with re-using the academic code
4. experience with Transformer Language models


**useful links**:
- [The Circuits thread](https://distill.pub/2020/circuits/)
- [A Primer on Bertology](https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00349/96482/A-Primer-in-BERTology-What-We-Know-About-How-BERT)



## Idea Description: 
In Circuits, several abstract structures found in CV models were summarized. The Branches Specialization tendency of the CV neural networks, as well as the Weight Banding property of NNs last layers, have not been directly studied in LLMs, though the findings of several papers ([1](https://aclanthology.org/2020.emnlp-main.395/), [2](https://aclanthology.org/P19-1452/)) could be related.

The task is to perform a study of the abstract structures representedness in CV and NLP models, by applying the same inspection techniques to both groups of models. This would allow answering the following questions. Are the abstract structures identified for CV also found in NLP models? How do these structures differ among NLP models?

### Coding Challenge
Reproduce the Branch Specialization core insight: demonstrate [the presence of the branch specialization in the CV model](https://distill.pub/2020/circuits/branch-specialization/); Reproduce the [Individual Neurons](https://aclanthology.org/2020.emnlp-main.395/) analysis on BERT model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Imlement tests for abstract structures such as in Curcuits thread #1

Idea Description:

Coding Challenge

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Imlement tests for abstract structures such as in Curcuits thread #1

Description

Idea Description:

Coding Challenge

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions