Description
duration: scalable, can be either 175 or 350 hours
mentor: @oserikov
difficulty: medium
requirements:
- pytorch
- sklearn
- Python engineering skills (OOP, design patterns)
- experience with Transformer language models
useful links:
Idea Description:
There are many interpretability tools out there, aimed at both industry and academic users.
While some are general-purpose and others are very field-specific, they all have several things in common.
One would typically apply them to HuggingFace models, and all of these methods try to explain the black boxes we work with.
What we propose, in short, is to bring the existing popular model-interpretation stack together. We have surveyed interpretability methods for LLMs and now have both a scientific and an engineering vision of what should be implemented to maximize the interpretability of existing LLMs.
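For illustration only, here is how one of these tools (Captum's LayerIntegratedGradients) is typically applied to a HuggingFace model; the checkpoint name and example sentence below are placeholders, not part of the task:

```python
# Minimal sketch: per-token attributions for an HF classifier via Captum.
import torch
from captum.attr import LayerIntegratedGradients
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "distilbert-base-uncased-finetuned-sst-2-english"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
model.eval()

def forward_func(input_ids, attention_mask):
    # Attribute the logit of the positive class.
    return model(input_ids, attention_mask=attention_mask).logits[:, 1]

enc = tokenizer("The movie was surprisingly good.", return_tensors="pt")

# Attribute the prediction to the input embeddings layer.
lig = LayerIntegratedGradients(forward_func, model.distilbert.embeddings)
attributions = lig.attribute(
    inputs=enc["input_ids"],
    additional_forward_args=(enc["attention_mask"],),
)

# One score per token: sum over the embedding dimension.
token_scores = attributions.sum(dim=-1).squeeze(0)
tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
print(list(zip(tokens, token_scores.tolist())))
```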
You will need to implement an HF-compatible interpretability aggregation API. The exact tasks to accomplish are:
- choose the most important methods provided by Captum, Interpret and NeuroX (which ones? to better understand the task, try to figure this out yourself; having done so, reach out to us ASAP and we will discuss your vision)
- implement an all-in-one interpret method that runs all the chosen ones (a rough sketch follows this list)
- perform an initial analysis of the BigScience model checkpoints
- ensure the codebase makes it easy to add new methods
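As a rough illustration of what the aggregation API could look like, here is a sketch; all class and method names are hypothetical, and choosing the actual methods and design is part of the task:

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional


@dataclass
class InterpretationResult:
    """Output of a single interpretability method for one input."""
    method: str
    scores: Any  # e.g. per-token attributions, probe accuracies, ...


class InterpretabilityPipeline:
    """Runs registered methods (e.g. thin wrappers around Captum,
    Interpret or NeuroX) over one HF model and collects their outputs."""

    def __init__(self, model, tokenizer):
        self.model = model
        self.tokenizer = tokenizer
        self._methods: Dict[str, Callable] = {}

    def register(self, name: str, fn: Callable) -> None:
        # fn(model, tokenizer, text) -> scores; new methods plug in here
        # without touching the pipeline itself.
        self._methods[name] = fn

    def interpret(
        self, text: str, methods: Optional[List[str]] = None
    ) -> List[InterpretationResult]:
        # Run either the requested subset or every registered method.
        chosen = methods if methods is not None else list(self._methods)
        return [
            InterpretationResult(name, self._methods[name](self.model, self.tokenizer, text))
            for name in chosen
        ]
```

A thin wrapper per backend library would then just call `register`, which keeps adding new methods cheap (the last task above).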
Coding Challenge
See task 1.