Skip to content

New attention masking layersΒ #86

@soran-ghaderi

Description

@soran-ghaderi

Description: We need to implement several new attention masking layers in our model to improve its performance on specific tasks. The following masking layers need to be implemented:

It is important to carefully consider the design and implementation of these masking layers to ensure they are effective and efficient.

Deadline for each layer: 2 weeks after opening the issue. After the deadline, the issue opened will be closed to make it available for other contributors.

Metadata

Metadata

Assignees

No one assigned

    Labels

    StaleenhancementNew feature or requestgood first issueGood for newcomershelp wantedExtra attention is neededissue listA list of issues closely relatedjaxRelated to JAXnumpyRelated to NumpypytorchRelated to PytorchtensorflowRelated to Tensorflow

    Type

    No type

    Projects

    Status

    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions