Skip to content

How were the precomputed tensors (.pth) generated and what is the implementation of lookup_func in Mamba-2? #1

@xuboshen

Description

@xuboshen

Hello, this is a very impressive work! I’m currently trying to conduct visualization analysis on models such as Mamba-2, and I’d like to build everything from scratch. I noticed entries like

"global_channel_indexes": "./remapping_configs/precomputed_tensors/mamba2-1.3b_global_channel_indexes.pth",
"per_channel_decays": "./remapping_configs/precomputed_tensors/mamba2-1.3b_per_channel_decays.pth"

Could you please share how these .pth files were computed? I couldn’t find the corresponding scripts—would it be possible to open-source them?

Additionally, I’m very curious about the lookup_func functions. I couldn’t locate their implementations either, and I see that in the mamba2.py
file, the default values for several related parameters are set to None.

Thank you very much!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions