Skip to content
Change the repository type filter

All

    Repositories list

    • IPO-Mine

      Public
      A toolkit and dataset for multimodal IPO filing analysis.
      Python
      01000Updated Feb 13, 2026Feb 13, 2026
    • FLaME

      Public
      Financial Language Model Evaluation
      Python
      17471Updated Jan 29, 2026Jan 29, 2026
    • Codebase for VideoConviction, accepted at KDD 2025 (D&B Track)
      Jupyter Notebook
      11800Updated Jan 22, 2026Jan 22, 2026
    • KG-MuLQA

      Public
      HTML
      1100Updated Jan 13, 2026Jan 13, 2026
    • Jupyter Notebook
      8800Updated Nov 15, 2025Nov 15, 2025
    • tax-calc-bench

      Public archive
      Code & data for TaxCalcBench
      Python
      13000Updated Nov 6, 2025Nov 6, 2025
    • FIFE

      Public
      Financial Instruction Following Evaluation (NeurIPS 2025)
      Python
      1010Updated Nov 1, 2025Nov 1, 2025
    • FinCap

      Public
      FinCap: Topic-Aligned Captions for Short-Form Financial YouTube Videos
      Jupyter Notebook
      2300Updated Oct 20, 2025Oct 20, 2025
    • This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Communications"
      TeX
      51700Updated Oct 19, 2025Oct 19, 2025
    • ConfReady

      Public
      [EMNLP 2025 System Demonstrations] ConfReady is an easy-to-use Llama or GPT powered web interface which can be used to empower authors to reflect on their work …
      Python
      0700Updated Oct 1, 2025Oct 1, 2025
    • This is the official repository for the paper accepted at CoLM 2025 titled "Beyond the Reported Cutoff: Where Large Language Models Fall Short on Financial Know…
      Jupyter Notebook
      0200Updated Jul 29, 2025Jul 29, 2025
    • Python
      0000Updated Jul 15, 2025Jul 15, 2025
    • `transformers` based implementation of "Data Decide"
      HTML
      1101Updated Jul 15, 2025Jul 15, 2025
    • sparkjq

      Public
      A streamlined tool for launching Apache Spark clusters on SLURM-based systems with minimal setup.
      Python
      1001Updated Jun 21, 2025Jun 21, 2025
    • evalchemy

      Public
      Automatic evals for LLMs
      HTML
      78000Updated Jun 17, 2025Jun 17, 2025
    • SiDyP

      Public
      [KDD'25] This is the official code repo for our KDD'25 paper "Calibrating Pre-trained Language Classifier on LLM-generated Noisy Labels vis Iterative Refinement…
      Python
      4400Updated Jun 1, 2025Jun 1, 2025
    • Jupyter Notebook
      1800Updated Apr 9, 2025Apr 9, 2025
    • Codebase for FOMC-NLP, accepted at ACL 2023 (main)
      HTML
      216410Updated Dec 17, 2024Dec 17, 2024
    • Codebase for SubjECTive-QA, accepted at NeurIPS 2024 (D&B Track)
      Python
      3700Updated Dec 17, 2024Dec 17, 2024
    • graphrag

      Public
      A modular graph-based Retrieval-Augmented Generation (RAG) system
      Python
      3.3k000Updated Nov 18, 2024Nov 18, 2024
    • ACLReady

      Public
      ACLReady, a retrieval-augmented language model application that can be used to empower authors to reflect on their work and assist authors with the ACL checklis…
      TeX
      1600Updated Oct 27, 2024Oct 27, 2024
    • Python
      1000Updated Oct 4, 2024Oct 4, 2024
    • CoCoHD

      Public
      Jupyter Notebook
      0700Updated Oct 4, 2024Oct 4, 2024
    • Scatter Protocol: An Incentivized and Trustless Protocol for Decentralized Federated Learning - Accepted to IEEE International Conference on Blockchain
      TypeScript
      0500Updated Sep 19, 2024Sep 19, 2024
    • FiNER

      Public
      Jupyter Notebook
      31600Updated Sep 10, 2024Sep 10, 2024
    • FiNER-ORD

      Public
      Python
      0400Updated Sep 10, 2024Sep 10, 2024
    • Python
      0000Updated Aug 27, 2024Aug 27, 2024
    • textgrad

      Public
      TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
      Python
      279000Updated Aug 1, 2024Aug 1, 2024
    • Open source annotation tool for machine learning practitioners.
      Python
      1.8k000Updated Mar 10, 2024Mar 10, 2024
    • Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (SentencePiece) models
      Rust
      33000Updated Oct 1, 2023Oct 1, 2023