Skip to content
Change the repository type filter

All

    Repositories list

    • VRAG

      Public
      Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.
      Python
      3846120Updated Feb 17, 2026Feb 17, 2026
    • qqr

      Public
      qqr is an RL training framework for open-ended agents.
      Python
      2021310Updated Feb 13, 2026Feb 13, 2026
    • DeepResearch

      Public
      Tongyi Deep Research, the Leading Open-source Deep Research Agent
      Python
      1.4k18k725Updated Feb 7, 2026Feb 7, 2026
    • hilichurl

      Public
      0000Updated Jan 13, 2026Jan 13, 2026
    • ViDoRAG

      Public
      [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
      Python
      4963011Updated Jan 11, 2026Jan 11, 2026
    • VLLM-KB

      Public
      [EMNLP 2025] Code for "Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference"
      Python
      0000Updated Jan 5, 2026Jan 5, 2026
    • E2Rank

      Public
      E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
      Python
      15230Updated Nov 4, 2025Nov 4, 2025
    • WebDetective

      Public
      A new evaluation paradigm for deep search that identifies specific LLM failure sources, introduces challenging hint-free datasets with holistic evaluation, and …
      Python
      0400Updated Oct 14, 2025Oct 14, 2025
    • ZeroSearch

      Public
      ZeroSearch: Incentivize the Search Capability of LLMs without Searching
      Python
      1151.2k00Updated Aug 16, 2025Aug 16, 2025
    • CHRONOS

      Public
      Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"
      Python
      3229220Updated Aug 4, 2025Aug 4, 2025
    • LaRA

      Public
      The code for LaRA Benchmark
      Python
      34700Updated May 28, 2025May 28, 2025
    • MaskSearch

      Public
      Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
      Python
      714910Updated May 27, 2025May 27, 2025
    • OmniSearch

      Public
      Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
      Python
      30413111Updated Apr 22, 2025Apr 22, 2025
    • CoFE-RAG

      Public
      Python
      44150Updated Apr 11, 2025Apr 11, 2025
    • CDQA

      Public
      CDQA: Chinese Dynamic Question Answering Benchmark
      Python
      01710Updated Dec 13, 2024Dec 13, 2024
    • Vec-RA-ODQA

      Public
      Source code of paper Improving "Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
      Python
      1400Updated Aug 30, 2024Aug 30, 2024
    • Key-Point-Analysis

      Public
      Python
      0100Updated Apr 3, 2024Apr 3, 2024
    • RankingGPT

      Public
      code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》
      Python
      23440Updated Jan 9, 2024Jan 9, 2024
    • SeqGPT

      Public
      SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
      Python
      1122730Updated Dec 13, 2023Dec 13, 2023
    • IBKD

      Public
      This is the official repository for the IBKD knowledge distillation method, as described in the paper .
      Python
      2200Updated Nov 28, 2023Nov 28, 2023
    • EcomGPT

      Public
      An Instruction-tuned Large Language Model for E-commerce
      Python
      1826591Updated Sep 26, 2023Sep 26, 2023
    • MANNER

      Public
      [ACL 2023] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition
      Python
      02060Updated Jul 21, 2023Jul 21, 2023
    • StructuralKD

      Public
      [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor
      Python
      1931Updated Jul 10, 2023Jul 10, 2023
    • KB-NER

      Public
      Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.
      Python
      21187171Updated Jan 10, 2023Jan 10, 2023
    • Multi-CPR

      Public
      [SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
      Python
      1920140Updated Jan 4, 2023Jan 4, 2023
    • ACE

      Public
      [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction
      Python
      47311121Updated Dec 2, 2022Dec 2, 2022
    • HiAGM

      Public
      Hierarchy-Aware Global Model for Hierarchical Text Classification
      Python
      42227120Updated Nov 28, 2022Nov 28, 2022
    • MultilangStructureKD

      Public
      [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling
      Python
      97201Updated Nov 23, 2022Nov 23, 2022
    • CLNER

      Public
      [ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
      Python
      159321Updated Nov 20, 2022Nov 20, 2022
    • AIN

      Public
      Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"
      Python
      31901Updated Nov 14, 2022Nov 14, 2022