Skip to content
@OSU-NLP-Group

OSU Natural Language Processing

Popular repositories Loading

  1. HippoRAG HippoRAG Public

    [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

    Python 2.8k 255

  2. Mind2Web Mind2Web Public

    [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

    Jupyter Notebook 868 116

  3. SeeAct SeeAct Public

    [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

    Python 782 102

  4. GUI-Agents-Paper-List GUI-Agents-Paper-List Public

    Building a comprehensive and handy list of papers for GUI agents

    Python 484 28

  5. TravelPlanner TravelPlanner Public

    [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

    Python 407 59

  6. MagicBrush MagicBrush Public

    [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

    Python 375 14

Repositories

Showing 10 of 58 repositories
  • GUI-Agents-Paper-List Public

    Building a comprehensive and handy list of papers for GUI agents

    OSU-NLP-Group/GUI-Agents-Paper-List’s past year of commit activity
    Python 484 28 1 0 Updated Sep 5, 2025
  • saev Public

    Sparse autoencoders for vision

    OSU-NLP-Group/saev’s past year of commit activity
    Python 43 MIT 6 3 1 Updated Sep 4, 2025
  • HippoRAG Public

    [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.

    OSU-NLP-Group/HippoRAG’s past year of commit activity
    Python 2,771 MIT 255 14 3 Updated Sep 4, 2025
  • ScienceAgentBench Public

    [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

    OSU-NLP-Group/ScienceAgentBench’s past year of commit activity
    Python 99 MIT 14 3 1 Updated Aug 26, 2025
  • AttributionBench Public

    [ACL'24 Findings] AttributionBench: How Hard is Automatic Attribution Evaluation?

    OSU-NLP-Group/AttributionBench’s past year of commit activity
    Python 10 1 1 0 Updated Aug 18, 2025
  • OSU-NLP-Group/hal-harness’s past year of commit activity
    Python 0 23 0 0 Updated Aug 18, 2025
  • AutoSDT Public

    [EMNLP'25] AutoSDT is a fully automatic pipeline to collect data-driven scientific coding tasks to train co-scientist models.

    OSU-NLP-Group/AutoSDT’s past year of commit activity
    Python 10 MIT 0 0 0 Updated Aug 11, 2025
  • Explorer Public

    [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

    OSU-NLP-Group/Explorer’s past year of commit activity
    Python 16 MIT 0 1 0 Updated Aug 4, 2025
  • WebGuard Public

    WebGuard: Building a Generalizable Guardrail for Web Agents

    OSU-NLP-Group/WebGuard’s past year of commit activity
    Python 8 MIT 0 2 0 Updated Jul 28, 2025
  • Online-Mind2Web Public

    An Illusion of Progress? Assessing the Current State of Web Agents

    OSU-NLP-Group/Online-Mind2Web’s past year of commit activity
    Python 81 MIT 2 2 0 Updated Jul 22, 2025

Top languages

Loading…