Skip to content
Change the repository type filter

All

    Repositories list

    • soda-mmQC

      Public
      multi modal Quality checks
      Python
      0000Updated Sep 10, 2025Sep 10, 2025
    • SODA Curation is a Streamlit application that provides a simple interface for uploading and processing ZIP files. It's designed to run in a Docker container with NVIDIA GPU support.
      Python
      0102Updated Sep 10, 2025Sep 10, 2025
    • sd-graph

      Public
      Scripts to build the sd-graph database from the SourceData project
      Python
      2922Updated Sep 4, 2025Sep 4, 2025
    • Jupyter Notebook
      0000Updated Aug 19, 2025Aug 19, 2025
    • mecadoi

      Public
      Creating DOIs for peer reviews in MECA archives.
      Python
      0300Updated Aug 13, 2025Aug 13, 2025
    • SODA-RoBERTa is a Source Data resource for training RoBERTa transformers for natural language processing tasks in cell and molecular biology.
      Jupyter Notebook
      1510Updated Jul 2, 2025Jul 2, 2025
    • A mock server providing OpenID Connect (OIDC) flows for local development and testing. DO NOT USE IN PRODUCTION!
      JavaScript
      16000Updated Mar 25, 2025Mar 25, 2025
    • Python
      0200Updated Dec 10, 2024Dec 10, 2024
    • soda-data

      Public
      The Source Data dataset: a biological annotated dataset for machine learning and AI in the publishing context.
      Python
      0310Updated Dec 10, 2024Dec 10, 2024
    • Multimodal model to separate figure captions and their legends into their constituent panels
      Python
      0000Updated Aug 27, 2024Aug 27, 2024
    • Visualizing the peer review process.
      JavaScript
      1410Updated Feb 16, 2024Feb 16, 2024
    • Code for the SmartFigures viewer of the SourceData project
      JavaScript
      1214Updated Aug 7, 2023Aug 7, 2023
    • debatebox

      Public
      A demonstration of a debate between multiple AI-agents.
      Jupyter Notebook
      0000Updated Jun 2, 2023Jun 2, 2023
    • Retrieves Biorxiv data and generates datasets out of it.
      Python
      0100Updated May 25, 2023May 25, 2023
    • sdash

      Public
      A dashboard for sharing SmartFigures
      Vue
      02414Updated May 3, 2023May 3, 2023
    • Documentation on definitions and procedures for the SourceData curation system
      0000Updated Feb 10, 2023Feb 10, 2023
    • py-smtag

      Public
      PyTorch version of the SourceData SmartTag engine
      Python
      10309Updated Nov 22, 2022Nov 22, 2022
    • sodamet

      Public
      The SourceData Multi-level Embeddings Transformers for Biomedical NER
      0000Updated Sep 23, 2022Sep 23, 2022
    • datasets

      Public
      🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
      Python
      2.9k000Updated Feb 28, 2021Feb 28, 2021
    • gender

      Public
      Gender prediction
      HTML
      0000Updated Sep 24, 2020Sep 24, 2020
    • ai

      Public
      AI toolbox
      Python
      0000Updated May 12, 2020May 12, 2020
    • pmidoi

      Public
      Extaction of pmid - doi relationship for a given journal using EuropePMC
      Python
      0000Updated Jun 26, 2019Jun 26, 2019
    • An end-to-end toy scientific experiment, just for fun and learning
      Python
      4000Updated Jun 6, 2019Jun 6, 2019
    • U-Net implementation for PyTorch based on https://arxiv.org/abs/1505.04597
      Python
      62000Updated Dec 4, 2017Dec 4, 2017