Skip to content
Change the repository type filter

All

    Repositories list

    • hbase-operator-tools

      Public
      Apache HBase Operator Tools
      Java
      Apache License 2.0
      149001Updated Mar 20, 2026Mar 20, 2026
    • Python
      BSD 3-Clause "New" or "Revised" License
      112112Updated Mar 18, 2026Mar 18, 2026
    • https://docs.zyte.com/web-scraping/tutorial/index.html
      Python
      BSD 3-Clause "New" or "Revised" License
      2600Updated Mar 18, 2026Mar 18, 2026
    • web-snap

      Public
      Create "perfect" snapshots of web pages
      JavaScript
      MIT License
      43400Updated Mar 12, 2026Mar 12, 2026
    • zyte-common-items

      Public
      Contains the common item definitions used in Zyte.
      Python
      BSD 3-Clause "New" or "Revised" License
      101047Updated Feb 27, 2026Feb 27, 2026
    • x402

      Public
      A payments protocol for the internet. Built on HTTP.
      TypeScript
      Other
      1.3k1016Updated Feb 11, 2026Feb 11, 2026
    • Python client for Zyte API
      Python
      BSD 3-Clause "New" or "Revised" License
      62984Updated Feb 10, 2026Feb 10, 2026
    • Spider templates for automatic crawlers.
      Python
      BSD 3-Clause "New" or "Revised" License
      434147Updated Jan 8, 2026Jan 8, 2026
    • HTML
      MIT License
      31211Updated Oct 28, 2025Oct 28, 2025
    • Remove DIVs, style stuff and normalize HTML preserving structure information
      Python
      MIT License
      31400Updated Oct 24, 2025Oct 24, 2025
    • hetzner

      Public
      A high-level Python API for accessing the Hetzner robot.
      Python
      Other
      41000Updated Oct 9, 2025Oct 9, 2025
    • html-text

      Public
      HTML
      MIT License
      11830Updated Oct 6, 2025Oct 6, 2025
    • 0220Updated Sep 23, 2025Sep 23, 2025
    • Python
      MIT License
      2571Updated Sep 5, 2025Sep 5, 2025
    • Bash scripts to universally deploy various distributions
      Shell
      Other
      156200Updated Aug 4, 2025Aug 4, 2025
    • Python
      0000Updated Jul 17, 2025Jul 17, 2025
    • Websites for testing spiders
      Python
      MIT License
      0300Updated May 15, 2025May 15, 2025
    • A stub implementation of a subset of Zyte API
      Python
      MIT License
      0200Updated Apr 22, 2025Apr 22, 2025
    • Python
      0000Updated Mar 18, 2025Mar 18, 2025
    • URL matching library that relates URLs with resources
      Python
      BSD 3-Clause "New" or "Revised" License
      2910Updated Feb 14, 2025Feb 14, 2025
    • Contains rules for https://github.com/zytedata/duplicate-url-discarder.
      Python
      MIT License
      1000Updated Feb 5, 2025Feb 5, 2025
    • http-parser

      Public archive
      Fork of 'https://github.com/benoitc/http-parser'
      C
      Other
      95000Updated Nov 14, 2024Nov 14, 2024
    • Example solutions for the practice and contest websites of the code contest of Web Data Extraction Summit.
      Python
      MIT License
      2500Updated Oct 21, 2024Oct 21, 2024
    • Example site for web scraping tutorials
      Julia
      BSD 3-Clause "New" or "Revised" License
      173132Updated Oct 9, 2024Oct 9, 2024
    • rrweb

      Public archive
      record and replay the web
      TypeScript
      MIT License
      1.6k003Updated Sep 14, 2024Sep 14, 2024
    • geventhttpclient

      Public archive
      A high performance, concurrent http client library for python with gevent
      Python
      Other
      138001Updated Sep 3, 2024Sep 3, 2024
    • Run upstream VS Code on a remote machine with access through a modern web browser from any device, anywhere.
      TypeScript
      MIT License
      39k104Updated Aug 30, 2024Aug 30, 2024
    • A working Dockerfile that has unsloth with all the other dependencies
      Dockerfile
      0400Updated Aug 23, 2024Aug 23, 2024
    • zyte-api-workshop

      Public archive
      Python
      0107Updated Jul 29, 2024Jul 29, 2024
    • Python
      GNU General Public License v3.0
      0102Updated May 14, 2024May 14, 2024