Skip to content

Latest commit

 

History

History
91 lines (62 loc) · 7.03 KB

File metadata and controls

91 lines (62 loc) · 7.03 KB

Bytesflows Blog & Content

Articles, guides, and tool docs for Bytesflows — dynamic residential proxies and free scraping tools. This repo is the Markdown source for what you see at bytesflows.com/en/blog.

Read on the site: Blog · Proxies · Pricing


What is Bytesflows?

Bytesflows provides dynamic residential proxies — real household IPs that rotate per request or per session so your scrapers look like normal users. That means fewer blocks, better success on protected sites, and the ability to scale.

The content in this repo is written to help you learn web scraping, choose the right proxies, and run more reliable crawls. Explore the guides on the live site and try Bytesflows proxies when you’re ready to scale.


Where to read the content

Where What
Live site bytesflows.com/en/blog — all articles, landing pages, and tool pages
This repo content/ — Markdown source that syncs to the site

Internal links in the content follow the live sitemap (e.g. /en/blog/..., /en/proxies). For the best reading experience, use the blog index and categories on the website. Below is a content map for this repository.


Content navigation (this repo)

By type

Folder Description Example slugs (live URL: https://bytesflows.com/en/blog/<slug>)
content/blog/ Long-form articles, guides, comparisons what-is-web-scraping-beginner-guide, ultimate-guide-web-scraping-2026, best-proxies-for-web-scraping
content/landing/ Topic pages (residential proxies, Cloudflare, Python) residential-proxies, cloudflare-scraping, python-scraping-proxy
content/tools/ Docs for on-site tools proxy-checker, http-header-checker, scraping-test, user-agent-generator, robots-tester, proxy-rotator

By topic (blog categories)

Articles use a category in frontmatter. Use these to find or organize content:

Topic What you’ll find Example articles (slug)
Getting started What is web scraping, first scraper, tools for beginners what-is-web-scraping-beginner-guide, ultimate-guide-web-scraping-2026, how-to-build-first-web-scraper, web-scraping-tools-beginners
Web scraping basics Workflow, vs API, vs crawling, at-scale practices web-scraping-workflow-explained, web-scraping-vs-api, web-scraping-vs-web-crawling, web-scraping-at-scale-best-practices
Python Tutorials, libraries, Scrapy, Requests, performance python-web-scraping-guide, python-web-scraping-tutorial-beginners, best-python-libraries-web-scraping, scrapy-framework-guide, using-requests-web-scraping
Proxies Residential vs datacenter, rotation, pools, best proxies datacenter-vs-residential-proxies, proxy-rotation-strategies, rotating-proxies-web-scraping, best-proxies-for-web-scraping, how-many-proxies-need-scraping
Playwright / browser Headless browser, Playwright vs Selenium/Puppeteer, proxy config playwright-web-scraping-tutorial, headless-browser-scraping-guide, playwright-proxy-configuration-guide, using-proxies-playwright
Anti-bot & bypass Cloudflare, fingerprinting, captchas, detection bypass-cloudflare-web-scraping, browser-fingerprinting-explained, how-websites-detect-scrapers, handling-captchas-in-scraping
Legal & ethics Legality, ethical practices web-scraping-legal-considerations, is-web-scraping-legal, ethical-web-scraping-best-practices-2025
Use cases E‑commerce, jobs, SERP, real estate, travel, etc. scraping-ecommerce-websites, scraping-job-listings, scraping-serp-data, scraping-real-estate-data
AI & advanced AI/LLM for extraction, pipelines, agents ai-web-scraping-explained, using-llms-extract-web-data, future-of-ai-web-scraping

Quick links (live site)

New to web scraping? Start with the beginner guide on our blog.


Repo structure (for maintainers)

content/
  blog/       # Long articles (slug → /en/blog/<slug>)
  landing/    # Topic landings (residential-proxies, cloudflare-scraping, python-scraping-proxy)
  tools/      # Tool docs (proxy-checker, scraping-test, etc.)
scripts/
  upload-blog-to-notion.js   # Sync content to Notion (site pipeline)
  fix-internal-links-sitemap.js  # Normalize internal links to /en/blog/... and /en/proxies
  # ... other utilities
  • Internal links: Use only paths that exist on the live sitemap (bytesflows.com/sitemap.xml), e.g. /en/blog/<slug>, /en/proxies. Run node scripts/fix-internal-links-sitemap.js to fix old-style links.
  • New posts: Add a .md file under content/blog/, content/landing/, or content/tools/ with frontmatter (title, slug, summary, category, tags, language). Then run the upload/sync script as used in your pipeline.

About this repository

This repo is the single source of truth for Bytesflows blog and tool content in Markdown. Articles are published at bytesflows.com and linked from the sitemap so readers can discover guides on web scraping, proxies, and best practices. The goal is to provide practical, technical content that helps you build and run better scrapers — and when you need reliable IPs at scale, Bytesflows dynamic residential proxies are there to help.