Skip to content

AlidaSoble/advanced-product-hunt-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 

Repository files navigation

Advanced Product Hunt Scraper

A powerful scraper that collects detailed launch information from Product Hunt, including product metadata, maker profiles, website text, and media assets. Designed for marketers, analysts, and founders who need structured Product Hunt data for research, outreach, and competitive tracking.

Bitbash Banner

Telegram   WhatsApp   Gmail   Website

Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for Advanced Product Hunt Scraper you've just found your team — Let's Chat. 👆👆

Introduction

This scraper automates the collection of Product Hunt launch data from daily, yearly, and archive pages. It extracts rich product information, maker details, contact links, upvotes, website content, and more — all in structured formats.

Built for growth teams, product researchers, and creators looking for high-quality intelligence on new product launches.

Why Accurate Product Hunt Data Matters

  • Tracks emerging tools and startups with reliable daily coverage.
  • Enables competitive analysis using upvotes, categories, and engagement.
  • Automates manual browsing of Product Hunt archives.
  • Helps discover founders and makers along with profile links.
  • Provides website content and email extraction for outreach workflows.

Features

Feature Description
Daily Launch Extraction Fetches all products in the “Top Products Launching Today” list.
Archive Scraping Supports archive-by-day and archive-by-year data collection.
Maker Profile Parsing Extracts maker details, social links, and about text.
Website Content Extraction Retrieves product website text and email if available.
Media & Links Collection Collects images, website URL, and YouTube video URL.
Vote & Filter Controls Filter by minimum upvotes or crawl only top N results.
Multi-format Exports Export datasets as JSON, CSV, XLSX, XML, RSS, and more.

What Data This Scraper Extracts

Field Name Field Description
date Timestamp when the product was scraped.
launchDate Launch date of the Product Hunt listing.
url URL to the Product Hunt product page.
name Product name.
title Full title including metadata.
shortDescription Brief one-sentence summary.
description Complete descriptive text from the listing.
categories Product Hunt category tags.
maker Detailed maker profile info including links.
websiteUrl Official website of the launched product.
images Array of images from the listing.
upvotes Number of votes the product received.
website Website title, URL, email, and raw text.

Example Output

[
  {
    "date": "2024-03-03T07:01:04.257Z",
    "launchDate": "2024-03-03T07:01:04.257Z",
    "url": "https://www.producthunt.com/posts/digital-creators-hub",
    "title": "Digital Creators' Hub - Content hub & idea discovery engine for digital creators | Product Hunt",
    "name": "Digital Creators' Hub",
    "shortDescription": "Content hub & idea discovery engine for digital creators",
    "description": "Easily discover great ideas, shareable content and free resources...",
    "categories": ["Marketing", "Tech", "Search"],
    "maker": {
      "makerHref": "https://www.producthunt.com/@clifftky",
      "name": "Clifford Tan",
      "slug": "@clifftky",
      "title": "Clifford Tan's profile on Product Hunt",
      "url": "https://www.producthunt.com/@clifftky",
      "links": ["https://clifford.io/", "https://twitter.com/clifftky"],
      "aboutText": "👋 I'm a creator of web apps..."
    },
    "websiteUrl": "https://dch.clifford.io",
    "images": [
      "https://ph-files.imgix.net/73cc3a49-adc2-4b8f-9111-6ddbc9435cdb.png",
      "... more images ..."
    ],
    "upvotes": 57,
    "website": {
      "title": "Digital Creators' Hub",
      "url": "https://dch.clifford.io",
      "email": "hello@clifford.io",
      "websiteRawText": "Welcome to the hub for Digital Creators..."
    }
  }
]

Directory Structure Tree

Advanced Product Hunt Scraper/
├── src/
│   ├── runner.js
│   ├── extractors/
│   │   ├── product_parser.js
│   │   ├── maker_parser.js
│   │   └── website_parser.js
│   ├── utils/
│   │   ├── request.js
│   │   └── filters.js
│   └── config/
│       └── settings.example.json
├── data/
│   ├── inputs.sample.json
│   └── sample_output.json
├── package.json
└── README.md

Use Cases

  • Growth marketers use it to identify new high-performing products so they can launch targeted outreach campaigns.
  • Startup founders monitor competitors' launches to understand traction and category trends.
  • Researchers and analysts gather structured Product Hunt datasets to study market movements.
  • Investors and scouts track early-stage products with detailed maker insights for sourcing opportunities.
  • No-code builders integrate the scraper into automations to populate databases with emerging tools.

FAQs

Q: Can I scrape yearly archives with thousands of listings? Yes, but yearly pages default to a 100-item limit. Increase filter.topNProducts to collect more, keeping in mind longer run times.

Q: Do I need to scrape website content? No — set saveWebsiteContent to false for lighter datasets and faster runs.

Q: How many listings are available per day? Typically 50–120 launches appear daily in the “Top Products Launching Today” section.

Q: Can I filter by popularity? Yes, use filter.minVotes or crawl only the top N products using filter.topNProducts.


Performance Benchmarks and Results

Primary Metric: Average extraction speed processes 40–70 product pages per minute, even with website text enabled.

Reliability Metric: Maintains a 98%+ success rate across daily and archive pages, ensuring consistent data collection.

Efficiency Metric: Optimized request batching reduces bandwidth usage by up to 35% during large archive scrapes.

Quality Metric: Data completeness typically exceeds 95%, including maker links, images, and website metadata.

Book a Call Watch on YouTube

Review 1

"Bitbash is a top-tier automation partner, innovative, reliable, and dedicated to delivering real results every time."

Nathan Pennington
Marketer
★★★★★

Review 2

"Bitbash delivers outstanding quality, speed, and professionalism, truly a team you can rely on."

Eliza
SEO Affiliate Expert
★★★★★

Review 3

"Exceptional results, clear communication, and flawless delivery. Bitbash nailed it."

Syed
Digital Strategist
★★★★★

Releases

No releases published

Packages

 
 
 

Contributors