Forge Global Scraper collects structured intelligence on private companies, funding activity, and IPO pipelines from Forge Global. It helps teams track valuations, investors, and market signals in one consistent, machine-readable dataset.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
Forge Global Scraper is built to extract detailed private market and pre-IPO information at scale. It turns scattered company pages and financial disclosures into clean, structured data that’s easy to analyze and integrate.
This project is designed for investors, analysts, and research teams who need reliable visibility into private companies and upcoming IPOs without manual tracking.
- Extracts structured profiles for private and pre-IPO companies
- Captures funding rounds, valuations, and share class details
- Tracks leadership, investors, and related portfolio companies
- Aggregates recent financial and press coverage
- Monitors IPO pipeline and filing status updates
| Feature | Description |
|---|---|
| Company profile extraction | Collects core company details including sector, founding year, valuation, and funding totals. |
| Funding history tracking | Extracts funding rounds, tender offers, and secondary transactions with financial terms. |
| Investor mapping | Identifies key investors and maps their broader investment portfolios. |
| Leadership monitoring | Captures founders, executives, and board members when available. |
| News aggregation | Pulls recent financial and press coverage with sources and timestamps. |
| IPO calendar insights | Tracks IPO status, estimated dates, exchanges, and related news. |
| Field Name | Field Description |
|---|---|
| company | Company name as listed on Forge Global. |
| url | Direct link to the company profile page. |
| description | Business overview and operational focus. |
| website | Official company website. |
| sector | Primary industry sector classification. |
| subsector | More specific industry categorization. |
| founded | Year the company was founded. |
| total_funding | Total capital raised across all rounds. |
| post_money_valuation | Latest reported post-money valuation. |
| fundings | Detailed list of funding rounds and transactions. |
| peoples | Founders, executives, and leadership roles. |
| investors_other_investments | Portfolio companies linked to each investor. |
| news | Recent press and financial news coverage. |
| similar_companies | Related companies in the same market segment. |
| ipo_calendar | IPO status, exchange, ticker, and timeline data. |
{
  "company": "OpenAI Stock",
  "url": "https://forgeglobal.com/openai_stock/",
  "sector": "Enterprise Software",
  "subsector": "Data Intelligence",
  "founded": "2015",
  "total_funding": "$14.7B",
  "post_money_valuation": "$157B",
  "fundings": [
    {
      "funding_date": "10/02/2024",
      "share_class": "Funding Round",
      "amount_raised": "$6.6B",
      "post_money_valuation": "$157B",
      "conversion_ratio": "1.0x"
    }
  ],
  "peoples": [
    { "name": "Sam Altman", "title": "CEO and Co-Founder" }
  ]
}
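
The sample record stores amounts and dates as display strings (`"$14.7B"`, `"10/02/2024"`), so most analysis needs a normalization step first. The sketch below shows one way that step could look. The helper names `parse_money` and `parse_funding_date` are hypothetical and not taken from this project, the `K`, `M`, and `T` suffixes are assumed by analogy with the `B` seen in the sample, and the date is assumed to be in month/day/year order.

```python
from datetime import date, datetime
from decimal import Decimal

# Multipliers for amount suffixes. Only "B" appears in the sample record;
# "K", "M" and "T" are assumptions added for illustration.
_SUFFIXES = {
    "K": Decimal("1e3"),
    "M": Decimal("1e6"),
    "B": Decimal("1e9"),
    "T": Decimal("1e12"),
}


def parse_money(text: str) -> int:
    """Convert a display string such as "$14.7B" to whole US dollars."""
    cleaned = text.strip().replace("$", "").replace(",", "").upper()
    if not cleaned:
        raise ValueError("empty amount")
    multiplier = Decimal(1)
    if cleaned[-1] in _SUFFIXES:
        multiplier = _SUFFIXES[cleaned[-1]]
        cleaned = cleaned[:-1]
    # Decimal avoids the rounding drift that float("14.7") * 1e9 can introduce.
    return int(Decimal(cleaned) * multiplier)


def parse_funding_date(text: str) -> date:
    """Parse a date such as "10/02/2024", assuming month/day/year order."""
    return datetime.strptime(text, "%m/%d/%Y").date()
```

With these helpers, `parse_money("$6.6B")` yields an integer dollar amount and `parse_funding_date("10/02/2024")` yields a `datetime.date`, which are easier to sort and aggregate than the raw strings.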
Forge Global Scraper/
├── src/
│   ├── main.py
│   ├── collectors/
│   │   ├── company_profiles.py
│   │   ├── funding_history.py
│   │   ├── ipo_calendar.py
│   │   └── news_parser.py
│   ├── processors/
│   │   ├── normalize.py
│   │   └── validators.py
│   ├── exporters/
│   │   └── json_exporter.py
│   └── config/
│       └── settings.example.json
├── data/
│   ├── samples/
│   │   └── openai.sample.json
│   └── outputs/
├── requirements.txt
└── README.md
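
The layout above places a `validators.py` module between collection and export. Its contents are not shown in this README, so the following is only a minimal sketch of what a record check could look like. The `REQUIRED_FIELDS` mapping and the `validate_record` function are hypothetical names, and the choice of required keys is an assumption based on the field table.

```python
from typing import Any

# Assumed required keys and their expected types, based on the field table.
# The project's actual validators.py may check different fields.
REQUIRED_FIELDS: dict[str, type] = {
    "company": str,
    "url": str,
    "fundings": list,
    "peoples": list,
}


def validate_record(record: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the record passed."""
    problems = []
    for field, expected in REQUIRED_FIELDS.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected):
            actual = type(record[field]).__name__
            problems.append(f"{field}: expected {expected.__name__}, got {actual}")
    # The sample profile URL uses https, so flag anything else for review.
    url = record.get("url")
    if isinstance(url, str) and not url.startswith("https://"):
        problems.append("url: expected an https:// link")
    return problems
```

Returning a list of problems rather than raising on the first one lets a batch run log every issue in a record before deciding whether to export it.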
- Venture capital firms use it to monitor private company valuations, so they can spot investment opportunities earlier.
- Investment banks use it to track IPO pipelines, helping them prepare advisory and underwriting strategies.
- Equity analysts use it to consolidate private market data, improving valuation models and coverage.
- Market research teams use it to analyze sector trends, enabling better competitive intelligence.
- Fintech platforms use it to automate private market data feeds for dashboards and analytics.
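
For dashboard and analytics feeds like those described above, the nested `fundings` list is often easier to consume as flat rows. The sketch below flattens records into CSV using only the Python standard library. The function names and the column selection are illustrative assumptions and are not part of the project's exporters.

```python
import csv
import io
from typing import Any, Iterable, Iterator

# Columns taken from the sample record; conversion_ratio is left out on purpose
# to show that the flattening step can select a subset of fields.
FUNDING_COLUMNS = [
    "company",
    "funding_date",
    "share_class",
    "amount_raised",
    "post_money_valuation",
]


def funding_rows(record: dict[str, Any]) -> Iterator[dict[str, str]]:
    """Yield one flat row per funding entry, tagged with the company name."""
    for funding in record.get("fundings", []):
        row = {"company": record.get("company", "")}
        for column in FUNDING_COLUMNS[1:]:
            row[column] = funding.get(column, "")
        yield row


def fundings_to_csv(records: Iterable[dict[str, Any]]) -> str:
    """Render all funding rows from the given records as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FUNDING_COLUMNS, lineterminator="\n")
    writer.writeheader()
    for record in records:
        writer.writerows(funding_rows(record))
    return buffer.getvalue()
```

Passing the sample record through `fundings_to_csv` produces a header line followed by one row for the 10/02/2024 funding round.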
Is this scraper suitable for large-scale data collection? Yes. It’s designed to handle high-volume extraction across thousands of company profiles while maintaining consistent output structure.
What output format does it generate? All extracted data is structured as JSON, making it easy to store, analyze, or integrate into existing pipelines.
Does it include historical funding data? Yes. The scraper captures historical funding rounds, tender offers, and secondary transactions when available.
Can it track IPO status changes over time? Yes. IPO calendar data includes status, estimated dates, exchanges, and related news updates.
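
Tracking IPO status changes over time comes down to comparing two runs of the scraper. The sketch below diffs two snapshots keyed by profile `url`. It assumes `ipo_calendar` is an object with a `status` key, which this README does not show, and the function name `ipo_status_changes` is hypothetical.

```python
from typing import Any, Optional

Change = tuple[str, Optional[str], Optional[str]]


def ipo_status_changes(
    previous: list[dict[str, Any]], current: list[dict[str, Any]]
) -> list[Change]:
    """Return (company, old_status, new_status) for profiles whose status changed.

    Assumes each record carries "url" and "company", and that "ipo_calendar",
    when present, is a dict with a "status" key (an assumption, not confirmed
    by the sample output).
    """
    old_by_url = {record["url"]: record for record in previous}
    changes: list[Change] = []
    for record in current:
        old = old_by_url.get(record["url"])
        if old is None:
            continue  # newly seen profile: nothing to compare against
        old_status = (old.get("ipo_calendar") or {}).get("status")
        new_status = (record.get("ipo_calendar") or {}).get("status")
        if old_status != new_status:
            changes.append((record["company"], old_status, new_status))
    return changes
```

Matching on `url` rather than `company` keeps the comparison stable if a display name changes between runs.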
Primary Metric: Processes several hundred company profiles per hour under standard configurations.
Reliability Metric: Maintains a high success rate with automated retries on transient failures.
Efficiency Metric: Optimized data parsing minimizes redundant requests and reduces resource usage.
Quality Metric: Delivers high data completeness across company profiles, funding records, and IPO fields.
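
The reliability note above mentions automated retries on transient failures. The project's actual retry logic is not shown here, so the following is a generic sketch of retrying with exponential backoff and jitter. `TransientError` and `with_retries` are hypothetical names, and the default attempt count and delay are arbitrary illustrative values.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Hypothetical marker for failures worth retrying, such as a timeout."""


def with_retries(
    fetch: Callable[[], T],
    attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch() until it succeeds or the attempts are used up."""
    for attempt in range(attempts):
        try:
            return fetch()
        except TransientError:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last failure
            # Double the wait on each attempt and add jitter so that
            # parallel workers do not all retry at the same moment.
            sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))
    raise ValueError("attempts must be at least 1")
```

The `sleep` parameter is injectable so the backoff schedule can be tested without real waiting. Only `TransientError` is retried, so permanent failures such as a parsing bug surface immediately.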
