A comprehensive, open-source index of AI products, services, startups, businesses, and more.
The Open Intelligence Index is a crowdsourced, centralized repository documenting the global AI landscape. Our mission is to create the most comprehensive, accessible, and well-structured index of AI companies, products, services, and startups worldwide.
Unlike proprietary databases, this index is:
- Free - Completely open source under the MIT License
- Collaborative - Anyone can contribute and improve the data
- Structured - All entries follow defined schemas for consistency
- Accessible - Human-readable YAML format, machine-parsable JSON Schema
- Structured data with JSON Schema validation
- GitHub Actions CI for automatic validation
- IDE auto-completion via YAML Language Server
- Geographic and categorical organization
- Open source and developer-friendly
open-intelligence-index/
├── schemas/                        # JSON Schema definitions
│   └── startup.schema.json         # Schema for startup entries
├── index/                          # Main data directory
│   └── startups/                   # Startup/company index
│       └── germany/                # Country-based organization
│           └── aleph-alpha.yaml
├── .github/
│   └── workflows/
│       └── validate-schema.yml     # CI validation pipeline
├── schema.yaml                     # Human-readable schema reference
├── CONTRIBUTING.md                 # Contribution guidelines
├── DISCLAIMER.md                   # Data usage disclaimer
├── LICENSE                         # MIT License
└── README.md                       # This file
- Fork and clone the repository
- Create a new branch for your addition
- Add your entry following the schema (see Adding a New Entry)
- Submit a Pull Request
You can use this index in your projects by:
- Parsing the YAML files directly
- Converting to JSON for API usage
- Building custom queries and filters
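The three usage patterns above can be sketched in Python as follows. This is a minimal illustration, not part of the repository: the function names `load_entries`, `filter_entries`, and `to_json` are hypothetical, the default `index/startups` root is taken from the project structure, and the field names (`headquarters.country_code`, `technologies`) are assumed from the example entry in this README. Parsing requires the third-party PyYAML package.

```python
import json
from pathlib import Path


def load_entries(root="index/startups"):
    """Parse every *.yaml file under root into a list of dicts."""
    import yaml  # third-party: pip install pyyaml

    entries = []
    for path in sorted(Path(root).rglob("*.yaml")):
        with path.open(encoding="utf-8") as handle:
            entries.append(yaml.safe_load(handle))
    return entries


def filter_entries(entries, country_code=None, technology=None):
    """Keep only the entries that match every filter that was supplied."""
    matches = []
    for entry in entries:
        headquarters = entry.get("headquarters") or {}
        if country_code and headquarters.get("country_code") != country_code:
            continue
        if technology and technology not in (entry.get("technologies") or []):
            continue
        matches.append(entry)
    return matches


def to_json(entries):
    """Serialize parsed entries as pretty-printed JSON, e.g. for an API response."""
    return json.dumps(entries, indent=2, ensure_ascii=False)
```

Run from the repository root, `to_json(filter_entries(load_entries(), country_code="GB"))` would produce a JSON document containing only the entries headquartered in the UK.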
- Navigate to the appropriate directory based on the company's location:

      cd index/startups/<country-code>/

- Create a new YAML file named after the company (use lowercase, hyphenated):

      # Example: For "DeepMind" in the UK
      touch index/startups/gb/deepmind.yaml

- Add the schema reference at the top (enables IDE validation):

      # yaml-language-server: $schema=https://raw.githubusercontent.com/shaharia-lab/open-intelligence-index/main/schemas/startup.schema.json

- Fill in the required fields:

      id: deepmind
      name: "DeepMind"
      website: "https://deepmind.com"
      founded_year: 2010
      description: "A leading AI research company focused on solving intelligence."
      industries:
        - "Artificial Intelligence"
        - "Research"
      technologies:
        - "Deep Learning"
        - "Reinforcement Learning"
      headquarters:
        city: "London"
        country_code: "GB"

- Optional fields include logo_url, long_description, funding, open_source, and socials.

- Validate locally (optional):

      # Install Python dependencies
      pip install pyyaml jsonschema
      # Validate your file
      python scripts/validate.py index/startups/gb/deepmind.yaml

- Commit and create a PR:

      git add index/startups/gb/deepmind.yaml
      git commit -m "Add DeepMind (UK)"
      git push origin add-deepmind
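The file-naming convention and the required fields from the steps above can be sketched as a small Python helper. The names `slugify`, `entry_path`, and `missing_required_fields` are hypothetical and do not exist in the repository, and the field list is copied from the example entry above; the authoritative list is whatever `schemas/startup.schema.json` declares.

```python
import re
from pathlib import PurePosixPath

# Assumption: taken from the example entry in this README, not read from the schema.
REQUIRED_FIELDS = (
    "id", "name", "website", "founded_year", "description",
    "industries", "technologies", "headquarters",
)


def slugify(company_name):
    """Lowercase the name and join its ASCII alphanumeric runs with hyphens."""
    # Non-ASCII characters are treated as separators, so accented names
    # may need a manual file name instead.
    return "-".join(re.findall(r"[a-z0-9]+", company_name.lower()))


def entry_path(company_name, country_code):
    """Build the repository-relative path for a new startup entry."""
    file_name = f"{slugify(company_name)}.yaml"
    return str(PurePosixPath("index/startups") / country_code.lower() / file_name)


def missing_required_fields(entry):
    """Return the required top-level fields that the entry dict lacks."""
    return [field for field in REQUIRED_FIELDS if field not in entry]
```

For example, `entry_path("DeepMind", "GB")` yields `index/startups/gb/deepmind.yaml`, matching the path used in the steps above. The field check is only a quick pre-check and does not replace schema validation.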
All entries are automatically validated against the schema when you submit a PR. The CI pipeline will:
- Convert YAML to JSON
- Validate against schemas/startup.schema.json
- Report any validation errors
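The three pipeline steps above could look roughly like the following Python sketch. It is not the project's actual `scripts/validate.py` or CI workflow, whose contents are not shown here; the function names `format_error` and `validate_entry` are hypothetical. It assumes the third-party `pyyaml` and `jsonschema` packages mentioned in the local validation step.

```python
import json
from pathlib import Path


def format_error(file_path, field_path, message):
    """Render one validation problem as 'file: field.path: message'."""
    location = ".".join(str(part) for part in field_path) or "<root>"
    return f"{file_path}: {location}: {message}"


def validate_entry(entry_path, schema_path="schemas/startup.schema.json"):
    """Return a list of human-readable errors for one YAML entry."""
    # Third-party packages: pip install pyyaml jsonschema
    import yaml
    from jsonschema.validators import validator_for

    schema = json.loads(Path(schema_path).read_text(encoding="utf-8"))
    # Step 1: parse the YAML into plain JSON-compatible Python objects.
    entry = yaml.safe_load(Path(entry_path).read_text(encoding="utf-8"))

    # Step 2: pick the validator class matching the schema's declared draft.
    validator = validator_for(schema)(schema)

    # Step 3: collect every error instead of stopping at the first one.
    return [
        format_error(entry_path, error.absolute_path, error.message)
        for error in validator.iter_errors(entry)
    ]
```

An empty list means the entry conforms to the schema. Collecting all errors with `iter_errors` lets a contributor fix every problem in one pass rather than one failure per CI run.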
Each index type has its own schema definition in the schemas/ directory:
| Schema | Description | Status |
|---|---|---|
| startup.schema.json | Startup/Company entries | Active |
| product.schema.json | AI Product entries | Planned |
| service.schema.json | AI Service entries | Planned |
We welcome contributions from everyone! Please read our CONTRIBUTING.md for detailed guidelines.
- Add new startup/company entries
- Update existing entries with new information
- Fix errors or outdated data
- Improve documentation
- Propose new schemas for additional index types
- Report bugs or suggest improvements
Be respectful, constructive, and inclusive. We aim to maintain a welcoming community for all contributors.
IMPORTANT: This is a crowdsourced database. All data is provided "AS IS" without warranty of any kind.
Please read our full DISCLAIMER.md before using this data. Key points:
- Data is submitted by community volunteers and not independently verified
- You must independently verify all information before use
- The project is not responsible for inaccuracies, errors, or outdated information
- Inclusion does not constitute endorsement or recommendation
Data is crowdsourced from:
- Community contributions
- Publicly available information
- Company websites and press releases
- Public databases and news sources
- Add product index schema
- Add service index schema
- Add investor/VC index schema
- Add search API/web interface
- Add data freshness indicators
- Add automated data enrichment
This project is licensed under the MIT License - see the LICENSE file for details.
- All contributors who help build and maintain this index
- The open-source community for tools and inspiration
- Companies making the AI landscape more transparent
Got questions? Open an issue or start a discussion.
Ready to contribute? Check out CONTRIBUTING.md
Made with ❤️ by the Open Intelligence Index community