|
| 1 | +# DataChain Studio |
| 2 | + |
| 3 | +DataChain Studio is a web application that enables Machine Learning and Data teams to seamlessly |
| 4 | + |
| 5 | +- [Run and track jobs](user-guide/jobs/index.md) |
| 6 | +- [Track experiments and manage models](user-guide/experiments/index.md) (via DVC integration) |
| 7 | +- [Collaborate on data projects](user-guide/team-collaboration.md) |
| 8 | + |
| 9 | +DataChain Studio supports multiple workflows: |
| 10 | +- **DataChain workflows**: For unstructured data processing and transformation |
| 11 | +- **DVC + Git workflows**: For ML experiment tracking and model registry, maintaining Git as the single-source-of-truth |
| 12 | + |
| 13 | +Sign in to DataChain Studio using your GitHub.com, GitLab.com, or Bitbucket.org account, or with your email address. Explore the demo projects and datasets, and [let us know](user-guide/troubleshooting.md#support) if you need any help getting started. |
| 14 | + |
| 15 | +## Why DataChain Studio? |
| 16 | + |
| 17 | +- Simplify data processing job tracking, visualization, and collaboration. |
| 18 | +- Support both modern DataChain workflows and traditional DVC experiment tracking. |
| 19 | +- Keep your code, data and processing connected at all times. |
| 20 | +- Apply your existing software engineering stack for data and ML teams. |
| 21 | +- Build a comprehensive data processing and ML platform for transparency and discovery across all your projects. |
| 22 | +- For DVC projects, maintain Git as the single-source-of-truth and use [GitOps](https://www.gitops.tech/) for deployment and automation. |
| 23 | + |
| 24 | +## Getting Started |
| 25 | + |
| 26 | +New to DataChain Studio? Start with these guides: |
| 27 | + |
| 28 | +- **[User Guide](user-guide/index.md)** - Learn how to use DataChain Studio features |
| 29 | +- **[API Reference](api/index.md)** - Integrate with Studio programmatically |
| 30 | +- **[Webhooks](webhooks.md)** - Set up event notifications |
| 31 | +- **[Self-hosting](self-hosting/index.md)** - Deploy your own Studio instance |
| 32 | + |
| 33 | +## Key Features |
| 34 | + |
| 35 | +### Dataset Management |
| 36 | +- Track and version your datasets |
| 37 | +- Visualize data processing pipelines |
| 38 | +- Share datasets across teams |
| 39 | + |
| 40 | +### Job Processing |
| 41 | +- Run data processing jobs in the cloud |
| 42 | +- Monitor job progress and logs |
| 43 | +- Schedule recurring data processing tasks |
| 44 | + |
| 45 | +### ML Experiment Tracking (DVC Integration) |
| 46 | +- Track and compare ML experiments |
| 47 | +- Manage model lifecycle and registry |
| 48 | +- Visualize metrics and plots |
| 49 | +- Git-based experiment versioning |
| 50 | + |
| 51 | +### Team Collaboration |
| 52 | +- Share projects with team members |
| 53 | +- Control access with role-based permissions |
| 54 | +- Integrate with development workflows |
| 55 | + |
| 56 | +### API Integration |
| 57 | +- RESTful API for programmatic access |
| 58 | +- Webhook notifications for automation |
| 59 | +- Command-line tools for developers |
| 60 | + |
| 61 | + |
| 62 | +Visit [studio.datachain.ai](https://studio.datachain.ai) to get started, or learn about [self-hosting](self-hosting/index.md) for enterprise deployments. |
0 commit comments