
Commit 6849c4d

TC-MO and BraniganLee authored
docs: integrate playbook into integrate with apify doc (#1608)
- restructure the document
- expand with technical part of integration playbook
- add TODO for OAuth
- formatting fixes
- style guide fixes

---------

Co-authored-by: Daniel Lee <[email protected]>
1 parent 873da7c commit 6849c4d

1 file changed

+174
-11
lines changed

@@ -1,36 +1,199 @@
11
---
2-
title: Integrate with Apify 🤝
2+
title: Integrate with Apify
33
sidebar_label: Create new integration
44
description: Learn about how to integrate your service with Apify to benefit from a mutual integration.
55
sidebar_position: 90.00
66
slug: /integrations/integrate
77
---
88

9-
If you are building a service and your users could benefit from integrating with Apify or vice versa, we would love to hear from you! Contact us at [[email protected]](mailto:[email protected]) to discuss potential collaboration. We are always looking for ways to make our platform more useful and powerful for our users. Here are some examples of services and tools integrating with Apify:
9+
If you are building a service and your users could benefit from integrating with Apify or vice versa, we would love to hear from you! Contact us at [[email protected]](mailto:[email protected]) to discuss potential collaboration. We are always looking for ways to make our platform more useful and powerful for our users.
10+
11+
## Why integrate with Apify
12+
13+
Apify is the leading platform for web scraping, AI agents, and automation tools. By integrating Apify into your platform, you enable users to incorporate real-time, structured data from the web with zero scraping infrastructure on your side.
14+
15+
[Apify Store](https://apify.com/store) contains thousands of pre-built Actors, ready-made tools for web scraping and automation.
16+
17+
## Integration types
18+
19+
An Apify integration can be _general_, allowing users to integrate any Actor from Apify Store into their workflows (or their own Actors), or _Actor-specific_, enabling targeted automation like integrating [Instagram Scraper](https://apify.com/apify/instagram-scraper) for use cases like social media monitoring.
20+
21+
### General integrations
22+
23+
General integrations allow users to integrate Actors into their workflows by connecting Apify with other platforms. Examples include:
1024

11-
- [Langchain](./ai/langchain.md) integration enables developers to build their projects, such as AI chatbots, with Langchain to feed them with data from the web easily.
1225
- [Zapier](./workflows-and-notifications/zapier.md) integration allows Zapier users to enrich their automation workflows with data from the web or to add additional Actions performed by [Apify Actors](https://apify.com/store).
1326
- [Keboola](/platform/integrations/keboola) integration enables Keboola users to easily pull data crawled from the web into their data pipelines.
1427

15-
and many more that you can see at [integrations](./index.mdx).
28+
### Actor-specific integrations
29+
30+
Actor-specific integrations are designed for targeted use cases. While they work similarly to general integrations, they help users find the right Apify tools more easily and provide a better experience. Examples include:
31+
32+
- [Make Instagram Scraper integration](https://www.make.com/en/integrations/apify-instagram-scraper)
33+
- [Lindy.ai Instagram integration](https://www.lindy.ai/integrations/instagram)
34+
35+
For more examples, both general and Actor-specific, see [integrations](./index.mdx).
1636

1737
## Integrating with Apify
1838

19-
To integrate your service with Apify, you have two options. You can either:
39+
To integrate your service with Apify, you have two options:
2040

21-
- build an [Apify Actor](https://docs.apify.com/platform/actors) that will be used as integration within the [Apify Console](https://console.apify.com)
22-
- build an external integration, such as [Zapier](./workflows-and-notifications/zapier.md).
41+
- Build an external integration using the [Apify API](https://docs.apify.com/api/v2)
42+
- Build an [Apify Actor](https://docs.apify.com/platform/actors) that will be used as an integration within [Apify Console](https://console.apify.com)
2343

2444
![Integration-ready Actors](./images/integration-ready-actors.png)
2545

2646
### Building an integration Actor
2747

28-
The best way to reach out to Apify users is directly within [Apify Console](https://console.apify.com). To do that, you need to build an integrable Actor that can be piped into other Actors to upload existing data into a database. This can then be easily configured within Apify Console. Follow the [guide on building integration-ready Actors](./actors/integration_ready_actors.md).
48+
One way to reach Apify users is directly within [Apify Console](https://console.apify.com). To do that, you need to build an integrable Actor that can be piped into other Actors to upload existing data into a database. This can then be easily configured within Apify Console. Follow the [guide on building integration-ready Actors](./actors/integration_ready_actors.md).
2949

3050
### Building an external integration
3151

32-
Alternatively, you can let your users manage the connection directly on your side using [Apify API](https://docs.apify.com/api/v2) and our API clients for [JavaScript](/api/client/js/) or [Python](/api/client/python/). This way, you can provide a seamless experience for your users and let them manage the connection directly from your service.
33-
34-
For inspiration, take a look at the public repositories of our existing external integrations [Zapier](https://github.com/apify/apify-zapier-integration) or [Keboola](https://github.com/apify/keboola-ex-apify) both done in JavaScript or [Airbyte](https://github.com/airbytehq/airbyte/tree/master/airbyte-integrations/connectors/source-apify-dataset) written in Python.
52+
Alternatively, let your users manage the connection from within your service by using the [Apify API](https://docs.apify.com/api/v2) and the Apify API clients for [JavaScript](/api/client/js/) or [Python](/api/client/python/).
3553

3654
![Airbyte sources tab](./images/airbyte-sources-web.png)
55+
56+
### Authentication methods
57+
58+
Apify supports two main authentication methods for secure API access.
59+
60+
_OAuth 2.0_ - Use OAuth 2.0 to allow users to authorize your integration without sharing their credentials.
61+
<!-- TODO expand section possibly? -->
62+
63+
_API token_ - Users generate a personal API token on the Apify account settings page. For more information, see [API Token documentation](https://docs.apify.com/platform/integrations/api#api-token).
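
As a minimal sketch of the API token method, the token can be sent in an `Authorization: Bearer` header. The helper name `build_auth_headers` and the placeholder token are hypothetical, and no request is sent here.

```python
def build_auth_headers(api_token: str) -> dict[str, str]:
    """Return HTTP headers that authenticate a request with an Apify API token."""
    if not api_token:
        raise ValueError("An Apify API token is required")
    return {
        "Authorization": f"Bearer {api_token}",
        "Content-Type": "application/json",
    }


# Placeholder value for illustration only, not a real token.
headers = build_auth_headers("apify_api_PLACEHOLDER")
```

Keeping the token in a header rather than in the URL query string avoids leaking it into request logs on your platform.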
64+
65+
### API implementation
66+
67+
To build an integration, core API endpoints can be mapped as **actions and triggers** inside your platform.
68+
69+
#### Action endpoints
70+
71+
##### Run an Actor
72+
73+
Triggers the execution of any Apify Actor by ID, allowing users to start custom or public web scraping and automation Actors with specified input parameters.
74+
75+
Recommended features:
76+
77+
- _Select Actor_: Pre-populate the Actor list with Actors that the user created or used, fetched with the [List of Actors API](https://docs.apify.com/api/v2/acts-get), and enrich it with Actors from Apify Store that the user has not run yet, fetched with the [Store API](https://docs.apify.com/api/v2/store-get).
78+
- _Synchronous vs. asynchronous run_: In synchronous mode, the flow waits until the run finishes (consider enforcing a timeout on your platform side). In asynchronous mode, the flow continues as soon as the run has started.
79+
- _Input UI_: Upon selecting an Actor, dynamically display the Actor-specific input fields and preload default example values based on the Actor input schema. Alternatively, allow users to insert a JSON input for the Actor.
80+
- _Run options_: Include the option to choose the [Actor build](https://docs.apify.com/platform/actors/running/runs-and-builds), [memory](https://docs.apify.com/platform/actors/running/usage-and-resources#memory), and [timeout](https://docs.apify.com/platform/actors/running/usage-and-resources#memory).
81+
- _Field mapping_: Allow users to map fields to data acquired in previous steps of the workflow.
82+
83+
##### Run a task
84+
85+
Starts a predefined task (a saved Actor configuration), making it easy for users to run recurring or templated workflows without redefining inputs each time.
86+
87+
Recommended features:
88+
89+
- _Select task_: The task list will be pre-populated with tasks that the user created, using the [List of tasks](https://docs.apify.com/api/v2/actor-tasks-get) API.
90+
- _Synchronous vs. asynchronous run_: In synchronous mode, the flow waits until the task run finishes (consider enforcing a timeout on your platform side). In asynchronous mode, the flow continues as soon as the run has started.
91+
- _JSON input field_: Lets users add a JSON input that overrides the task input.
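
The JSON input field could be validated before it is sent as the task input override, as in the sketch below. The helper name `build_run_task_request` and the example override are hypothetical. The request is only built, not sent.

```python
import json
from urllib.request import Request


def build_run_task_request(token, task_id, json_input_field=""):
    """Build (but do not send) a POST request that runs a saved task."""
    override = None
    if json_input_field.strip():
        # json.loads raises json.JSONDecodeError (a ValueError) on malformed JSON.
        override = json.loads(json_input_field)
        if not isinstance(override, dict):
            raise ValueError("The task input override must be a JSON object")
    # With no override, send no body so the task runs with its saved input.
    body = json.dumps(override).encode("utf-8") if override is not None else None
    return Request(
        f"https://api.apify.com/v2/actor-tasks/{task_id}/runs",
        data=body,
        headers={"Authorization": f"Bearer {token}", "Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical task ID and override, for illustration only.
task_request = build_run_task_request("apify_api_PLACEHOLDER", "TASK_ID", '{"maxItems": 10}')
```

Rejecting non-object JSON early gives users an error in your UI instead of a failed API call.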
92+
93+
##### Get dataset items
94+
95+
Fetches structured results (JSON, CSV, etc.) generated by a previously run Actor or task, which can be used as input for further workflow steps.
96+
97+
Recommended features:
98+
99+
- _Dataset_: Dropdown (user's datasets) or ID/String input. Populated via [Datasets API](https://docs.apify.com/api/v2/datasets-get).
100+
- _Limit (optional)_: The maximum number of dataset items to fetch. If empty, the default limit will be used.
101+
- _Offset (optional)_: The offset in the dataset from where to start fetching the items. If empty, it will be from the beginning.
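
A minimal sketch of how the optional _Limit_ and _Offset_ fields could translate into query parameters of the dataset items endpoint. The helper name `build_dataset_items_url` and the dataset ID are hypothetical.

```python
from urllib.parse import urlencode


def build_dataset_items_url(dataset_id, limit=None, offset=None, fmt="json"):
    """Return the URL for fetching items from a dataset."""
    params = {"format": fmt}
    # Leave out empty fields so the API applies its own defaults.
    if limit is not None:
        params["limit"] = limit
    if offset is not None:
        params["offset"] = offset
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{urlencode(params)}"


# Hypothetical dataset ID: fetch items 200-299.
items_url = build_dataset_items_url("DATASET_ID", limit=100, offset=200)
```

Fetching in pages this way (increasing `offset` by `limit` each time) keeps individual responses small for large datasets.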
102+
103+
##### Get a key-value store item
104+
105+
Retrieves a specific item from a key-value store, commonly used to access metadata, snapshots, logs, or one-off results generated during Actor execution.
106+
107+
Recommended features:
108+
109+
- _Key-value store_: Dropdown (user's KV stores) or ID/String input. Populated via [Key-value Stores API](https://docs.apify.com/api/v2/key-value-stores-get).
110+
- _Record key_: value (string)
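
The two fields above could be combined into a record URL as sketched here. The helper name `build_record_url` and the store ID are hypothetical, and `OUTPUT` is used only as an example key.

```python
from urllib.parse import quote


def build_record_url(store_id, record_key):
    """Return the URL of a single record in a key-value store."""
    if not record_key:
        raise ValueError("A record key is required")
    # Percent-encode the key so it is always a single, valid path segment.
    return (
        "https://api.apify.com/v2/key-value-stores/"
        f"{quote(store_id, safe='~')}/records/{quote(record_key, safe='')}"
    )


record_url = build_record_url("STORE_ID", "OUTPUT")
```

A record can hold any content type (JSON, HTML, images), so the response's `Content-Type` header is worth checking before parsing the body.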
111+
112+
##### Scrape a single URL
113+
114+
Runs Apify's [Website Content Crawler](https://apify.com/apify/website-content-crawler) in synchronous mode to extract structured data from a single web page - ideal for on-demand URL scraping inside agents or automation flows.
115+
116+
Recommended features:
117+
118+
- _URL_: The URL that you intend to scrape (string).
119+
- _Crawler type_: Dropdown menu, allowing users to choose from the following options:
120+
    - _Headless web browser_ - Useful for websites with anti-scraping protections and JavaScript rendering. It recognizes common blocking patterns like CAPTCHAs and automatically retries blocked requests through new sessions.
121+
    - _Stealthy web browser (default)_ - Another headless web browser with anti-blocking measures enabled. Try this if you encounter anti-bot protections while scraping.
122+
    - _Raw HTTP client_ - High-performance crawling mode that uses raw HTTP requests to fetch pages. It's faster and cheaper, but might not work on all websites.
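
The sketch below shows one way the two fields could be turned into an input for Website Content Crawler. The field names (`startUrls`, `crawlerType`, `maxCrawlPages`, `maxCrawlDepth`) and the `crawlerType` values are assumptions that should be verified against the Actor's current input schema. The helper name `build_single_url_input` is hypothetical.

```python
# Assumed mapping from the dropdown labels to crawlerType values.
# Verify these against the Actor's input schema before relying on them.
CRAWLER_TYPES = {
    "Headless web browser": "playwright:chrome",
    "Stealthy web browser": "playwright:firefox",
    "Raw HTTP client": "cheerio",
}


def build_single_url_input(url, crawler_label="Stealthy web browser"):
    """Return an Actor input that limits the crawl to a single page."""
    if not url.startswith(("http://", "https://")):
        raise ValueError("The URL must start with http:// or https://")
    return {
        "startUrls": [{"url": url}],
        "crawlerType": CRAWLER_TYPES[crawler_label],
        "maxCrawlPages": 1,   # stop after the first page
        "maxCrawlDepth": 0,   # do not follow links
    }


single_url_input = build_single_url_input("https://example.com")
```

The resulting input would be posted to the Actor's synchronous run endpoint, for example `https://api.apify.com/v2/acts/apify~website-content-crawler/run-sync-get-dataset-items`.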
123+
124+
##### Universal API call
125+
126+
A node to send API requests to Apify, allowing advanced users to configure or query Actors, tasks, datasets, or other API endpoints programmatically.
127+
128+
#### Trigger endpoints
129+
130+
##### Watch Actor runs
131+
132+
Monitors the status of an Actor run by ID, which is useful for triggering follow-up steps once a job has completed. The trigger fires when a specific Actor run reaches a terminal status (succeeded, failed, timed out, or aborted).
133+
134+
Recommended features:
135+
136+
- _Select Actor runs to watch_: Dropdown (list of user's Actors). Populated via [List Actors API](https://docs.apify.com/api/v2/acts-get)
137+
138+
##### Watch task runs
139+
140+
Similar to watching Actor runs, this tracks the progress and completion status of a specific task run to allow event-driven actions in a workflow.
141+
142+
Recommended features:
143+
144+
- _Select Actor tasks to watch_: Dropdown (list of user's tasks). Populated via [List tasks API](https://docs.apify.com/api/v2/actor-tasks-get).
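
One possible way to implement both triggers is with Apify webhooks, assuming your platform can expose a callback URL. The sketch below builds the payload that would be sent in a `POST` request to `https://api.apify.com/v2/webhooks`. The helper name `build_webhook_payload`, the callback URL, and the IDs are hypothetical.

```python
# Webhook event types that correspond to the terminal run statuses.
TERMINAL_EVENTS = [
    "ACTOR.RUN.SUCCEEDED",
    "ACTOR.RUN.FAILED",
    "ACTOR.RUN.TIMED_OUT",
    "ACTOR.RUN.ABORTED",
]


def build_webhook_payload(request_url, *, actor_id=None, task_id=None):
    """Return a webhook definition that watches one Actor or one task."""
    if (actor_id is None) == (task_id is None):
        raise ValueError("Provide exactly one of actor_id or task_id")
    if actor_id is not None:
        condition = {"actorId": actor_id}
    else:
        condition = {"actorTaskId": task_id}
    return {
        "eventTypes": list(TERMINAL_EVENTS),
        "condition": condition,
        "requestUrl": request_url,  # where Apify sends the notification
    }


# Hypothetical callback URL and task ID, for illustration only.
payload = build_webhook_payload("https://your-platform.example/hooks/apify", task_id="TASK_ID")
```

A webhook avoids polling: your platform is notified once, when the run finishes, and can then fetch the run's dataset.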
145+
146+
### Pricing options
147+
148+
Choose between two pricing models based on your integration setup.
149+
150+
#### Direct user billing
151+
152+
Users create their own Apify accounts and are billed directly by Apify for their usage. This model gives users full control over their Apify usage and billing.
153+
154+
#### Whitelabel access
155+
156+
Users access Apify through your platform without needing an Apify account. Apify bills you based on consumption, and you factor costs into your pricing.
157+
158+
### Monitoring and tracking
159+
160+
To help Apify monitor and support your integration, every API request should identify your platform. You can do this in one of two ways:
161+
162+
- Preferred:
163+
    - Use the `x-apify-integration-platform` header with your platform name (e.g., make.com, zapier).
164+
    - If your platform has multiple Apify apps, also include the `x-apify-integration-app-id` header with the unique app ID.
165+
- Alternative:
166+
    - Set a custom `User-Agent` header that identifies your platform.
167+
168+
These identifiers enable better analytics and support for your integration.
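
A minimal sketch of the preferred approach, adding the identification headers to every outgoing request. The helper name `build_integration_headers` and the example values are hypothetical.

```python
def build_integration_headers(platform, app_id=None):
    """Return headers that identify the calling platform to Apify."""
    if not platform:
        raise ValueError("A platform name is required")
    headers = {"x-apify-integration-platform": platform}
    # Only needed when the platform ships more than one Apify app.
    if app_id:
        headers["x-apify-integration-app-id"] = app_id
    return headers


# Hypothetical platform name and app ID, for illustration only.
tracking_headers = build_integration_headers("example-platform", app_id="instagram-scraper-app")
```

These headers can be merged with the authentication headers so that each request carries both.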
169+
170+
## Technical resources
171+
172+
### Apify API
173+
174+
[Apify API](https://docs.apify.com/api) provides an extensive REST API that covers all of the features of the Apify platform. You can download the complete OpenAPI schema of Apify API in [YAML](https://docs.apify.com/api/openapi.yaml) or [JSON](https://docs.apify.com/api/openapi.json) format. Apify provides official libraries for JavaScript and Python to access the API.
175+
176+
- [Apify API Reference](https://docs.apify.com/api/v2)
177+
- Client libraries
178+
    - [JavaScript/TypeScript/Node.js](https://docs.apify.com/api/client/js/)
179+
    - [Python](https://docs.apify.com/api/client/python/)
180+
181+
### Reference implementations
182+
183+
For inspiration, check out the public repositories of Apify's existing external integrations:
184+
185+
- Zapier
186+
    - [Zapier integration documentation](https://docs.apify.com/platform/integrations/zapier)
187+
    - [Source code on GitHub](https://github.com/apify/apify-zapier-integration)
188+
- Make.com
189+
    - [Make.com integration documentation](https://docs.apify.com/platform/integrations/make)
190+
- Keboola
191+
    - [Keboola integration documentation](https://docs.apify.com/platform/integrations/keboola)
192+
    - [Source code on GitHub](https://github.com/apify/keboola-ex-apify/) (JavaScript)
193+
    - [Google Maps Reviews Scraper integration](https://github.com/apify/keboola-gmrs/) (Actor-specific)
194+
- Airbyte
195+
    - [Source code on GitHub](https://github.com/airbytehq/airbyte/tree/master/airbyte-integrations/connectors/source-apify-dataset) (Python)
196+
- Pipedream
197+
    - [Source code on GitHub](https://github.com/PipedreamHQ/pipedream/tree/65e79d1d66cf0f2fca5ad20a18acd001f5eea069/components/apify)
198+
199+
For technical support, please contact us at [[email protected]](mailto:[email protected]).
