You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
See the top of the file for the type definition for what you can configure:
67
+
See [config.ts](src/config.ts) for all available options. Here is a sample of the common configu options:
70
68
71
69
```ts
72
70
typeConfig= {
@@ -80,15 +78,6 @@ type Config = {
80
78
maxPagesToCrawl:number;
81
79
/** File name for the finished data */
82
80
outputFileName:string;
83
-
/** Optional cookie to be set. E.g. for Cookie Consent */
84
-
cookie?: { name:string; value:string };
85
-
/** Optional function to run for each page found */
86
-
onVisitPage?: (options: {
87
-
page:Page;
88
-
pushData: (data:any) =>Promise<void>;
89
-
}) =>Promise<void>;
90
-
/** Optional timeout for waiting for a selector to appear */
91
-
waitForSelectorTimeout?:number;
92
81
};
93
82
```
94
83
@@ -104,18 +93,6 @@ npm start
104
93
105
94
To obtain the `output.json` with a containerized execution. Go into the `containerapp` directory. Modify the `config.ts` same as above, the `output.json`file should be generated in the data folder. Note : the `outputFileName` property in the `config.ts` file in containerapp folder is configured to work with the container.
The crawl will generate a file called `output.json` at the root of this project. Upload that [to OpenAI](https://platform.openai.com/docs/assistants/overview) to create your custom assistant or custom GPT.
0 commit comments