src/guides/how-to-guides/import-historical-data.md
6 additions & 6 deletions
@@ -9,7 +9,7 @@ When transitioning over to Segment, customers commonly want to import historical
## Method 1: Using a Custom Solution
### General Instructions
-Use any [server-side library](https://segment.com/docs/connections/sources/#server), which sends requests in batches to improve performance. Once you have data to import, follow the steps below:
+Use any [server-side library](/docs/connections/sources/#server), which sends requests in batches to improve performance. Once you have data to import, follow the steps below:
1. Export or collect the data to be imported.
@@ -28,9 +28,9 @@ Use any [server-side library](https://segment.com/docs/connections/sources/#ser
3. Once you've done that, you'll need to write an application or worker to send the data to Segment.
-You will need to cycle through each set of data and map it to a Segment server-side library method or build an array matching the [HTTP Import API format](https://segment.com/docs/connections/sources/catalog/libraries/server/http/#import).
+You will need to cycle through each set of data and map it to a Segment server-side library method or build an array matching the [HTTP Import API format](/docs/connections/sources/catalog/libraries/server/http/#import).
-**Tip**: We recommend using a Segment library for this process, as they will set contextual message fields like `message_id` (used for deduping) and `sent_at` (used for correctly client clock skew) that our API will use to ensure correct behavior upon ingestion.
+**Tip**: Segment recommends using a Segment library for this process, as they set contextual message fields like `message_id` (used for deduping) and `sent_at` (used for correctly client clock skew) that Segment's API uses to correct behavior upon ingestion.
**Tip**: The server-side libraries will automatically batch requests to optimize for performance and prevent linear request volume. This batching behavior is modifiable, and some of the underlying libraries implement a configurable max queue size that may discard messages if you enqueue requests much faster than the client can flush them. We recommend overriding the max queue size parameter for the library to a high value you're comfortable you can remain under in your batch job.
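For readers following step 3 in this hunk, the "cycle through each set of data and build an array" approach might look roughly like the sketch below. It is only an illustration under stated assumptions: the input row keys (`user_id`, `event_name`, `occurred_at_epoch`), the helper names, and the batch size of 100 are all hypothetical, and the `{"batch": [...]}` payload shape should be checked against the linked HTTP API documentation before use. The sketch only builds payloads and does not send anything.

```python
from datetime import datetime, timezone
from typing import Any, Dict, Iterable, Iterator, List


def to_track_message(row: Dict[str, Any]) -> Dict[str, Any]:
    """Map one exported row (hypothetical shape) to a track-style message."""
    occurred = datetime.fromtimestamp(row["occurred_at_epoch"], tz=timezone.utc)
    return {
        "type": "track",
        "userId": str(row["user_id"]),
        "event": row["event_name"],
        "properties": dict(row.get("properties", {})),
        # Historical data should carry the time the event originally happened.
        "timestamp": occurred.isoformat().replace("+00:00", "Z"),
    }


def chunked(
    messages: Iterable[Dict[str, Any]], size: int
) -> Iterator[List[Dict[str, Any]]]:
    """Yield lists of at most `size` messages, preserving order."""
    batch: List[Dict[str, Any]] = []
    for message in messages:
        batch.append(message)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch


def build_payloads(
    rows: Iterable[Dict[str, Any]], batch_size: int = 100
) -> List[Dict[str, Any]]:
    """Build one request body per chunk (assumed shape: {"batch": [...]})."""
    messages = (to_track_message(row) for row in rows)
    return [{"batch": batch} for batch in chunked(messages, batch_size)]
```

Keeping the mapping and the chunking separate means the same `to_track_message` output could instead be passed to a server-side library call, which is the route the tips above recommend.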
@@ -47,10 +47,10 @@ Additionally, one of Segment's Software Engineers developed a React App with mor
[MarketLytics](http://marketlytics.com/){:target="_blank"} has documented their experience using the alpha prototype importer and offer some [helpful visuals and tips](http://marketlytics.com/blog/import-historic-data-to-segment){:target="_blank"}.
### Alternative solution
-If a server-side library doesn't meet your needs, you can use the Segment [bulk import HTTP API](https://segment.com/docs/connections/sources/catalog/libraries/server/http/#import) directly.
+If a server-side library doesn't meet your needs, you can use the Segment [bulk import HTTP API](/docs/connections/sources/catalog/libraries/server/http/#import) directly.
-**Note:** When the historical data were initially exported by Segment, we recommend removing all the original `sent_at`, `message_id`, and `project_id` fields from the archived message before forwarding them to Segment.
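The cleanup described in this note could be sketched as below. This is an illustration only: the helper name and the sample message are hypothetical, and the field names are taken exactly as the note spells them, so they are assumptions about how an archive is keyed. If an archive stores these fields under different keys, the `fields` argument would need adjusting.

```python
from typing import Any, Dict, Iterable

# Field names as written in the note above; the actual keys in an archive
# may be spelled differently, so treat this tuple as an assumption.
ORIGINAL_DELIVERY_FIELDS = ("sent_at", "message_id", "project_id")


def strip_delivery_fields(
    message: Dict[str, Any],
    fields: Iterable[str] = ORIGINAL_DELIVERY_FIELDS,
) -> Dict[str, Any]:
    """Return a copy of `message` without the listed top-level fields."""
    to_drop = set(fields)
    return {key: value for key, value in message.items() if key not in to_drop}
```

The function returns a new dict rather than mutating its input, so the archived record stays intact if the forwarding step has to be retried.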