|
2 | 2 | title: Warehouse Schemas |
3 | 3 | --- |
4 | 4 |
|
5 | | -A **schema** describes the way that the data in a warehouse is organized. Schemas include a detailed description of database elements (tables, views, indexes, synonyms, etc.) and the relationships that exist between elements. |
6 | | - |
7 | | -Schemas of warehouse data are organized into the following template: <br/> |
8 | | -`<source>.<collection>.<property>` for example `segment-engineering.tracks.userId`, where Source refers to the source or project name (segment-engineering), collection refers to the event (tracks), and the property refers to the data being collected (userId). |
| 5 | +A **schema** describes the way that the data in a warehouse is organized. Schemas of warehouse data are organized into the following template: |
| 6 | +`<source>.<collection>.<property>`, for example `segment-engineering.tracks.userId`, where source refers to the source or project name (segment-engineering), collection refers to the event (tracks), and the property refers to the data being collected (userId). |
9 | 7 |
|
10 | 8 | > note "Warehouse column creation" |
11 | 9 | > **Note:** Segment creates tables for each of your custom events in your warehouse, with columns for each event's custom properties. Segment does not allow unbounded `event` or `property` spaces in your data. Instead of recording events like "Ordered Product 15", use a single property of "Product Number" or similar. |
@@ -137,7 +135,7 @@ The table below describes the schema in Segment Warehouses: |
137 | 135 |
|
138 | 136 | ## Identifies table |
139 | 137 |
|
140 | | -The `identifies` table stores the `.identify()` method calls =. Query it to find out user-level information. It has the following columns: |
| 138 | +The `identifies` table stores the `.identify()` method calls. Query it to find out user-level information. It has the following columns: |
141 | 139 |
|
142 | 140 | | method | property | |
143 | 141 | | --------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | |
@@ -428,7 +426,7 @@ The data types that Segment currently supports include: |
428 | 426 |
|
429 | 427 | After analyzing the data from dozens of customers, we set the string column length limit at 512 characters. Longer strings are truncated. We found this was the sweet spot for good performance and ignoring non-useful data. |
430 | 428 |
|
431 | | -We special-case compression for some known columns, like event names and timestamps. The others default to LZO. We may add look-ahead sampling down the road, but from inspecting the datasets today this would be unnecessary complexity. |
| 429 | +We special-case compression for some known columns, like event names and timestamps. The others default to LZO. We may add look-ahead sampling down the road, but from inspecting the datasets today this would be unnecessary complex. |
432 | 430 |
|
433 | 431 | ## Timestamps |
434 | 432 |
|
@@ -476,4 +474,6 @@ All tables use `received_at` for the sort key. Amazon Redshift stores your data |
476 | 474 |
|
477 | 475 | [How do I give users permissions to my warehouse?](/docs/connections/storage/warehouses/add-warehouse-users/) |
478 | 476 |
|
| 477 | +[How frequently does data sync to my warehouse?](/docs/connections/storage/warehouses/warehouse-syncs/#sync-frequency) |
| 478 | + |
479 | 479 | Check out our [Frequently Asked Questions about Warehouses](/docs/connections/storage/warehouses/faq/) and [a list of helpful Redshift queries to get you started](/docs/connections/storage/warehouses/redshift-useful-sql). |
0 commit comments