src/connections/storage/catalog/data-lakes/index.md (19 additions, 39 deletions)

@@ -384,16 +384,14 @@ Running the `plan` command gives you an output that creates 19 new objects, …
 
 ### Segment Data Lakes
 
-{% faq %}
-{% faqitem Do I need to create Glue databases? %}
+
+#### Do I need to create Glue databases?
 No, Data Lakes automatically creates one Glue database per source. This database uses the source slug as its name.
-{% endfaqitem %}
 
-{% faqitem What IAM role do I use in the Settings page? %}
+#### What IAM role do I use in the Settings page?
 Four roles are created when you set up Data Lakes using Terraform. You add the `arn:aws:iam::$ACCOUNT_ID:role/segment-data-lake-iam-role` role to the Data Lakes Settings page in the Segment web app.
-{% endfaqitem %}
 
-{% faqitem What level of access do the AWS roles have? %}
+#### What level of access do the AWS roles have?
 The roles which Data Lakes assigns during set up are:
 
 - **`segment-datalake-iam-role`** - This is the role that Segment assumes to access S3, Glue and the EMR cluster. It allows Segment access to:
@@ -408,54 +406,46 @@ The roles which Data Lakes assigns during set up are:
 - Access only to the specific S3 bucket used for Data Lakes.
 
 - **`segment_emr_autoscaling_role`** - Restricted role that can only be assumed by EMR and EC2. This is set up based on [AWS best practices](https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-iam-role-automatic-scaling.html).
-{% endfaqitem %}
 
-{% faqitem Why doesn't the Data Lakes Terraform module create an S3 bucket? %}
+
+#### Why doesn't the Data Lakes Terraform module create an S3 bucket?
 The module doesn't create a new S3 bucket so you can re-use an existing bucket for your Data Lakes.
-{% endfaqitem %}
 
-{% faqitem Does my S3 bucket need to be in the same region as the other infrastructure? %}
+#### Does my S3 bucket need to be in the same region as the other infrastructure?
 Yes, the S3 bucket and the EMR cluster must be in the same region.
-{% endfaqitem %}
 
-{% faqitem How do I connect a new source to Data Lakes? %}
+#### How do I connect a new source to Data Lakes?
 To connect a new source to Data Lakes:
 
 1. Ensure that the `workspace_id` of the Segment workspace is in the list of [external ids](https://github.com/segmentio/terraform-aws-data-lake/tree/master/modules/iam#external_ids) in the IAM policy. You can either update this from the AWS console, or re-run the [Terraform](https://github.com/segmentio/terraform-aws-data-lake) job.
 2. From your Segment workspace, connect the source to the Data Lakes destination.
-{% endfaqitem %}
 
-{% faqitem Can I configure multiple sources to use the same EMR cluster? %}
+#### Can I configure multiple sources to use the same EMR cluster?
 Yes, you can configure multiple sources to use the same EMR cluster. Segment recommends that the EMR cluster only be used for Data Lakes to ensure there aren't interruptions from non-Data Lakes jobs.
-{% endfaqitem %}
 
-{% faqitem Why don't I see any data in S3 or Glue after enabling a source? %}
+#### Why don't I see any data in S3 or Glue after enabling a source?
 If you don't see data after enabling a source, check the following:
 - Does the IAM role have the Segment account ID and workspace ID as the external ID?
 - Is the EMR cluster running?
 - Is the correct IAM role and S3 bucket configured in the settings?
 
 If all of these look correct and you're still not seeing any data, please [contact the Support team](https://segment.com/help/contact/).
-{% endfaqitem %}
 
-{% faqitem What are "Segment Output" tables in S3? %}
+#### What are "Segment Output" tables in S3?
 The `output` tables are temporary tables Segment creates when loading data. They are deleted after each sync.
-{% endfaqitem %}
 
-{% faqitem Can I make additional directories in the S3 bucket Data Lakes is using? %}
+#### Can I make additional directories in the S3 bucket Data Lakes is using?
 Yes, you can create new directories in S3 without interfering with Segment data.
 Do not modify or create additional directories with the following names:
 - `logs/`
 - `segment-stage/`
 - `segment-data/`
 - `segment-logs/`
-{% endfaqitem %}
 
-{% faqitem What does "partitioned" mean in the table name? %}
+#### What does "partitioned" mean in the table name?
 `Partitioned` just means that the table has partition columns (day and hour). All tables are partitioned, so you should see this on all table names.
-{% endfaqitem %}
 
-{% faqitem How can I use AWS Spectrum to access Data Lakes tables in Glue, and join it with Redshift data? %}
+#### How can I use AWS Spectrum to access Data Lakes tables in Glue, and join it with Redshift data?
 You can use the following command to create external tables in Spectrum to access tables in Glue and join the data with Redshift:
 
 Run the `CREATE EXTERNAL SCHEMA` command:
@@ -471,35 +461,25 @@ create external database if not exists;
 Replace:
 - [glue_db_name] = The Glue database created by Data Lakes, which is named after the source slug
 - [spectrum_schema_name] = The schema name in Redshift you want to map to
-{% endfaqitem %}
-{% endfaq %}
 
 ### Azure Data Lakes
 
-{% faq %}
-
-{% faqitem Does my ALDS-enabled storage account need to be in the same region as the other infrastructure? %}
+#### Does my ADLS-enabled storage account need to be in the same region as the other infrastructure?
 Yes, your storage account and Databricks instance should be in the same region.
-{% endfaqitem %}
 
-{% faqitem What analytics tools are available to use with my Azure Data Lake? %}
+#### What analytics tools are available to use with my Azure Data Lake?
 Azure Data Lakes supports the following post-processing tools:
 - PowerBI
 - Azure HDInsight
 - Azure Synapse Analytics
 - Databricks
-{% endfaqitem %}
 
-{% faqitem What can I do to troubleshoot my Databricks database? %}
+#### What can I do to troubleshoot my Databricks database?
 If you encounter errors related to your Databricks database, try adding the following line to the config: <br/>
 <br/>After you've added the line to your config, restart your cluster so that your changes can take effect. If you continue to encounter errors, [contact Segment Support](https://segment.com/help/contact/){:target="_blank"}.
-{% endfaqitem %}
-
-{% faqitem What do I do if I get a "Version table does not exist" error when setting up the Azure MySQL database? %}
-Check your Spark configs to ensure that the information you entered about the database is correct, then restart the cluster. The Databricks cluster automatically initializes the Hive Metastore, so an issue with your config file will stop the table from being created. If you continue to encounter errors, [contact Segment Support](https://segment.com/help/contact/){:target="_blank"}.
-{% endfaqitem %}
 
-{% endfaq %}
+#### What do I do if I get a "Version table does not exist" error when setting up the Azure MySQL database?
+Check your Spark configs to ensure that the information you entered about the database is correct, then restart the cluster. The Databricks cluster automatically initializes the Hive Metastore, so an issue with your config file will stop the table from being created. If you continue to encounter errors, [contact Segment Support](https://segment.com/help/contact/){:target="_blank"}.
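
Editor's note: the body of the `CREATE EXTERNAL SCHEMA` command is collapsed between the hunks above. For illustration only, a minimal sketch of the Spectrum command follows; the Glue database name (`analytics_ios`, the source slug), the Redshift schema name (`segment_data_lake`), and the IAM role ARN are hypothetical placeholders, not values taken from this changeset.

```sql
-- Sketch only: substitute your own Glue database name (the source slug),
-- Redshift schema name, and IAM role ARN.
CREATE EXTERNAL SCHEMA segment_data_lake            -- [spectrum_schema_name]
FROM DATA CATALOG
DATABASE 'analytics_ios'                            -- [glue_db_name], the source slug
IAM_ROLE 'arn:aws:iam::123456789012:role/MySpectrumRole'
CREATE EXTERNAL DATABASE IF NOT EXISTS;
```

Once the external schema exists, Glue tables can be queried as `segment_data_lake.<table>` and joined with local Redshift tables.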

src/connections/storage/data-lakes/index.md (18 additions, 24 deletions)

@@ -22,7 +22,7 @@ Segment Data Lakes sends Segment data to a cloud data store, either AWS S3 or …
 
 To learn more about Segment Data Lakes, check out the Segment blog post [Introducing Segment Data Lakes](https://segment.com/blog/introducing-segment-data-lakes/){:target="_blank"}.
 
-## How Segment Data Lakes work
+## How Data Lakes work
 
 Segment currently supports Data Lakes hosted on two cloud providers: Amazon Web Services (AWS) and Microsoft Azure. Each cloud provider has a similar system for managing data, but offers different query engines, post-processing systems, and analytics options.
 
@@ -170,28 +170,27 @@ The Data Lakes and Warehouses products are compatible using a mapping, but do …
 When you use Data Lakes, you can either use Data Lakes as your _only_ source of data and query all of your data directly from S3 or ADLS, or you can use Data Lakes in addition to a data warehouse.
 
 ## FAQ
-{% faq %}
 
-{% faqitem What AWS Data Lake features are not supported in the Azure Data Lakes public beta? %}
+### What AWS Data Lake features are not supported in the Azure Data Lakes public beta?
 The following capabilities are supported by Segment Data Lakes but not by the Azure Data Lakes public beta:
 - EU region support
 - Deduplication
 - Sync History and Sync Health in the Segment app
-{% endfaqitem %}
 
-{% faqitem Can I send all of my Segment data into Data Lakes? %}
+
+### Can I send all of my Segment data into Data Lakes?
 Data Lakes supports data from all event sources, including website libraries, mobile, server, and event cloud sources. Data Lakes doesn't support loading [object cloud source data](/docs/connections/sources/#object-cloud-sources) or the users and accounts tables from event cloud sources.
-{% endfaqitem %}
 
-{% faqitem Are user deletions and suppression supported? %}
+
+### Are user deletions and suppression supported?
 Segment doesn't support user deletions in Data Lakes, but supports [user suppression](/docs/privacy/user-deletion-and-suppression/#suppressed-users).
-{% endfaqitem %}
 
-{% faqitem How does Data Lakes handle schema evolution? %}
+
+### How does Data Lakes handle schema evolution?
 As the data schema evolves and new columns are added, Segment Data Lakes will detect any new columns. New columns will be appended to the end of the table in the Glue Data Catalog.
-{% endfaqitem %}
 
-{% faqitem How does Data Lakes work with Protocols? %}
+
+### How does Data Lakes work with Protocols?
 Data Lakes doesn't have a direct integration with [Protocols](/docs/protocols/).
 
 Any changes to events at the source level made with Protocols also change the data for all downstream destinations, including Data Lakes.
@@ -204,21 +203,20 @@ Data types and labels available in Protocols aren't supported by Data Lakes.
 
 - **Data Types** - Data Lakes infers the data type for each event using its own schema inference systems instead of using a data type set for an event in Protocols. This might lead to the data type set in a data lake being different from the data type in the tracking plan. For example, if you set `product_id` to be an integer in the Protocols Tracking Plan, but the event is sent into Segment as a string, then Data Lakes may infer this data type as a string in the Glue Data Catalog.
 - **Labels** - Labels set in Protocols aren't sent to Data Lakes.
-{% endfaqitem %}
 
-{% faqitem How frequently does my Data Lake sync? %}
+
+### How frequently does my Data Lake sync?
 Data Lakes offers 12 syncs in a 24 hour period and doesn't offer a custom sync schedule or selective sync.
-{% endfaqitem %}
 
-{% faqitem What is the cost to use AWS Glue? %}
+
+### What is the cost to use AWS Glue?
 You can find details on Amazon's [pricing for Glue](https://aws.amazon.com/glue/pricing/){:target="_blank"} page. For reference, Data Lakes creates 1 table per event type in your source, and adds 1 partition per hour to the event table.
-{% endfaqitem %}
 
-{% faqitem What is the cost to use Microsoft Azure? %}
+### What is the cost to use Microsoft Azure?
 You can find details on Microsoft's [pricing for Azure](https://azure.microsoft.com/en-us/pricing/){:target="_blank"} page. For reference, Data Lakes creates 1 table per event type in your source, and adds 1 partition per hour to the event table.
-{% endfaqitem %}
 
-{% faqitem What limits does AWS Glue have? %}
+
+### What limits does AWS Glue have?
 AWS Glue has limits across various factors, such as number of databases per account, tables per account, and so on. See the [full list of Glue limits](https://docs.aws.amazon.com/general/latest/gr/glue.html#limits_glue){:target="_blank"} for more information.
 
 The most common limits to keep in mind are:
@@ -230,14 +228,10 @@ Segment stops creating new tables for the events after you exceed this limit. …
 
 You should also read the [additional considerations in Amazon's documentation](https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hive-metastore-glue.html){:target="_blank"} when using AWS Glue Data Catalog.
 
-{% endfaqitem %}
-
-{% faqitem What analytics tools are available to use with my Azure Data Lake? %}
+### What analytics tools are available to use with my Azure Data Lake?
 Azure Data Lakes supports the following analytics tools:

src/connections/storage/data-lakes/sync-history.md (7 additions, 13 deletions)

@@ -35,24 +35,18 @@ Above the Daily Row Volume table is an overview of the total syncs for the …
 To access the Sync history page from the Segment app, open the **My Destinations** page and select the data lake. On the data lakes settings page, select the **Health** tab.
 
 ## Data Lakes Reports FAQ
-{% faq %}
-{% faqitem How long is a data point available? %}
+
+### How long is a data point available?
 The health tab shows an aggregate view of the last 30 days worth of data, while the sync history retains the last 100 syncs.
-{% endfaqitem %}
 
-{% faqitem How do sync history and health compare? %}
+### How do sync history and health compare?
 The sync history feature shows detailed information about the most recent 100 syncs to a data lake, while the health tab shows just the number of rows synced to the data lake over the last 30 days.
-{% endfaqitem %}
 
-{% faqitem What timezone is the time and date information in? %}
+### What timezone is the time and date information in?
 All dates and times on the sync history and health pages are in the user's local time.
-{% endfaqitem %}
 
-{% faqitem When does the data update? %}
+### When does the data update?
 The sync data for both reports updates in real time.
-{% endfaqitem %}
 
-{% faqitem When do syncs occur? %}
-Syncs occur approximately every two hours. Users cannot choose how frequently the data lake syncs.
-{% endfaqitem %}
-{% endfaq %}
+### When do syncs occur?
+Syncs occur approximately every two hours. Users cannot choose how frequently the data lake syncs.

src/connections/storage/data-lakes/sync-reports.md (4 additions, 7 deletions)

@@ -264,13 +264,10 @@ Internal errors occur in Segment's internal systems, and should resolve on their …
 
 ## FAQ
 
-{% faq %}
-{% faqitem How are Data Lakes sync reports different from the sync data for Segment Warehouses? %}
+### How are Data Lakes sync reports different from the sync data for Segment Warehouses?
 Both Warehouses and Data Lakes provide similar information about syncs, including the start and finish time, rows synced, and errors.
 
 However, Warehouse sync information is only available in the Segment app: on the Sync History and Warehouse Health pages. With Data Lakes sync reports, the raw sync information is sent directly to your data lake. This means you can query the raw data and answer your own questions about syncs, and use the data to power alerting and monitoring tools.
-{% endfaqitem %}
-
-{% faqitem What happens if a sync is partly successful? %}
-Sync reports are currently generated only when a sync completes, or when it fails. Partial failure reporting is not currently supported.
-{% endfaqitem %}
-{% endfaq %}
+
+### What happens if a sync is partly successful?
+Sync reports are currently generated only when a sync completes, or when it fails. Partial failure reporting is not currently supported.
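
Editor's note: because sync reports land in your own data lake, you can query them directly. As a rough sketch only, an Athena-style query over the last week of syncs might look like the following; the table and column names are hypothetical placeholders, not the actual schema Segment writes.

```sql
-- Sketch only: table and column names are hypothetical placeholders for
-- wherever the sync report data lands in your Glue catalog.
SELECT source_id,
       sync_start_time,
       sync_end_time,
       rows_synced,
       error_code
FROM data_lake_sync_reports
WHERE sync_end_time > date_add('day', -7, current_timestamp)
ORDER BY sync_end_time DESC;
```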