Commit f62fd4d

Merge pull request #298925 from Prakash496/csvdocfix
Removed postman references in csv ingestion file
2 parents a8da392 + 4db2148 commit f62fd4d

articles/energy-data-services/tutorial-csv-ingestion.md

Lines changed: 249 additions & 71 deletions
@@ -18,87 +18,265 @@ Comma-separated values (CSV) parser ingestion provides the capability to ingest

In this tutorial, you learn how to:

-> [!div class="checklist"]
->
-> * Ingest a sample wellbore data CSV file into an Azure Data Manager for Energy instance by using Postman.
-> * Search for storage metadata records created during CSV ingestion by using Postman.
+> * Ingest a sample wellbore data CSV file into an Azure Data Manager for Energy instance by using `cURL`.
+> * Search for storage metadata records created during CSV ingestion by using `cURL`.

## Prerequisites
-
-Before you start this tutorial, complete the following prerequisites.
+* An Azure subscription
+* An instance of [Azure Data Manager for Energy](quickstart-create-microsoft-energy-data-services-instance.md) created in your Azure subscription
+* cURL command-line tool installed on your machine
+* A service principal access token for calling the Azure Data Manager for Energy APIs, as shown in the sketch after this list. See [How to generate auth token](how-to-generate-auth-token.md).
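A minimal sketch of that token request, assuming the client-credentials flow from [How to generate auth token](how-to-generate-auth-token.md); `CLIENT_ID`, `CLIENT_SECRET`, and `TENANT_ID` come from your app registration and are placeholders here:

```bash
# Request an access token from Microsoft Entra ID (client-credentials flow).
curl -X POST "https://login.microsoftonline.com/<TENANT_ID>/oauth2/v2.0/token" \
  -H "Content-Type: application/x-www-form-urlencoded" \
  -d "grant_type=client_credentials" \
  -d "client_id=<CLIENT_ID>" \
  -d "client_secret=<CLIENT_SECRET>" \
  -d "scope=<CLIENT_ID>/.default"
```

The `access_token` field of the JSON response is the `<access_token>` value used throughout this tutorial.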

### Get details for the Azure Data Manager for Energy instance

-* You need an Azure Data Manager for Energy instance. If you don't already have one, create one by following the steps in [Quickstart: Create an Azure Data Manager for Energy instance](quickstart-create-microsoft-energy-data-services-instance.md).
* For this tutorial, you need the following parameters:

-| Parameter | Value to use | Example | Where to find this value |
-| ------------------ | ------------------------ |-------------------------------------- |-------------------------------------- |
-| `CLIENT_ID` | Application (client) ID | `00001111-aaaa-2222-bbbb-3333cccc4444` | You use this app or client ID when registering the application with the Microsoft identity platform. See [Register an application](../active-directory/develop/quickstart-register-app.md#register-an-application). |
-| `CLIENT_SECRET` | Client secrets | `_fl******************` | Sometimes called an *application password*, a client secret is a string value that your app can use in place of a certificate to identify itself. See [Add a client secret](../active-directory/develop/quickstart-register-app.md#add-a-client-secret). |
-| `TENANT_ID` | Directory (tenant) ID | `72f988bf-86f1-41af-91ab-xxxxxxxxxxxx` | Hover over your account name in the Azure portal to get the directory or tenant ID. Alternately, search for and select **Microsoft Entra ID** > **Properties** > **Tenant ID** in the Azure portal. |
-| `SCOPE` | Application (client) ID | `00001111-aaaa-2222-bbbb-3333cccc4444` | This value is the same as the app or client ID mentioned earlier. |
-| `refresh_token` | Refresh token value | `0.ATcA01-XWHdJ0ES-qDevC6r...........` | Follow [How to generate auth token](how-to-generate-auth-token.md) to create a refresh token and save it. You need this refresh token later to generate a user token. |
-| `DNS` | URI | `<instance>.energy.Azure.com` | Find this value on the overview page of the Azure Data Manager for Energy instance. |
-| `data-partition-id` | Data partitions | `<data-partition-id>` | Find this value on the Data Partitions page of the Azure Data Manager for Energy instance. |
+| Parameter | Value to use | Example | Where to find this value |
+|----|----|----|----|
+| `DNS` | URI | `<instance>.energy.azure.com` | Find this value on the overview page of the Azure Data Manager for Energy instance. |
+| `data-partition-id` | Data partitions | `<data-partition-id>` | Find this value in the Data Partitions section of the Azure Data Manager for Energy instance. |
+| `access_token` | Access token value | `0.ATcA01-XWHdJ0ES-qDevC6r...........` | Follow [How to generate auth token](how-to-generate-auth-token.md) to create an access token and save it. |

Follow the [Manage users](how-to-manage-users.md) guide to add appropriate entitlements for the user who's running this tutorial.

-### Set up Postman and execute requests
-
-1. Download and install the [Postman](https://www.postman.com/) desktop app.
-
-1. Import the following files into Postman:
-
-   * [CSV workflow Postman collection](https://raw.githubusercontent.com/microsoft/meds-samples/main/postman/IngestionWorkflows.postman_collection.json)
-   * [CSV workflow Postman environment](https://raw.githubusercontent.com/microsoft/meds-samples/main/postman/IngestionWorkflowEnvironment.postman_environment.json)
-
-   To import the Postman collection and environment variables, follow the steps in [Importing data into Postman](https://learning.postman.com/docs/getting-started/importing-and-exporting-data/#importing-data-into-postman).
-
-1. Update **CURRENT VALUE** for the Postman environment with the information that you obtained in the details of the Azure Data Manager for Energy instance.
-
-1. The Postman collection for CSV parser ingestion contains 10 requests that you must execute sequentially.
-
-   Be sure to choose **Ingestion Workflow Environment** before you trigger the Postman collection.
-
-   :::image type="content" source="media/tutorial-csv-ingestion/tutorial-postman-choose-environment.png" alt-text="Screenshot of the Postman environment." lightbox="media/tutorial-csv-ingestion/tutorial-postman-choose-environment.png":::
-
-1. Trigger each request by selecting the **Send** button.
-
-   On every request, Postman validates the actual API response code against the expected response code. If there's any mismatch, the test section indicates failures.
-
-   Here's an example of a successful Postman request:
-
-   :::image type="content" source="media/tutorial-csv-ingestion/tutorial-postman-test-success.png" alt-text="Screenshot of a successful Postman call." lightbox="media/tutorial-csv-ingestion/tutorial-postman-test-success.png":::
-
-   Here's an example of a failed Postman request:
-
-   :::image type="content" source="media/tutorial-csv-ingestion/tutorial-postman-test-failure.png" alt-text="Screenshot of a failed Postman call." lightbox="media/tutorial-csv-ingestion/tutorial-postman-test-failure.png":::
-
-## Ingest wellbore data by using Postman
-
-To ingest a sample wellbore data CSV file into the Azure Data Manager for Energy instance by using the Postman collection, complete the following steps:
-
-1. **Get a User Access Token**: Generate the user token, which will be used to authenticate further API calls.
-1. **Create a Schema**: Generate a schema that adheres to the columns present in the CSV file.
-1. **Get Schema details**: Get the schema created in the previous step and validate it.
-1. **Create a Legal Tag**: Create a legal tag that will be added to the CSV data for data compliance purposes.
-1. **Get a signed URL for uploading a CSV file**: Get the signed URL path to which the CSV file will be uploaded.
-1. **Upload a CSV file**: Download the [Wellbore.csv](https://github.com/microsoft/meds-samples/blob/main/test-data/wellbore.csv) sample to your local machine, and then select this file in Postman by clicking the **Select File** button.
-
-   :::image type="content" source="media/tutorial-csv-ingestion/tutorial-select-csv-file.png" alt-text="Screenshot of uploading a CSV file." lightbox="media/tutorial-csv-ingestion/tutorial-select-csv-file.png":::
-1. **Upload CSV file metadata**: Upload the file metadata information, such as file location and other relevant fields.
-1. **Create a CSV Parser Ingestion Workflow**: Create the directed acyclic graph (DAG) for the CSV parser ingestion workflow.
-1. **Trigger a CSV Parser Ingestion Workflow**: Trigger the DAG for the CSV parser ingestion workflow.
-1. **Search for ingested CSV Parser Ingestion Workflow status**: Get the status of the CSV parser's DAG run.
-
-## Search for ingested wellbore data by using Postman
-
-To search for the storage metadata records created during the CSV ingestion by using the Postman collection, complete the following step:
-
-* **Search for ingested CSV records**: Search for the CSV records created earlier.
-
-   :::image type="content" source="media/tutorial-csv-ingestion/tutorial-search-success.png" alt-text="Screenshot of searching ingested CSV records." lightbox="media/tutorial-csv-ingestion/tutorial-search-success.png":::
+### Set up your environment
+
+Ensure that `cURL` is installed on your system. You use it to make all of the API calls in this tutorial.
+
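Optionally, export the shared values once so the commands that follow are easier to paste; this is a convenience sketch, and the variable names are this tutorial's own rather than anything the service defines:

```bash
curl --version   # confirm cURL is available

# Values from the parameters table above.
export DNS="<instance>.energy.azure.com"
export ACCESS_TOKEN="<access_token>"
export DATA_PARTITION="<data-partition-id>"
```

The steps below show literal placeholders; substitute `$DNS`, `$ACCESS_TOKEN`, and `$DATA_PARTITION` if you export them.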
+## Ingest wellbore data by using `cURL`
+
+To ingest a sample wellbore data CSV file into the Azure Data Manager for Energy instance, complete the following steps. Replace the placeholders (`<DNS>`, `<access_token>`, and so on) with the appropriate values.
+
+### 1. Create a Schema
+
+Run the following `cURL` command to create a schema:
+
+```bash
+curl -X POST "https://<DNS>/api/schema-service/v1/schema" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "Content-Type: application/json" \
+  -H "data-partition-id: <data-partition-id>" \
+  -d '{
+    "schemaInfo": {
+      "schemaIdentity": {
+        "authority": "<data-partition-id>",
+        "source": "shapeFiletest",
+        "entityType": "testEntity",
+        "schemaVersionPatch": 1,
+        "schemaVersionMinor": 0,
+        "schemaVersionMajor": 0
+      },
+      "status": "DEVELOPMENT"
+    },
+    "schema": {
+      "$schema": "http://json-schema.org/draft-07/schema#",
+      "title": "Wellbore",
+      "type": "object",
+      "properties": {
+        "UWI": {
+          "type": "string",
+          "description": "Unique Wellbore Identifier"
+        }
+      }
+    }
+  }'
+```
+
+**Sample Response:**
+
+```json
+{
+  "id": "schema-12345",
+  "status": "DEVELOPMENT"
+}
+```
+
+Save the `id` from the response for use in subsequent steps.
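If `jq` is available on your machine (an assumption, not a stated prerequisite), you can pull individual fields out of any of these JSON responses; for example, against the sample response above:

```bash
# Extract the "id" field from a JSON response.
echo '{ "id": "schema-12345", "status": "DEVELOPMENT" }' | jq -r '.id'   # prints: schema-12345
```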
+
+### 2. Create a Legal Tag
+
+Run the following `cURL` command to create a legal tag:
+
+```bash
+curl -X POST "https://<DNS>/api/legal/v1/legaltags" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "Content-Type: application/json" \
+  -H "data-partition-id: <data-partition-id>" \
+  -d '{
+    "name": "LegalTagName",
+    "description": "Legal Tag added for Well",
+    "properties": {
+      "contractId": "123456",
+      "countryOfOrigin": ["US", "CA"],
+      "dataType": "Third Party Data",
+      "exportClassification": "EAR99",
+      "originator": "Schlumberger",
+      "personalData": "No Personal Data",
+      "securityClassification": "Private",
+      "expirationDate": "2025-12-25"
+    }
+  }'
+```
+
+**Sample Response:**
+
+```json
+{
+  "name": "LegalTagName",
+  "status": "Created"
+}
+```
+
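To confirm the tag was created, you can read it back; a hedged sketch, assuming the standard OSDU legal API `GET` endpoint and that the service stores the name prefixed with the data partition ID (the prefixed form also appears in step 5):

```bash
# Read back the legal tag created above.
curl -X GET "https://<DNS>/api/legal/v1/legaltags/<data-partition-id>-LegalTagName" \
  -H "Authorization: Bearer <access_token>" \
  -H "data-partition-id: <data-partition-id>"
```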
+### 3. Get a Signed URL for Uploading a CSV File
+
+Run the following `cURL` command to get a signed URL:
+
+```bash
+curl -X GET "https://<DNS>/api/file/v2/files/uploadURL" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "data-partition-id: <data-partition-id>"
+```
+
+**Sample Response:**
+
+```json
+{
+  "SignedURL": "https://storageaccount.blob.core.windows.net/container/file.csv?sv=...",
+  "FileSource": "file-source-12345"
+}
+```
+
+Save the `SignedURL` and `FileSource` from the response for use in the next steps.
+
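Because both values feed later steps, one convenient pattern is to capture them in shell variables; a sketch that again assumes `jq`:

```bash
# Request the upload URL once, then pull out both fields.
RESPONSE=$(curl -s -X GET "https://<DNS>/api/file/v2/files/uploadURL" \
  -H "Authorization: Bearer <access_token>" \
  -H "data-partition-id: <data-partition-id>")
SIGNED_URL=$(echo "$RESPONSE" | jq -r '.SignedURL')
FILE_SOURCE=$(echo "$RESPONSE" | jq -r '.FileSource')
```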
+### 4. Upload a CSV File
+
+Download the [Wellbore.csv](https://github.com/microsoft/meds-samples/blob/main/test-data/wellbore.csv) sample to your local machine. Then, run the following `cURL` command to upload the file:
+
+```bash
+curl -X PUT -T "Wellbore.csv" "<SignedURL>" -H "x-ms-blob-type: BlockBlob"
+```
+
+A successful upload to the signed Azure Blob Storage URL returns HTTP status `201 Created` with an empty body; there is no JSON response to inspect.
+
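To verify the upload from a script, ask `cURL` to print just the status code and expect `201`:

```bash
# -o discards the (empty) response body; -w prints the HTTP status code.
curl -s -o /dev/null -w "%{http_code}\n" -X PUT -T "Wellbore.csv" "<SignedURL>" \
  -H "x-ms-blob-type: BlockBlob"
```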
+### 5. Upload CSV File Metadata
+
+Run the following `cURL` command to upload metadata for the CSV file:
+
+```bash
+curl -X POST "https://<DNS>/api/file/v2/files/metadata" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "Content-Type: application/json" \
+  -H "data-partition-id: <data-partition-id>" \
+  -d '{
+    "kind": "osdu:wks:dataset--File.Generic:1.0.0",
+    "acl": {
+      "viewers": ["data.default.viewers@<data-partition-id>.dataservices.energy"],
+      "owners": ["data.default.owners@<data-partition-id>.dataservices.energy"]
+    },
+    "legal": {
+      "legaltags": ["<data-partition-id>-LegalTagName"],
+      "otherRelevantDataCountries": ["US"],
+      "status": "compliant"
+    },
+    "data": {
+      "DatasetProperties": {
+        "FileSourceInfo": {
+          "FileSource": "<FileSource>"
+        }
+      }
+    }
+  }'
+```
+
+**Sample Response:**
+
+```json
+{
+  "id": "metadata-12345",
+  "status": "Created"
+}
+```
+
+Save the `id` from the response. This ID identifies the uploaded file and is used in the next step.
+
+### 6. Trigger a CSV Parser Ingestion Workflow
+
+Run the following `cURL` command to trigger the ingestion workflow:
+
+```bash
+curl -X POST "https://<DNS>/api/workflow/v1/workflow/csv-parser/workflowRun" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "Content-Type: application/json" \
+  -H "data-partition-id: <data-partition-id>" \
+  -d '{
+    "executionContext": {
+      "id": "<uploadedFileId>",
+      "dataPartitionId": "<data-partition-id>"
+    }
+  }'
+```
+
+**Sample Response:**
+
+```json
+{
+  "runId": "workflow-12345",
+  "status": "Running"
+}
+```
+
+Save the `runId` from the response for use in the next step.
+
+### 7. Check the Status of the Workflow
+
+Run the following `cURL` command to check the status of the workflow run:
+
+```bash
+curl -X GET "https://<DNS>/api/workflow/v1/workflow/csv-parser/workflowRun/<runId>" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "Content-Type: application/json" \
+  -H "data-partition-id: <data-partition-id>"
+```
+
+**Sample Response:**
+
+```json
+{
+  "runId": "workflow-12345",
+  "status": "Completed"
+}
+```
+
+Check the status every few seconds until the response indicates successful completion, as in the polling sketch that follows.
+
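A minimal polling sketch, assuming `jq` and a POSIX shell; the run is treated as finished as soon as the status leaves `Running`, and the exact set of terminal statuses is an assumption to verify against your instance:

```bash
# Poll the workflow run until it leaves the "Running" state.
while true; do
  STATUS=$(curl -s "https://<DNS>/api/workflow/v1/workflow/csv-parser/workflowRun/<runId>" \
    -H "Authorization: Bearer <access_token>" \
    -H "data-partition-id: <data-partition-id>" | jq -r '.status')
  echo "Workflow status: $STATUS"
  if [ "$STATUS" != "Running" ]; then
    break   # "Completed" indicates success; investigate any other terminal status
  fi
  sleep 10
done
```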
+### 8. Search for Ingested CSV Records
+
+Run the following `cURL` command to search for ingested records:
+
+```bash
+curl -X POST "https://<DNS>/api/search/v2/query" \
+  -H "Authorization: Bearer <access_token>" \
+  -H "Content-Type: application/json" \
+  -H "data-partition-id: <data-partition-id>" \
+  -d '{
+    "kind": "osdu:wks:dataset--File.Generic:1.0.0"
+  }'
+```
+
+**Sample Response:**
+
+```json
+{
+  "results": [
+    {
+      "id": "dataset-12345",
+      "kind": "osdu:wks:dataset--File.Generic:1.0.0",
+      "status": "Available"
+    }
+  ]
+}
+```
+
+The records ingested earlier appear in the search results.
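The query above returns every record of the given kind. To narrow it to the file uploaded in this tutorial, the OSDU search service also accepts a `query` string and a `limit`; a hedged sketch, assuming standard OSDU search syntax and using the `FileSource` value from step 3:

```bash
# Search only for the record whose FileSource matches the uploaded file.
curl -X POST "https://<DNS>/api/search/v2/query" \
  -H "Authorization: Bearer <access_token>" \
  -H "Content-Type: application/json" \
  -H "data-partition-id: <data-partition-id>" \
  -d '{
    "kind": "osdu:wks:dataset--File.Generic:1.0.0",
    "query": "data.DatasetProperties.FileSourceInfo.FileSource:\"<FileSource>\"",
    "limit": 10
  }'
```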

## Next step
