scenarios/basics/entity_resolution/README.md
Lines changed: 26 additions & 12 deletions
@@ -1,27 +1,41 @@
+# AWS Entity Resolution Java Program
+
 ## Overview
-This AWS Entity Resolution basic scenario demonstrates how to interact with the AWS Entity Resolution service using an AWS SDK. The scenario covers various operations such as creating a schema mapping, creating a matching workflow, starting a matching job, and so on.
+This AWS Entity Resolution basic scenario demonstrates how to interact with the AWS Entity Resolution service using an AWS SDK. This Java application demonstrates how to use AWS Entity Resolution to integrate and deduplicate data from multiple sources using machine learning-based matching. The program walks through setting up AWS resources, uploading structured data, defining schema mappings, creating a matching workflow, and running a matching job.

-## Key Operations

-1.**Create an AWS Entity Resolution schema mapping**:
-- This step creates an AWS Entity Resolution schema mapping by invoking the `createSchemaMapping` method.
+**Note:** See the [specification document](SPECIFICATION.md) for a complete list of operations.

-2.**Create an AWS Entity Resolution workflow**:
-- This step creates an AWS Entity Resolution matching workflow by invoking the `createMatchingWorkflow` method.
+## Features

-3.**Start a matching aorkflow**:
-- This step starts the AWS Entity Resolution matching workflow by invoking the `startMatchingJob` method.
+1. Uses AWS CloudFormation to create necessary resources:

-4.**Get workflow job details**:
-- This step gets workflow job details by invoking the `getMatchingJob` method.
+- AWS Glue Data Catalog table

+- AWS IAM role

-**Note:** See the [specification document](SPECIFICATION.md) for a complete list of operations.
+- AWS S3 bucket
+
+- AWS Entity Resolution Schema
+
+2. Uploads sample JSON and CSV data to S3
+
+3. Creates schema mappings for JSON and CSV datasets
+
+4. Creates and starts an Entity Resolution matching workflow
+
+5. Retrieves job details and schema mappings
+
+6. Lists available schema mappings
+
+7. Tags AWS resources for better organization
+
+8. Views the results of the workflow

 ## Resources

 This Basics scenario requires an IAM role that has permissions to work with the AWS Entity Resolution service,
-an AWS Glue database, and two S3 buckets. A CDK script is provided to create these resources.
+an AWS Glue database, and an S3 bucket. A CDK script is provided to create these resources.
 See the resources [Readme](../../../resources/cdk/entityresolution_resources/README.md) file.
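
For orientation, the call order that the README describes (`createSchemaMapping`, `createMatchingWorkflow`, `startMatchingJob`, then `getMatchingJob` until the job finishes) can be sketched in Java roughly as below. This is a minimal illustration under stated assumptions, not the scenario's actual code. `EntityResolutionOps` and `FakeOps` are hypothetical stand-ins invented for this sketch; they only borrow the method names from the README, and their simplified signatures are not the AWS SDK's. The status strings `QUEUED`, `RUNNING`, and `SUCCEEDED`, and the names `jsonSchema`, `csvSchema`, and `demoWorkflow`, are likewise assumptions.

```java
import java.util.ArrayList;
import java.util.List;

public class MatchingScenarioSketch {

    /** Hypothetical stand-in for the service client. NOT the AWS SDK API. */
    interface EntityResolutionOps {
        String createSchemaMapping(String schemaName);                        // returns a schema ARN
        String createMatchingWorkflow(String workflowName, List<String> schemaArns); // returns a workflow ARN
        String startMatchingJob(String workflowName);                         // returns a job ID
        String getMatchingJob(String workflowName, String jobId);             // returns a job status
    }

    /** In-memory fake so the sketch runs without AWS credentials or resources. */
    static class FakeOps implements EntityResolutionOps {
        final List<String> calls = new ArrayList<>();
        private int polls = 0;

        public String createSchemaMapping(String schemaName) {
            calls.add("createSchemaMapping");
            return "arn:fake:schema/" + schemaName;
        }

        public String createMatchingWorkflow(String workflowName, List<String> schemaArns) {
            calls.add("createMatchingWorkflow");
            return "arn:fake:workflow/" + workflowName;
        }

        public String startMatchingJob(String workflowName) {
            calls.add("startMatchingJob");
            return "job-1";
        }

        public String getMatchingJob(String workflowName, String jobId) {
            calls.add("getMatchingJob");
            // Report RUNNING for the first two polls, then SUCCEEDED.
            return ++polls < 3 ? "RUNNING" : "SUCCEEDED";
        }
    }

    /** Runs the steps in order and polls the job at most maxPolls times. */
    static String runScenario(EntityResolutionOps ops, int maxPolls) {
        // One schema mapping per input dataset (JSON and CSV).
        String jsonSchemaArn = ops.createSchemaMapping("jsonSchema");
        String csvSchemaArn = ops.createSchemaMapping("csvSchema");

        ops.createMatchingWorkflow("demoWorkflow", List.of(jsonSchemaArn, csvSchemaArn));
        String jobId = ops.startMatchingJob("demoWorkflow");

        String status = "UNKNOWN";
        for (int i = 0; i < maxPolls; i++) {
            status = ops.getMatchingJob("demoWorkflow", jobId);
            if (!status.equals("RUNNING") && !status.equals("QUEUED")) {
                break; // the job reached a terminal state
            }
            // A real client would wait between polls; omitted to keep the sketch fast.
        }
        return status;
    }

    public static void main(String[] args) {
        FakeOps ops = new FakeOps();
        System.out.println("Final status: " + runScenario(ops, 10));
        System.out.println("Calls made: " + ops.calls);
    }
}
```

Hiding the client behind a small interface is only a convenience for this sketch: it lets the ordering and polling logic run locally, while a real implementation would delegate each method to the SDK client and add error handling.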