This repository was archived by the owner on Dec 30, 2024. It is now read-only.

Commit cdfad32

Merge pull request #67 from aws-solutions/feature/v2.0.0
README updates for v2.0.0
2 parents 86d6869 + 5b3de02 commit cdfad32

File tree

1 file changed

+17
-9
lines changed


README.md

Lines changed: 17 additions & 9 deletions
Original file line number | Diff line number | Diff line change
@@ -1,17 +1,25 @@
11
## Discovering Hot Topics using Machine Learning
22

3-
The Discovering Hot Topics Using Machine Learning solution helps you identify the most dominant topics associated with your products, policies, events, and brands. Implementing this solution helps you react quickly to new growth opportunities, address negative brand associations, and deliver higher levels of customer satisfaction.
3+
The Discovering Hot Topics Using Machine Learning solution identifies the most dominant topics associated with your products, policies, events, and brands. This enables you to react quickly to new growth opportunities, address negative brand associations, and deliver a higher level of customer satisfaction for your business. In addition to helping you understand what your customers are saying about your brand, this solution gives you insights into topics that are relevant to your business.
44

5-
The solution automates digital asset (text and image) ingestion from twitter, RSS news feeds, and YouTube comments to provide near-real-time inferences using machine learning algorithms through Amazon Comprehend, Amazon Translate, and Amazon Rekognition to perform topic modeling, sentiment analysis, entity and key phrase detection, and detect any unsafe images. The solution then visualizes these large-scale customer analyses using an Amazon QuickSight dashboard. This guide provides step-by-step instructions for deploying this solution including a pre-built dashboard that provides you with the context and insights necessary to identify trends that help or harm your brand.
5+
This solution deploys an AWS CloudFormation template to automate data ingestion from these sources:
66

7-
The solution performs the following key features:
7+
- Twitter
8+
- RSS news feeds
9+
- YouTube comments tied to videos
10+
- Reddit (comments from subreddits of interest)
11+
- custom data in JSON or XLSX format
812

9-
- **Performs topic modeling to detect dominant topics**: identifies the terms that collectively form a topic from within customer feedback
10-
- **Identifies the sentiment of what customers are saying**: uses contextual semantic search to understand the nature of online discussions
11-
- **Determines if images associated with your brand contain unsafe content**: detects unsafe and negative imagery in content
12-
- **Helps customers identify insights in near real-time**: you can use a visualization dashboard to better understand context, threats, and opportunities almost instantly
13+
This solution uses pre-trained machine learning (ML) models from Amazon Comprehend, Amazon Translate, and Amazon Rekognition to provide these benefits:
1314

14-
This solution deploys an AWS CloudFormation template that supports Twitter, RSS feeds, and YouTube comments as data source options for ingestion, but the solution can be customized to aggregate other social media platforms and internal enterprise systems.
15+
- **Detecting dominant topics using topic modeling**-identifies the terms that collectively form a topic.
16+
- **Identifying the sentiment of what customers are saying**—uses contextual semantic search to understand the nature of online discussions.
17+
- **Determining if images associated with your brand contain unsafe content**—detects unsafe and negative imagery in content.
18+
- **Helping you identify insights in near-real-time**—uses a visual dashboard to understand context, threats, and opportunities almost instantly.
19+
20+
The solution can be customized to aggregate other social media platforms and internal enterprise systems. The default CloudFormation deployment sets up custom ingestion configuration with parameters and an Amazon Simple Storage Service (Amazon S3) bucket to allow Amazon Transcribe Call Analytics output to be processed for natural language processing (NLP) analysis.
21+
22+
With minimal configuration changes in the custom ingestion functionality, this solution can ingest data from both internal systems and external data sources, such as transcriptions from call center calls, product reviews, movie reviews, and community chat forums including Twitch and Discord. This is done by exporting the custom data in JSON or XLSX format from the respective platforms and then uploading it to an Amazon Simple Storage Service (Amazon S3) bucket that is created when deploying this solution. For more details on how to customize this feature, refer to Customizing Amazon S3 ingestion.
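
The export-and-upload step described above might look roughly like the following sketch. It is illustrative only: the record fields (`id`, `text`), the bucket name, and the key prefix are hypothetical, and the schema the solution actually expects is defined in its custom ingestion documentation, not here. The only AWS call assumed is `upload_file(Filename, Bucket, Key)` on a boto3 S3 client, which is passed in as a parameter.

```python
import json
from pathlib import Path


def write_json_export(records, path):
    """Write a list of dict records to a JSON file and return its path.

    The record layout is a placeholder; use the schema required by the
    solution's custom ingestion configuration.
    """
    path = Path(path)
    path.write_text(json.dumps(records, ensure_ascii=False, indent=2), encoding="utf-8")
    return path


def upload_export(s3_client, path, bucket, prefix=""):
    """Upload the exported file and return the object key that was used.

    `s3_client` is expected to behave like a boto3 S3 client, i.e. to
    provide upload_file(Filename, Bucket, Key).
    """
    key = f"{prefix}{Path(path).name}"
    s3_client.upload_file(str(path), bucket, key)
    return key
```

With boto3 installed and credentials configured, `boto3.client("s3")` can be passed as `s3_client`, and `bucket` would be the name of the ingestion bucket created by the CloudFormation deployment.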
1523

1624
For a detailed solution deployment guide, refer to [Discovering Hot Topics using Machine Learning](https://aws.amazon.com/solutions/implementations/discovering-hot-topics-using-machine-learning)
1725

@@ -33,7 +41,7 @@ Deploying this solution with the default parameters builds the following environ
3341

3442
The architecture of the solution includes the following key components and workflows:
3543

36-
1. Ingestion – Twitter, RSS feeds, and YouTube comments ingestion and management using Lambda functions, Amazon DynamoDB, and Amazon CloudWatch Event Scheduler. For detailed reference architecture diagrams for Twitter, YouTube comment, and RSS news feed ingestion, refer to the [implementation guide](https://docs.aws.amazon.com/solutions/latest/discovering-hot-topics-using-machine-learning/overview.html).
44+
1. Ingestion – Twitter, RSS feed, YouTube comment, and Reddit ingestion and management using Lambda functions, Amazon DynamoDB, and Amazon CloudWatch Event Scheduler. For detailed reference architecture diagrams for Twitter, YouTube comment, RSS news feed, and Reddit ingestion, refer to the [implementation guide](https://docs.aws.amazon.com/solutions/latest/discovering-hot-topics-using-machine-learning/overview.html).
3745

3846
2. Data Stream — The data is buffered through Amazon Kinesis Data Streams to provide resiliency and throttle incoming requests. The Data Streams have a configured DLQ to catch any errors in processing feeds.
3947
