Glimpse

Glimpse is a data engineering project designed to:

Read the latest news snippets from a source API
Combine and process text and metadata into a single input
Call an LLM API to generate a single-paragraph prompt
Feed the prompt into an image generation API
Use a social platform API to automatically generate content daily
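
As a rough illustration of the "single input" step, here is a minimal sketch of flattening news snippets into one text block before an LLM call. The function name build_llm_input and the field names title, description and source are assumptions for illustration, not the project's actual schema.

```python
from typing import Any


def build_llm_input(snippets: list[dict[str, Any]], max_items: int = 5) -> str:
    """Flatten news snippets into one newline-separated text block.

    The keys "title", "description" and "source" are hypothetical; adjust
    them to whatever the source API actually returns.
    """
    lines = []
    for item in snippets:
        if len(lines) == max_items:
            break  # cap the input size so the LLM prompt stays short
        title = str(item.get("title") or "").strip()
        description = str(item.get("description") or "").strip()
        source = str(item.get("source") or "unknown").strip()
        text = ": ".join(part for part in (title, description) if part)
        if text:  # skip snippets with neither a title nor a description
            lines.append(f"[{source}] {text}")
    return "\n".join(lines)


if __name__ == "__main__":
    sample_feed = [
        {"title": "Markets rally", "description": "Stocks rose on Monday.", "source": "example-wire"},
        {"title": "", "description": "", "source": "example-wire"},
    ]
    print(build_llm_input(sample_feed))
```

Capping the number of snippets is one simple way to keep the combined input small; the real pipeline may filter or rank snippets differently.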

System Components

External APIs

AWS Ecosystem

Social Media

Project Architecture

Components that are not IaC:

Updating of AWS credentials/IAM user
SNS topic setup and subscriptions
Pandas Lambda layer
OpenAI Lambda layer
Creation of environment variables for the content create Lambda
Posting of content
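
Because the environment variables are created by hand rather than through IaC, a Lambda can fail fast when one is missing. The sketch below shows one way to do that; the variable names OPENAI_API_KEY and SNS_TOPIC_ARN and the helper load_settings are assumptions, not the names the project necessarily uses.

```python
import os
from typing import Mapping, Optional, Sequence

# Hypothetical variable names; replace with the ones set on the real Lambda.
REQUIRED_VARS = ("OPENAI_API_KEY", "SNS_TOPIC_ARN")


def load_settings(
    environ: Optional[Mapping[str, str]] = None,
    required: Sequence[str] = REQUIRED_VARS,
) -> dict:
    """Return the required settings, raising if any is unset or empty."""
    env = os.environ if environ is None else environ
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in required}


if __name__ == "__main__":
    # Local demo with stand-in values instead of the real Lambda environment.
    demo_env = {"OPENAI_API_KEY": "sk-demo", "SNS_TOPIC_ARN": "arn:demo"}
    print(sorted(load_settings(demo_env)))
```

Calling load_settings() with no arguments inside a handler reads from os.environ, so a forgotten manual step surfaces as a clear error in the logs.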

Useful Links

Development Workflow

1. Create a new issue and branch off from it
2. Install all local dependencies in requirements.txt and set up serverless locally
3. Use the #%% cell magic from the VS Code Jupyter extension to run Lambda functions in isolation
4. Replicate variables locally using sample files
5. To test, upload a sample raw_feed.json from your local machine into glimpse-landing-dev through the AWS console. This should kick off the pipeline automatically
6. If everything runs correctly, an email with the content feed should be sent to jtsw1990@gmail.com
7. If not, review the logs, checking each Lambda's latest timestamp to identify error messages
8. Delete raw_feed.json from glimpse-landing-dev and feature.json from glimpse-feature-store, if applicable, to keep things clean
9. Repeat steps 3-8 until tests run as expected
10. Run ruff check . --fix to highlight any linting issues
11. Run sls deploy to push the latest adjustments to AWS (note the components not covered by IaC above and apply them accordingly)
12. Run the git workflow to push to the feature branch
13. Merge back into main
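
The local-run steps can be sketched as a pair of #%% cells that build a stand-in event and call a handler directly. This assumes the pipeline is triggered by an S3 object-created notification on glimpse-landing-dev; the handler shown here is hypothetical and only extracts the bucket and key, it is not one of the project's Lambdas.

```python
# %%
from urllib.parse import unquote_plus


def make_s3_put_event(bucket: str, key: str) -> dict:
    """Build a minimal stand-in for an S3 object-created notification."""
    return {
        "Records": [
            {
                "eventName": "ObjectCreated:Put",
                "s3": {"bucket": {"name": bucket}, "object": {"key": key}},
            }
        ]
    }


def handler(event: dict, context: object = None) -> dict:
    """Hypothetical Lambda entry point: report which object triggered the run."""
    record = event["Records"][0]["s3"]
    # Object keys arrive URL-encoded in S3 notifications, so decode them.
    return {
        "bucket": record["bucket"]["name"],
        "key": unquote_plus(record["object"]["key"]),
    }


# %%
if __name__ == "__main__":
    sample_event = make_s3_put_event("glimpse-landing-dev", "raw_feed.json")
    print(handler(sample_event))
```

Running each cell in isolation mirrors what the deployed function receives, without uploading anything to AWS.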

Project Goals

Become a wizard in building infrastructure
Know the right practices and tools to avoid running notebooks manually in data science projects
Be able to weigh options for different solutions given a specific stack and situation
Have fun learning and hopefully build something cool along the way

Optional Goals

Get used to the standard git development process (TBD), which will help with work
Create a personal project template that can be reused
Add an element of content creation to this
