Seahorse Backend

This repository provides the backend setup and data-loading instructions for the Seahorse project. The process involves loading large biological datasets into a secure AWS RDS (PostgreSQL) database using EC2 and S3 as intermediaries. To enable secure and scalable communication between the frontend and backend, we use AWS Lambda functions as API endpoints that interface with the database and return data to the frontend.

Overview

SEAHORSE leverages structured multi-omic datasets for hypothesis generation and validation. For security and compliance, data imports to the RDS instance are performed within a secure EC2 environment with connectivity to both the S3 bucket (where input files reside) and the RDS database.

Deployment

Configure Terraform:

cd deployment/terraform
terraform init

Deploy the site:
```shell
terraform apply

Cleanup

Delete the site:

terraform destroy

Data Loading Workflow

Copy Data Files to S3 Bucket
Place all .tsv.gz data files into your S3 bucket (e.g., seahorse-data-jq).
Launch an EC2 Instance
- Ensure the EC2 instance has permissions (IAM role) to access the S3 bucket and to connect to the RDS PostgreSQL instance (security group rules).
- SSH into the EC2 instance.
Transfer SQL Scripts
- Upload init.sql and import.sql to the EC2 instance (e.g., with scp or S3).
Run Initialization Script
This creates the necessary tables in your RDS PostgreSQL database:
```
psql -h <rds-endpoint> -U <db-username> -d <db-name> -f init.sql
```

Run Import Script Import all data from S3 into the created tables:

psql -h <rds-endpoint> -U <db-username> -d <db-name> -f import.sql

API & Lambda Integration

To allow the frontend to communicate securely with the backend database, you must set up AWS Lambda functions to serve as API endpoints.

For detailed instructions on setting up the Lambda functions, see
How to create Lambda functions.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
deployment/terraform		deployment/terraform
lambda_functions		lambda_functions
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
import.sql		import.sql
init.sql		init.sql
terraform.tfstate		terraform.tfstate

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Seahorse Backend

Overview

Deployment

Cleanup

Data Loading Workflow

API & Lambda Integration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Seahorse Backend

Overview

Deployment

Cleanup

Data Loading Workflow

API & Lambda Integration

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages