improvement: reuse docker compose for e2e tests #93

fangpenlin · 2025-12-20T05:08:17Z

Description 📣

ref: https://linear.app/infisical/issue/PLATFRM-134/make-a-smarter-docker-compose-cache-system-to-avoid-recreating-compose

based on: #91

Tests 🛠️

Testing this PR works mostly the same way as #91. The major difference is that we now have a cache system for docker-compose. If we find a compose (multiple containers with a compose label) that already meets the spec, we reuse it instead of creating a new one. The purpose is to speed up the development cycle.

To verify the system works, run the only relay test case we have right now with TESTCONTAINERS_RYUK_DISABLED=true. This stops the Ryuk container from cleaning up our compose containers. After the relay test case runs successfully, run it again to see how long it takes and whether it reuses the existing containers.

After that, you can also set CLI_E2E_DISABLE_COMPOSE_CACHE=1 to disable the cache and you should see it boot the container from the ground up instead of using the cache.

Type ✨

gitguardian · 2026-01-06T02:48:41Z

⚠️ GitGuardian has uncovered 1 secret following the scan of your pull request.

Please consider investigating the findings and remediating the incidents. Failure to do so may lead to compromising the associated services or software components.

🔎 Detected hardcoded secret in your pull request

GitGuardian id	GitGuardian status	Secret	Commit	Filename
9387833	Triggered	Generic Password	`22fcc91`	e2e/packages/client/reset_db.go

🛠 Guidelines to remediate hardcoded secrets

Understand the implications of revoking this secret by investigating where it is used in your code.
Replace and store your secret safely. Learn here the best practices.
Revoke and rotate this secret.
If possible, rewrite git history. Rewriting git history is not a trivial act. You might completely break other contributing developers' workflow and you risk accidentally deleting legitimate data.

To avoid such incidents in the future consider

following these best practices for managing and storing secrets including API keys and other credentials
install secret detection on pre-commit to catch secret before it leaves your machine and ease remediation.

^{🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.}

greptile-apps · 2026-01-10T04:08:58Z

Greptile Overview

Greptile Summary

This PR introduces a smart Docker Compose caching system to avoid recreating containers for E2E tests, along with new database and Redis reset utilities.

Key Changes

Container Reuse System (compose.go):

Implements SHA1-based naming to identify identical compose configurations
Checks for existing running containers and reuses them instead of recreating
Introduces RunningCompose wrapper to handle pre-existing containers
Refactors Stack API to directly manage compose lifecycle
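A minimal sketch of what SHA1-based naming could look like, assuming the name is derived from the rendered compose configuration. The function name `uniqueComposeName`, the `e2e-` prefix, and the 12-character truncation are hypothetical choices for illustration, not taken from compose.go.

```go
package main

import (
	"crypto/sha1"
	"encoding/hex"
)

// uniqueComposeName derives a deterministic project name from the rendered
// compose configuration, so identical configurations map to the same name.
// The "e2e-" prefix and the 12-character truncation are illustrative only.
func uniqueComposeName(composeYAML []byte) string {
	sum := sha1.Sum(composeYAML)
	return "e2e-" + hex.EncodeToString(sum[:])[:12]
}
```

Because the name depends only on the configuration bytes, a later run with an unchanged configuration can look up containers by that name, while any change to the configuration yields a different name and therefore a fresh compose.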

Reset Utilities (reset_db.go, reset_redis.go, reset.go):

New PostgreSQL reset function that truncates all tables while preserving migrations
New Redis reset function using FlushAll
Integration layer to reset both services via compose stack

Test Improvements (helpers_test.go):

Added Reset() and ResetAndBootstrap() methods for test isolation
Separated Bootstrap logic from Up for better reusability
Cleaner API with direct Stack integration

Critical Security Issue

SQL Injection Vulnerability in reset_db.go (line 125): Table names from information_schema are directly interpolated into TRUNCATE statements without proper identifier escaping. While this is test code and the risk is lower, a malicious table name could execute arbitrary SQL. This must be fixed by using proper identifier quoting.

Other Issues Found

Context misuse in compose.go (line 48): Uses context.Background() instead of passed context
Panic usage: RunningCompose uses panic instead of returning errors
Race condition potential: Container state checking has TOCTOU window
Cleanup timing: t.Cleanup() registered after operations that could fail

Architecture Assessment

The caching approach is sound and will significantly speed up test execution by reusing containers. The reset mechanism properly isolates tests while avoiding the overhead of full container recreation. However, the SQL injection issue needs immediate attention before merging.

Confidence Score: 2/5

This PR contains a critical SQL injection vulnerability that must be fixed before merging
Score reflects the CRITICAL security issue in reset_db.go (SQL injection via unescaped table identifiers) plus the context misuse bug in compose.go. While this is test code and the architecture is sound, the SQL injection represents a significant security risk that prevents approval. The other issues (panic usage, race conditions, cleanup timing) are less critical but should also be addressed.
e2e/packages/client/reset_db.go requires immediate attention for SQL injection fix (line 125). e2e/packages/infisical/compose.go needs context bug fix (line 48).

Important Files Changed

File Analysis

Filename	Score	Overview
e2e/packages/infisical/compose.go	3/5	Major refactor adding Docker container reuse logic with SHA1-based naming. Contains context misuse bug (line 48), potential race condition in container state checking, and panic usage instead of error returns. Core caching functionality appears sound.
e2e/packages/infisical/reset.go	4/5	New helper to reset Infisical services by calling client reset functions. Clean implementation with proper error handling and context propagation.
e2e/packages/client/reset_db.go	1/5	New database reset utility with CRITICAL SQL injection vulnerability (line 125) due to unescaped table identifiers. Also has potential conflict issues with hardcoded super_admin insertion. Requires immediate security fix.
e2e/packages/client/reset_redis.go	4/5	New Redis reset utility using FlushAll to clear all databases. Clean implementation with proper connection handling. FlushAll scope (all DBs vs single DB) should be clarified in comments.
e2e/relay/helpers_test.go	4/5	Refactored to use new Stack API and added Reset/ResetAndBootstrap methods. Cleanup registration timing could be improved but overall changes are sound and improve test isolation.

greptile-apps

_{7 files reviewed, 7 comments}

e2e/packages/infisical/compose.go

greptile-apps · 2026-01-10T04:09:02Z

e2e/packages/infisical/compose.go

+	if len(containers) > 0 {
+		services := make([]string, 0, len(s.Project.Services))
+		for name := range s.Project.Services {
+			services = append(services, name)
+		}
+
+		missingServices := make(map[string]int, len(services))
+		for _, service := range services {
+			missingServices[service] = 1
+		}
+		for _, c := range containers {
+			if c.State == container.StateRunning {
+				serviceName, ok := c.Labels[api.ServiceLabel]
+				if !ok {
+					continue
+				}
+				_, ok = missingServices[serviceName]
+				if ok {
+					delete(missingServices, serviceName)
+				}
+			}
+		}
+
+		if len(missingServices) == 0 {
+			provider, err := testcontainers.NewDockerProvider(testcontainers.WithLogger(log.Default()))
+			if err != nil {
+				return err
+			}
+			s.dockerCompose = &RunningCompose{
+				name:       uniqueName,
+				client:     dockerClient,
+				provider:   provider,
+				services:   services,
+				containers: make(map[string]*testcontainers.DockerContainer),
+			}
+			slog.Info("Found existing running containers", "name", uniqueName)
+			// Found existing compose, reuse instead
+			return s.dockerCompose.Up(ctx)
+		}


⚠️ Potential Race Condition in Container State Check

The code checks if containers exist and are running, then decides to reuse them. However, there's a time-of-check to time-of-use (TOCTOU) race condition here. Between checking the container state (lines 61-82) and actually using it (line 98), the containers could:

Stop running

Be removed by another process

Change state

While this is test code and the race window is small, in concurrent test scenarios this could lead to flaky tests where containers appear to exist but fail to respond.

Recommendation: Consider adding retry logic or additional state validation when attempting to reuse existing containers.

The cache system is not meant for CI or any form of parallel test runs yet. For now, the main focus is on speeding up the development cycle of writing tests; otherwise we waste tons of time just waiting for the compose containers to boot up. If we really care about it, we will either set CLI_E2E_DISABLE_COMPOSE_CACHE=1 to disable this feature on CI or add a lock to it.
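The service-coverage check discussed in this thread can be sketched as a pure function. The `containerInfo` type below is a hypothetical, trimmed-down stand-in for the Docker container summary used in the PR, and `missingServices` is an illustrative name. Calling such a function again right before reuse would narrow the TOCTOU window but not eliminate it.

```go
package main

// containerInfo is a hypothetical stand-in for the Docker container summary:
// it keeps only the compose service label and whether the container runs.
type containerInfo struct {
	Service string // value of the compose service label, "" if absent
	Running bool
}

// missingServices returns the required services that have no running
// container, preserving the order of the required list.
func missingServices(required []string, containers []containerInfo) []string {
	running := make(map[string]bool, len(containers))
	for _, c := range containers {
		if c.Running && c.Service != "" {
			running[c.Service] = true
		}
	}
	var missing []string
	for _, name := range required {
		if !running[name] {
			missing = append(missing, name)
		}
	}
	return missing
}
```

An empty result means every service in the spec has at least one running container, which is the condition under which the existing compose is reused.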

greptile-apps · 2026-01-10T04:09:03Z

e2e/packages/infisical/compose.go

+	panic("Cannot modify running compose")
+}
+
+func (c *RunningCompose) WithEnv(m map[string]string) compose.ComposeStack {
+	panic("Cannot modify running compose")
+}
+
+func (c *RunningCompose) WithOsEnv() compose.ComposeStack {
+	panic("Cannot modify running compose")
+}


⚠️ Using Panic Instead of Error Return

These methods use panic() to indicate that operations are not supported on running compose instances. While this makes the intent clear, panicking in library code is generally considered poor practice as it prevents callers from gracefully handling the error.

Recommendation: Consider returning an error instead:

Suggested change

-	panic("Cannot modify running compose")
-}
-
-func (c *RunningCompose) WithEnv(m map[string]string) compose.ComposeStack {
-	panic("Cannot modify running compose")
-}
-
-func (c *RunningCompose) WithOsEnv() compose.ComposeStack {
-	panic("Cannot modify running compose")
-}
+func (c *RunningCompose) WaitForService(s string, strategy wait.Strategy) compose.ComposeStack {
+	// Cannot modify running compose - return self to maintain interface compatibility
+	return c
+}
+
+func (c *RunningCompose) WithEnv(m map[string]string) compose.ComposeStack {
+	// Cannot modify running compose - return self to maintain interface compatibility
+	return c
+}
+
+func (c *RunningCompose) WithOsEnv() compose.ComposeStack {
+	// Cannot modify running compose - return self to maintain interface compatibility
+	return c
+}

Or if you want to preserve the error behavior, log a warning instead of panicking since these methods are part of the ComposeStack interface and may be called by library code that doesn't expect panics.

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}

Nope, this library is for tests. We don't want to handle the error; crashing the app is the easy way to tell the developer that they shouldn't do that. There's no need to recover from that kind of error anyway, and this is not production code.

greptile-apps · 2026-01-10T04:09:04Z

e2e/packages/client/reset_db.go

+		if _, ok := opts.SkipTables[table]; ok {
+			continue
+		}
+		builder.WriteString(fmt.Sprintf("TRUNCATE TABLE %s RESTART IDENTITY CASCADE;\n", table))


🚨 CRITICAL: SQL Injection Vulnerability - Unescaped Table Identifiers

This line directly interpolates table names from the database into a SQL TRUNCATE statement without proper identifier quoting. While table names come from information_schema, a malicious actor who can create tables with special characters in their names could potentially inject SQL.

Attack Scenario:
If an attacker can create a table named something like: myTable; DROP DATABASE infisical; --, this could execute arbitrary SQL commands.

Example of vulnerability:

table := `myTable; DROP TABLE important_table; --`
// Results in: TRUNCATE TABLE myTable; DROP TABLE important_table; -- RESTART IDENTITY CASCADE;

Fix: Use PostgreSQL identifier quoting with pgx.Identifier or properly escape identifiers:

Suggested change

-	builder.WriteString(fmt.Sprintf("TRUNCATE TABLE %s RESTART IDENTITY CASCADE;\n", table))
+	builder.WriteString(fmt.Sprintf("TRUNCATE TABLE %s RESTART IDENTITY CASCADE;\n", pgx.Identifier{schema, table}.Sanitize()))

Or use double quotes for identifier escaping:

// Escape table names properly
escapedTable := strings.ReplaceAll(table, `"`, `""`)
builder.WriteString(fmt.Sprintf(`TRUNCATE TABLE "%s" RESTART IDENTITY CASCADE;`, escapedTable) + "\n")

Security Impact: HIGH - Could allow arbitrary SQL execution in test environments, potentially affecting other tests or leaking data.

heh, nope, this is an e2e test running with a one-time-use database in a container. there's zero value in SQL-injecting it, not to mention the table name comes from the db itself instead of user input. you have no idea what you're talking about, it's just the stats making you say that it looks like a SQL injection attack vector unfortunately 😅
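For reference, the double-quote escaping mentioned in the review can be written with the standard library alone. This is only a sketch of the general PostgreSQL identifier-quoting rule; `quoteIdent` and `truncateStatement` are hypothetical helper names, not code from reset_db.go, and whether the extra quoting is worth it for a throwaway test database is exactly what this thread disputes.

```go
package main

import (
	"fmt"
	"strings"
)

// quoteIdent wraps a PostgreSQL identifier in double quotes and doubles any
// embedded double quote, which is the standard identifier-escaping rule.
func quoteIdent(name string) string {
	return `"` + strings.ReplaceAll(name, `"`, `""`) + `"`
}

// truncateStatement builds one TRUNCATE statement for a schema-qualified
// table, quoting both parts so special characters stay inside the identifier.
func truncateStatement(schema, table string) string {
	return fmt.Sprintf("TRUNCATE TABLE %s.%s RESTART IDENTITY CASCADE;",
		quoteIdent(schema), quoteIdent(table))
}
```

Note that quoted identifiers are case-sensitive in PostgreSQL, so the names passed in have to match the stored names exactly, as they do when read from information_schema.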

greptile-apps · 2026-01-10T04:09:05Z

e2e/packages/client/reset_redis.go

+	slog.Info("Connected to Redis", "pong", pong)
+
+	// Clear all keys in the current database
+	err = rdb.FlushAll(ctx).Err()


⚠️ FlushAll Affects All Redis Databases

The code uses FlushAll() which clears all databases in the Redis instance (databases 0-15 by default), not just the currently selected one. This could be problematic if:

Multiple test suites run in parallel and use different Redis database numbers

The Redis instance is shared with other applications during development

Consider: Using FlushDB() instead to only flush the current database, or explicitly select a specific database number before flushing:

// Flush only the current database (safer for shared Redis)
err = rdb.FlushDB(ctx).Err()

Or if you intentionally want to clear all databases (current behavior is likely correct for e2e tests):

// FlushAll clears all databases - this is intentional for e2e test reset
err = rdb.FlushAll(ctx).Err()

Add a comment to clarify the intent.

Nope, I don't need to add a comment. The intention is very clear: reset_redis. And this is under e2e folder / package, it's obviously for e2e test to reset the redis db.

e2e/relay/helpers_test.go

greptile-apps · 2026-01-10T04:09:07Z

e2e/packages/client/reset_db.go

+	_, err = conn.Exec(ctx,
+		`INSERT INTO public.super_admin ("id", "fipsEnabled", "initialized", "allowSignUp") VALUES ($1, $2, $3, $4)`,
+		"00000000-0000-0000-0000-000000000000", true, false, true)


⚠️ Hardcoded ID May Cause Conflicts

The code inserts a hardcoded all-zeros identifier for the super_admin record. While this is consistent and deterministic for testing, there are potential issues:

If the truncate operation fails for the super_admin table (due to it being in SkipTables), this INSERT will fail with a unique constraint violation

The hardcoded values may not match what a real initialization would create

Recommendation: Consider using ON CONFLICT DO UPDATE or ON CONFLICT DO NOTHING to make this operation idempotent and prevent failures on repeated calls.

nope, if the truncate failed for whatever reason, the whole test should just fail. And nope, the super admin is exactly like that, using this uuid. the quality of review is really bad this time unfortunately

fangpenlin changed the base branch from main to PLATFRM-125-e2e-test-poc December 20, 2025 05:08

fangpenlin force-pushed the PLATFRM-134-reuse-docker-compose-for-e2e-tests branch from 78a567c to f508d06 Compare January 6, 2026 02:48

fangpenlin force-pushed the PLATFRM-125-e2e-test-poc branch 3 times, most recently from d6f09f0 to 7fa0723 Compare January 6, 2026 19:56

Base automatically changed from PLATFRM-125-e2e-test-poc to main January 7, 2026 17:29

fangpenlin added 15 commits January 7, 2026 11:07

Simplified code

c13c9af

Use unique name

3cc2789

Finding existing containers

07524e5

Add running compose

ed6b73c

Return value

34adcb3

Get tables

fee751f

reset db

42afc90

Clear redis db as well

6e31f01

Add deps

930a243

Refactor code

f4883cd

Use new reset

ba0572e

Add reset function for helper as well

ef1687a

Provide more helpers

ba35c96

Remove not needed bootstrap in reset

bdfb90b

Add missing reset

22fcc91

fangpenlin force-pushed the PLATFRM-134-reuse-docker-compose-for-e2e-tests branch from f508d06 to 22fcc91 Compare January 7, 2026 23:49

fangpenlin added 2 commits January 9, 2026 17:50

Update deps

1af6088

Add CLI_E2E_DISABLE_COMPOSE_CACHE env var

30ef5ea

fangpenlin marked this pull request as ready for review January 10, 2026 04:04

greptile-apps bot reviewed Jan 10, 2026

fangpenlin added 2 commits January 9, 2026 20:34

Adjust the cleanup order

887fcf0

Fix ctx pass in

800e91b

fangpenlin requested a review from akhilmhdh January 10, 2026 04:41
