Skip to content

AbhishekDatta/environment-realism-toolkit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Environment Realism Toolkit

A practical toolkit for auditing environment gaps before chaos testing, based on lessons from production failures.

What This Is

Most chaos tests pass in staging and fail in production. Not because the tests are bad, but because staging lies about what production will do.

This toolkit helps you find those lies before they cost you an incident.

What's Included

  • 30-Minute Audit Template: Spreadsheet format for comparing staging vs production
  • Gap Detection Scripts: Automated checks for common environment differences
  • Mock Configuration Examples: How to make your mocks behave like real dependencies under stress
  • Config Diff Tools: Scripts to compare critical settings across environments
  • Real Examples: Anonymized audit results showing what we found and what we did about it

Quick Start

  1. Run the 30-minute audit using the template in /templates/environment-audit-template.md
  2. Use the gap detection scripts in /scripts/ to automate common checks
  3. Review the examples in /examples/ to see what realistic audits look like
  4. Implement realistic mocks using configs from /mocks/

The Five Critical Gaps

This toolkit focuses on the five gaps that most commonly invalidate chaos test results:

  1. Traffic Realism: Does your load pattern mirror production spikes?
  2. Dependency Realism: Do your mocks behave like real services under stress?
  3. Data Realism: Is your data volume and distribution realistic?
  4. Configuration Realism: Do timeouts, pools, and retries match production?
  5. Temporal Realism: Are you testing during the right time windows?

Background

This toolkit came from a Black Friday incident where we passed every chaos test in staging and production collapsed anyway. The problem wasn't our testing approach. It was the environment we tested in.

Read the full story:

⚠️ Important: Read Before Using

This toolkit involves chaos engineering, which can break production systems if used incorrectly.

Before using any scripts or running chaos experiments:

  1. Read SECURITY.md for critical security and production considerations
  2. Get appropriate approvals from your organization
  3. Start in safe, non-production environments
  4. Have monitoring, kill switches, and rollback plans ready

TL;DR: These are educational examples and starting points, not production-ready tools. Review, customize, and test thoroughly before use. You are responsible for what you break.

Contributing

Found a gap we missed? Built a useful script? Open a PR or issue.

License

MIT License - See LICENSE for details.

Related Resources

This toolkit is part of CERA (Chaos Engineering Requirement Analysis), a framework for structured chaos testing:

About

Practical toolkit for auditing environment gaps before chaos testing.

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors