Code Organization Patterns

This document provides guidance on file organization patterns and best practices for maintaining code quality in the GitHub Agentic Workflows project.

Recommended Patterns to Follow
File Organization Principles
When to Create New Files
File Size Guidelines
Domain-Specific Patterns
Anti-Patterns to Avoid
Decision Trees

Recommended Patterns to Follow

The codebase follows several consistent patterns that should be emulated:

1. Create Functions Pattern (`create_*.go`)

Pattern: One file per GitHub entity creation operation

Examples:

create_issue.go - GitHub issue creation logic
create_pull_request.go - Pull request creation logic
create_discussion.go - Discussion creation logic
create_code_scanning_alert.go - Code scanning alert creation
create_agent_task.go - Agent session creation logic
create_pr_review_comment.go - PR review comment creation

Why it works:

Clear separation of concerns
Enables quick location of specific functionality
Prevents files from becoming too large
Facilitates parallel development
Simplifies test organization and coverage

When to use:

Creating handlers for GitHub API operations
Implementing safe output processors
Building distinct feature modules

2. Engine Separation Pattern

Pattern: Each AI engine has its own file with shared helpers in engine_helpers.go

Examples:

copilot_engine.go (971 lines) - GitHub Copilot engine
claude_engine.go (340 lines) - Claude engine
codex_engine.go (639 lines) - Codex engine
custom_engine.go (300 lines) - Custom engine support
agentic_engine.go (450 lines) - Base agentic engine interface
engine_helpers.go (424 lines) - Shared engine utilities

Why it works:

Engine-specific logic is isolated
Shared code is centralized in engine_helpers.go
Allows addition of new engines without affecting existing ones
Clear boundaries reduce merge conflicts

When to use:

Implementing new AI engines
Adding engine-specific features
Refactoring engine functionality

3. Expression Builder Pattern (`expressions.go`)

Pattern: Cohesive functionality organized in a single, focused file

Examples:

expressions.go (948 lines) - Expression tree building and rendering
strings.go (153 lines) - String utility functions
artifacts.go (60 lines) - Artifact handling
args.go (65 lines) - Argument parsing

Why it works:

All related functionality in one place
Provides clear view of the complete feature
Reduces navigation between files
Promotes cohesive design

When to use:

Building domain-specific utilities
Implementing self-contained features
Creating reusable components

4. Test Organization Pattern

Pattern: Tests live alongside implementation files with descriptive names

Examples:

Feature tests: feature.go + feature_test.go
Integration tests: feature_integration_test.go
Specific scenario tests: feature_scenario_test.go

Examples from codebase:

create_issue.go + create_issue_assignees_test.go + create_issue_backward_compat_test.go
copilot_engine.go + copilot_engine_test.go
engine_helpers.go + engine_helpers_test.go + engine_helpers_shared_test.go

Why it works:

Tests are co-located with implementation
Clear test purpose from filename
Supports test coverage requirements
Separates integration from unit tests

File Organization Principles

1. Prefer Many Small Files Over Large Ones

Good: Multiple focused files (100-500 lines each)

create_issue.go (160 lines)
create_pull_request.go (238 lines)
create_discussion.go (118 lines)

Avoid: Single large file with all creation logic (600+ lines)

2. Group by Functionality, Not by Type

Good: Feature-based organization

create_issue.go            # Issue creation logic
create_issue_test.go       # Issue creation tests
add_comment.go             # Comment addition logic
add_comment_test.go        # Comment tests

Avoid: Type-based organization

models.go                  # All structs
logic.go                   # All business logic
tests.go                   # All tests

3. Use Descriptive File Names

Good:

create_pull_request_reviewers_test.go - Clear what's being tested
engine_error_patterns_infinite_loop_test.go - Specific scenario
copilot_mcp_http_integration_test.go - Clear scope and type

Avoid:

utils.go - Too vague
helpers.go - Too generic (unless truly shared like engine_helpers.go)
misc.go - Indicates poor organization

4. Keep Related Code Together

When implementing a feature:

Create main implementation file (feature.go)
Add unit tests (feature_test.go)
Add integration tests if needed (feature_integration_test.go)
Add scenario-specific tests (feature_scenario_test.go)

When to Create New Files

Create a New File When:

Implementing a new safe output type
- Pattern: create_<entity>.go
- Example: Adding create_gist.go for gist creation
Adding a new engine
- Pattern: <engine-name>_engine.go
- Example: gemini_engine.go for Google Gemini support
Building a new domain feature
- Pattern: <feature-name>.go
- Example: webhooks.go for webhook handling
Current file exceeds 800 lines
- Consider splitting by logical boundaries
- Extract related functionality to new file
Adding significant test coverage
- Pattern: feature_<scenario>_test.go
- Example: create_issue_assignees_test.go

Extend Existing Files When:

Adding to existing functionality
- Example: Adding a field to CreateIssuesConfig in create_issue.go
Fixing bugs in existing code
- Keep fixes in the same file as the original code
File is still under 500 lines
- No need to split unless logic is truly independent
Adding related helper functions
- Example: Adding to strings.go for string utilities

File Size Guidelines

Function Count Threshold

Guideline: Consider splitting files when they exceed 50 functions.

Note: This is a guideline, not a hard rule. Domain complexity may justify larger files.

Monitoring: Run make check-file-sizes to identify files approaching the 50-function threshold.

Current Large Files

The following files are justified despite their size due to domain complexity:

js.go (41 functions, 914 lines) - JavaScript bundling and execution with many embed directives
permissions.go (37 functions, 945 lines) - Permission handling with many GitHub Actions permission types
scripts.go (37 functions, 397 lines) - Script generation with specialized functions for workflow steps
compiler_safe_outputs_consolidated.go (30 functions, 1267 lines) - Consolidated safe output handling

Recommended Sizes

Small files: 50-200 lines
- Utilities, single-purpose functions, helper methods
- Examples: args.go (65 lines), artifacts.go (60 lines)
Medium files: 200-500 lines
- Most feature implementations
- Examples: create_issue.go (160 lines), add_comment.go (210 lines)
Large files: 500-800 lines
- Complex features with many aspects
- Examples: permissions.go (905 lines), safe_outputs.go (811 lines)
Very large files: 800+ lines
- Core infrastructure only
- Examples: compiler.go (1596 lines), copilot_engine.go (971 lines)
- Consider refactoring if possible

Red Flags

⚠️ Warning signs that a file should be split:

Multiple distinct responsibilities
Difficulty naming the file
Scrolling excessively to find code
Merge conflicts frequently occur
Tests are hard to organize

Domain-Specific Patterns

Validation Organization

Current approach: Centralized validation.go (714 lines)

When to add to validation.go:

Schema validation logic
Cross-cutting validation concerns
Frontmatter field validation

When to use domain-specific validation:

Engine-specific validation in <engine>_engine.go
Feature-specific validation alongside feature code
Example: Network validation in network-related files

Extraction Functions

Centralized extraction (validation.go):

func extractString(data map[string]any, key string) string
func extractBool(data map[string]any, key string) bool

Domain-specific extraction (feature files):

// In create_issue.go
func parseTitlePrefixFromConfig(configMap map[string]any) string
func parseLabelsFromConfig(configMap map[string]any) []string

Guideline: Use centralized extractors for primitive types, domain-specific parsers for complex types.

Compiler Organization

The compiler is split across multiple files:

compiler.go (1596 lines) - Main compilation logic
compiler_yaml.go (1020 lines) - YAML generation
compiler_jobs.go (806 lines) - Job generation
compiler_test.go (6058 lines) - Comprehensive tests

This demonstrates that even large subsystems benefit from logical file splits.

Anti-Patterns to Avoid

❌ 1. God Files

Problem: Single file doing everything

// Don't create files like this
workflow.go (5000+ lines)  // Everything related to workflows

Solution: Split by responsibility

workflow_parser.go
workflow_compiler.go
workflow_validation.go

❌ 2. Vague Naming

Problem: Non-descriptive file names

utils.go
helpers.go
misc.go
common.go

Solution: Use specific names

string_utils.go        // If really needed
engine_helpers.go      // Shared engine utilities

❌ 3. Mixed Concerns

Problem: Unrelated functionality in one file

// In create_issue.go - DON'T DO THIS
func CreateIssue() {}
func ValidateNetwork() {}  // Unrelated!
func CompileYAML() {}      // Unrelated!

Solution: Keep files focused on one domain

❌ 4. Test Pollution

Problem: All tests in one massive file

workflow_test.go (10000+ lines)  // All tests

Solution: Split by scenario

workflow_parser_test.go
workflow_compiler_test.go
workflow_integration_test.go

❌ 5. Premature Abstraction

Problem: Creating files before patterns emerge

// Don't create these preemptively
future_feature_helpers.go
maybe_needed_utils.go

Solution: Wait until you have 2-3 use cases, then extract common patterns

Decision Trees

Should I Create a New File?

Is this a new safe output type (create_*)?
├─ YES → Create create_<entity>.go
└─ NO
   │
   Is this a new AI engine?
   ├─ YES → Create <engine>_engine.go
   └─ NO
      │
      Is current file > 800 lines?
      ├─ YES → Consider splitting by logical boundaries
      └─ NO
         │
         Is this functionality independent?
         ├─ YES → Create new file
         └─ NO → Add to existing file

Should I Split an Existing File?

Is the file > 1000 lines?
├─ YES → SHOULD split
└─ NO
   │
   Is the file > 800 lines?
   ├─ YES → CONSIDER splitting
   └─ NO
      │
      Does it have multiple responsibilities?
      ├─ YES → CONSIDER splitting
      └─ NO
         │
         Are there frequent merge conflicts?
         ├─ YES → CONSIDER splitting
         └─ NO → Keep as is

What Should I Name This File?

Is it a create operation for GitHub entity?
├─ YES → create_<entity>.go
└─ NO
   │
   Is it an AI engine implementation?
   ├─ YES → <engine>_engine.go
   └─ NO
      │
      Is it shared helpers for a subsystem?
      ├─ YES → <subsystem>_helpers.go
      └─ NO
         │
         Is it a cohesive feature?
         ├─ YES → <feature>.go
         └─ NO → Reconsider the organization

Examples from the Codebase

Recommended: Create Functions

pkg/workflow/
├── create_issue.go                    (160 lines)
├── create_issue_test.go               (various test files)
├── create_pull_request.go             (238 lines)
├── create_pull_request_test.go
├── create_discussion.go               (118 lines)
├── create_code_scanning_alert.go      (136 lines)
└── create_agent_task.go               (120 lines)

Recommended: Engine Organization

pkg/workflow/
├── agentic_engine.go                  (450 lines) - Base interface
├── copilot_engine.go                  (971 lines) - Copilot implementation
├── claude_engine.go                   (340 lines) - Claude implementation
├── codex_engine.go                    (639 lines) - Codex implementation
├── custom_engine.go                   (300 lines) - Custom engine
└── engine_helpers.go                  (424 lines) - Shared utilities

Recommended: Focused Utilities

pkg/workflow/
├── strings.go                         (153 lines) - String utilities
├── expressions.go                     (948 lines) - Expression handling
├── artifacts.go                       (60 lines) - Artifact management
└── args.go                            (65 lines) - Argument parsing

Quick Reference

When adding a feature, ask yourself:

✅ Does this fit logically in an existing file under 500 lines? → Add there
✅ Is this a new GitHub entity creation? → create_<entity>.go
✅ Is this a new engine? → <engine>_engine.go
✅ Is this a cohesive, self-contained feature? → <feature>.go
❌ Am I creating a "utils" or "helpers" file? → Reconsider the name
❌ Will this file have multiple unrelated responsibilities? → Split it up

When refactoring, ask yourself:

✅ Is the file over 800 lines? → Consider splitting
✅ Are there distinct logical sections? → Extract to separate files
✅ Would splitting improve testability? → Do it
❌ Am I just moving code around without improving organization? → Don't do it

Contributing to Organization Patterns

If you discover new patterns or anti-patterns:

Document them in this file
Provide concrete examples from the codebase
Explain the rationale
Update decision trees if needed
Submit a pull request with your improvements

Remember: Good organization emerges from consistent patterns, not rigid rules.

FilesExpand file tree

code-organization.md

Latest commit

History

code-organization.md

File metadata and controls

Code Organization Patterns

Table of Contents

Recommended Patterns to Follow

1. Create Functions Pattern (create_*.go)

2. Engine Separation Pattern

3. Expression Builder Pattern (expressions.go)

4. Test Organization Pattern

File Organization Principles

1. Prefer Many Small Files Over Large Ones

2. Group by Functionality, Not by Type

3. Use Descriptive File Names

4. Keep Related Code Together

When to Create New Files

Create a New File When:

Extend Existing Files When:

File Size Guidelines

Function Count Threshold

Current Large Files

Recommended Sizes

Red Flags

Domain-Specific Patterns

Validation Organization

Extraction Functions

Compiler Organization

Anti-Patterns to Avoid

❌ 1. God Files

❌ 2. Vague Naming

❌ 3. Mixed Concerns

❌ 4. Test Pollution

❌ 5. Premature Abstraction

Decision Trees

Should I Create a New File?

Should I Split an Existing File?

What Should I Name This File?

Examples from the Codebase

Recommended: Create Functions

Recommended: Engine Organization

Recommended: Focused Utilities

Quick Reference

Contributing to Organization Patterns

1. Create Functions Pattern (`create_*.go`)

3. Expression Builder Pattern (`expressions.go`)