[Pattern Draft] AI Code Generation Context (#834)

amburi · web-flow · commit 3f5c7bf505ff · 2025-09-01T00:04:26.000+02:00
* feat: create new structure pattern ai-code-generation-context
diff --git a/README.md b/README.md
@@ -96,6 +96,7 @@ Our mission
 * [InnerSource Incentives and Disincentives](/patterns/1-initial/incentives-and-disincentives.md) - *Lack of awareness for incentives as well well as disincentives for InnerSource contribution decrease the chances of an InnerSource project receiving contributions; this is addressed by sharing a comprehensive list of potential incentives and disincentives.*
 * [Walk the InnerSource talk](/patterns/1-initial/walk-the-innersource-talk.md) - *Teams across the organization are encouraged to adopt InnerSource principles such as working openly, sharing code, and collaborating transparently. But, if the team behind the InnerSource initiative doesn’t follow these practices themselves, it undermines credibility and adoption. Therefore, this team should lead by example: documenting their decisions as code, working in the open, and treating their work as an InnerSource project to build trust and show others how it’s done.*
 * [Require InnerSource before Open Source](/patterns/1-initial/innersource-before-open-source.md) - *Maintaining and managing open source projects can be challenging for organizations, due to a lack of internal infrastructure and people with the knowledge of the required collaboration practices. By requiring projects to be InnerSource before becoming open source, teams have time to establish the necessary internal support, governance, and collaboration skills needed for successful community engagement.*
+* [AI Code Generation Context](patterns/1-initial/ai-code-generation-context.md) - *AI tools generate code that diverges from project standards and architectural patterns. Provide an AI Code Generation Context within the repositories to guide AI tools in producing contributions that align with existing project conventions, reducing review friction and maintaining code consistency.*
 
 <!--
 NOTE: The 'Initial' Patterns below don't have a Patlet yet, which is essential for readers to quickly browse our patterns.
diff --git a/patterns/1-initial/ai-code-generation-context.md b/patterns/1-initial/ai-code-generation-context.md
@@ -0,0 +1,162 @@
+## Title
+
+AI Code Generation Context
+
+## Patlet
+
+AI tools generate code that diverges from project standards and architectural patterns. Provide an AI Code Generation Context within the repositories to guide AI tools in producing contributions that align with existing project conventions, reducing review friction and maintaining code consistency.
+
+## Problem
+
+With the growing use of AI tools (like GitHub Copilot, ChatGPT, or custom LLMs), InnerSource contributors are increasingly using generative AI to write code. However, without project-specific context, these tools often produce code that diverges from the project's architectural patterns, naming conventions, or quality standards. This leads to friction during reviews, inconsistent codebases, and technical debts or additional burden on maintainers.
+
+## Context
+
+* InnerSource adoption is in place across the organization.
+* Developers are encouraged to use AI tools to improve their productivity.
+* Contributors may not be familiar with or fail to meticulously prompt when generating code through AI, using the target codebase's idioms, architecture, or constraints.
+* Maintainers want to empower contributors using AI while preserving code consistency and maintainability.
+* Multiple teams are contributing to shared repositories with varying levels of familiarity with project standards.
+* Code review processes are becoming bottlenecked by AI-generated code that requires significant rework.
+
+## Forces
+
+* **AI Model Limitations**: AI models generate code based on generalized training data, not project-specific patterns, leading to generic solutions that may not fit the project's architecture
+* **Knowledge Gap**: New contributors using AI tools might unknowingly bypass existing architectural norms, coding standards, and established patterns
+* **Review Overhead**: AI-assisted PRs can significantly increase review load if not aligned with existing practices, as maintainers must spend time explaining and correcting deviations
+* **Productivity vs. Quality Trade-off**: While AI tools boost individual productivity, they can reduce overall team productivity if the generated code requires extensive rework
+* **Context Switching Cost**: Developers benefit from AI tools only when they have the right contextual grounding, but manually providing this context for each AI interaction is time-consuming
+* **Inconsistent Standards**: Different AI tools and different prompting approaches by contributors can lead to wildly inconsistent code styles and patterns
+* **Maintenance Burden**: Creating and maintaining the comprehensive AI context requires ongoing effort from maintainers
+* **Tool Integration Complexity**: Different AI tools have different ways of consuming context, making it challenging to create universal guidance
+* **AI Tool Cost Constraints**: Comprehensive AI context increases processing costs (AI tools charge based on "tokens" \* units of text measurement) and usage limits, requiring strategic balance between context completeness and efficiency
+
+## Solution
+
+Provide an **AI Code Generation Context** folder within the repository to guide AI tools in producing contributions that align with project standards. This folder serves as a comprehensive reference that contributors can easily provide to AI tools, ensuring that generated code adheres to project conventions.
+
+### Implementation Structure
+
+Create an `innersource-ai/` folder in the repository root containing:
+
+#### Core Documentation Files (Required)
+
+`PROMPT.md`: Project-specific instructions for AI tools
+
+* Naming conventions (variables, functions, classes, files)
+* Logging patterns and error handling approaches
+* Testing strategy and preferred testing frameworks
+* Code formatting and style preferences
+* Common anti-patterns to avoid
+* Preferred libraries and frameworks for specific tasks
+
+`ARCHITECTURE.md`: Lightweight system overview
+
+* High-level system architecture and component relationships
+* Data flow patterns and API design principles
+* Dependency management guidelines
+* Module organization and layering principles
+* Integration patterns with external systems
+
+`STYLE_GUIDE.md`: Comprehensive coding guidelines
+
+* Language-specific style rules
+* Code organization patterns
+* Documentation standards
+* Performance considerations
+* Security guidelines and common vulnerabilities to avoid
+
+#### Enhancements (Optional)
+
+##### Practical Examples
+
+`EXAMPLES/`: Sample code files demonstrating best practices
+
+* `good-examples/`: Well-written code snippets with explanations
+* `bad-examples/`: Common mistakes with explanations of why they're problematic
+* `refactoring-examples/`: Before/after code showing proper improvements
+* Template files for common patterns (controllers, services, utilities)
+
+##### Configuration and Tooling
+
+`CONFIG/`: Shared formatter and analysis configurations
+
+* `.prettierrc`, `.eslintrc`, or equivalent formatting configs
+* `.editorconfig` for consistent editor settings
+* Static analysis tool configurations
+* Pre-commit hook configurations
+
+`INTEGRATION/`: AI tool-specific integration guides
+
+* GitHub Copilot workspace configuration
+* Custom GPT instructions and prompts
+* IDE plugin configurations
+* CLI tool integration examples
+
+##### Advanced Features
+
+`EMBEDDINGS/` (Optional): For advanced LLM integrations
+
+* Searchable knowledge base of project patterns
+* Vector embeddings of code examples
+* Semantic search capabilities for finding relevant patterns
+
+**Context Efficiency**: Start with core documentation files (~1000 words of context) to balance context value with AI tool costs. Expand strategically based on measured impact on review cycles and code quality.
+
+**Naming Convention**: The suggested file and folder names follow industry common practices. However, codebase owners may choose alternative names that are more discoverable and relatable to their specific project or codebase. Any chosen naming convention should be clearly documented and communicated to contributors through proper documentation. Should files like [AGENTS.md](https://agents.md) become standard in the future, the naming conventions in this pattern might be adapted accordingly.
+
+### Usage Patterns
+
+#### For Contributors
+
+* **Before Starting**: Review the AI context to understand project standards
+* **During Development**: Reference specific files when prompting AI tools
+* **Code Generation**: Include relevant context snippets in AI prompts
+* **Validation**: Use provided examples to verify AI-generated code aligns with standards
+
+#### For AI Tool Integration
+
+* **Direct Reference**: Copy relevant sections into AI tool prompts
+* **URL Integration**: Reference files via repository URLs in AI tools that support web access
+* **IDE Integration**: Configure AI plugins to automatically include context
+* **Custom Workflows**: Integrate context into CI/CD pipelines for automated validation
+
+### Maintenance Strategy
+
+* **Version Control**: Track changes to AI context alongside code changes
+* **Regular Updates**: Review and update context as project standards evolve
+* **Community Contribution**: Allow contributors to suggest improvements to AI context
+* **Metrics Tracking**: Monitor the effectiveness of AI context through code review metrics
+
+## Resulting Context
+
+* **Improved Code Quality**: AI-assisted contributions become consistent with existing code standards and architectural patterns from the first submission
+* **Reduced Review Friction**: Maintainers can trust incoming PRs more readily, significantly reducing review fatigue and time-to-merge
+* **Enhanced Contributor Experience**: Contributors using AI produce better, more maintainable code even on their first attempts, leading to increased confidence and participation
+* **Scalable Collaboration**: Opens the door to scalable, AI-aware InnerSource collaboration across teams without sacrificing code quality
+* **Knowledge Preservation**: Project knowledge becomes more explicit and accessible, reducing dependency on tribal knowledge
+* **Faster Onboarding**: New contributors can leverage AI tools effectively from day one, reducing the learning curve for project-specific patterns
+* **Consistent Evolution**: As AI tools improve, the context ensures that enhanced capabilities are channeled toward project-appropriate solutions
+
+## Rationale
+
+This pattern addresses the fundamental mismatch between AI tools' general training and project-specific requirements. By providing structured, easily consumable context, we enable AI tools to generate code that feels like it was written by an experienced project contributor rather than an outsider.
+
+The `innersource-ai/` folder approach is intentionally explicit and discoverable, making it clear to both humans and AI tools where to find project-specific guidance. The modular structure allows teams to implement incrementally, starting with basic style guides and expanding to more sophisticated examples and configurations as needed.
+
+This solution balances the productivity benefits of AI tools with the quality requirements of professional software development, creating a sustainable approach to AI-assisted InnerSource collaboration.
+
+## Status
+
+* Initial
+* Drafted in August 2025
+
+## Authors
+
+* [Amburi Roy](https://www.linkedin.com/in/amburi/)
+
+## Related Patterns
+
+* [Standard Base Documentation](../2-structured/base-documentation.md) - Provides the foundation documentation that AI context builds upon
+* [InnerSource Portal](../2-structured/innersource-portal.md) - Could include AI context availability as searchable metadata for project discovery
+* [Trusted Committer](../2-structured/trusted-committer.md) - Trusted Committers are responsible for maintaining and evolving the AI context