Contributor Guide

Dev Environment Tips

If you make changes to the frontend, make sure to run pnpm lint-fix within the web folder
If you make changes to the backend or sdk, make sure to run ruff format and ruff check --fix within the sdk or api folder
If you update Ant Design tokens, run pnpm generate:tailwind-tokens in the web folder and commit the generated file

Testing Instructions

Tests are currently still not working and should not be run

PR instructions

If the user provides you with the issue id, title the PR: [issue-id] fix(frontend): <Title> where fix is the type (fix, feat, chore, ci, doc, test.. [we're using better-branch) and frontend is where and it could be api, sdk, frontend, docs, ..

Import Aliases Best Practices

The monorepo uses TypeScript path aliases for cleaner imports. Understanding when to use each pattern is important for maintainability.

Available Aliases

@/oss/* - Resolves with fallback order: ee/src/* → oss/src/*
@agenta/oss/src/* - Explicit import from OSS package (npm workspace)
@/agenta-oss-common/* - Similar fallback to @/oss/* (less common)

When to Use Each Pattern

Use `@/oss/*` for shared utilities and state

Use this pattern when importing shared utilities, helpers, types, hooks, or state that work the same in both EE and OSS:

// ✅ Good - Shared utilities
import {getEnv} from "@/oss/lib/helpers/dynamicEnv"
import {useAppTheme} from "@/oss/components/Layout/ThemeContextProvider"
import {User, JSSTheme} from "@/oss/lib/Types"
import {selectedOrgIdAtom} from "@/oss/state/org"
import axios from "@/oss/lib/api/assets/axiosConfig"

Why: The fallback mechanism allows EE to override implementations if needed, while falling back to OSS by default.

Use `@agenta/oss/src/*` for explicit OSS imports

Use this pattern when EE code needs to explicitly reference the OSS version of a component or page, typically for:

Extending/wrapping OSS components
Re-exporting OSS pages with EE enhancements
Ensuring you get the OSS implementation (not an EE override)

// ✅ Good - Explicit OSS component import
import OssSidebarBanners from "@agenta/oss/src/components/SidebarBanners"
import ObservabilityPage from "@agenta/oss/src/pages/w/[workspace_id]/p/[project_id]/observability"
import {DeploymentRevisions} from "@agenta/oss/src/lib/types_ee"

Why: This bypasses the fallback mechanism and guarantees you're importing from the OSS package.

Never use relative paths for cross-package imports

// ❌ Bad - Fragile and hard to maintain
import OssSidebarBanners from "../../../../oss/src/components/SidebarBanners"

// ✅ Good - Use explicit alias
import OssSidebarBanners from "@agenta/oss/src/components/SidebarBanners"

Why: Relative paths break easily with refactoring and are harder to read.

Examples in the Codebase

Shared utilities with @/oss/*:

web/ee/src/state/billing/atoms.ts - Uses @/oss/* for API utilities, types, and state atoms
web/ee/src/hooks/useCrispChat.ts - Uses @/oss/* for environment helpers

Explicit OSS imports with @agenta/oss/src/*:

web/ee/src/components/SidebarBanners/index.tsx - Wraps OSS component
web/ee/src/pages/w/[workspace_id]/p/[project_id]/apps/[app_id]/traces/index.tsx - Re-exports OSS page
web/ee/src/components/DeploymentHistory/DeploymentHistory.tsx - Uses EE-specific types from OSS

Quick Decision Guide

Are you in EE code importing from OSS?
├─ Is it a component/page that EE extends or wraps?
│  └─ Use: @agenta/oss/src/*
├─ Is it a utility, helper, type, or state atom?
│  └─ Use: @/oss/*
└─ Not sure?
   └─ Use: @agenta/oss/src/* (explicit is safer)

Architecture Overview

Our folder structure follows a module-based architecture that prioritizes maintainability, reusability, and clear separation of concerns.

Core Principles

Modular Organization
- Modules represent distinct feature areas (similar to pages)
- Each module is self-contained with its own components, hooks, and assets
- Shared functionality is elevated to appropriate hierarchy levels
Component Structure
- Components are organized by their scope of use
- Each component may contain:
  - Presentational logic (Component.tsx)
  - UI-only subcomponents (components/*.tsx)
  - Component-specific hooks (hooks/*.ts)
  - Local constants and utilities (assets/*.ts)
  - Type definitions (types.d.ts)
Code Movement Guidelines The following rules determine where code should live:
- Module-specific code stays within the module
- Components used across multiple modules move to root /components
- Hooks used across multiple modules move to root /hooks
- UI elements, constants, or utilities used across modules move to root /assets
- Types used across modules move to root types.d.ts

State Management

Store Organization

Each module can have its own store folder containing:
- Jotai atoms for reactive state
- Global store at root level for cross-module state

State Movement Guidelines
- State used only within a component stays as local state
- State shared between components in a module uses module-level store
- State shared across modules moves to root /store
- Consider these factors when choosing state location:
  - Scope of state usage
  - Frequency of updates
  - Performance implications
  - Data persistence requirements
State Management Tools
- Prefer Jotai atoms for all kind of shared state
- Local component state for UI-only concerns
Avoiding Prop Drilling
- When state is only meaningful within a component tree: Use Jotai atoms instead of prop drilling
- Prop drilling (passing props through multiple levels) makes code brittle and hard to maintain
- Atoms allow any component in the tree to access state without intermediate components knowing about it

Example - Avoid prop drilling:

❌ Don't do this:

function Parent() {
    const [selectedId, setSelectedId] = useState(null)
    return <Child1 selectedId={selectedId} setSelectedId={setSelectedId} />
}

function Child1({selectedId, setSelectedId}) {
    // Child1 doesn't use these props, just passes them down
    return <Child2 selectedId={selectedId} setSelectedId={setSelectedId} />
}

function Child2({selectedId, setSelectedId}) {
    return <GrandChild selectedId={selectedId} setSelectedId={setSelectedId} />
}

function GrandChild({selectedId, setSelectedId}) {
    // Finally uses them here
    return <div onClick={() => setSelectedId(123)}>{selectedId}</div>
}

✅ Use atoms instead:

// In module store or appropriate location
export const selectedIdAtom = atom<string | null>(null)

function Parent() {
    return <Child1 />
}

function Child1() {
    // No props needed
    return <Child2 />
}

function Child2() {
    return <GrandChild />
}

function GrandChild() {
    // Direct access to state
    const [selectedId, setSelectedId] = useAtom(selectedIdAtom)
    return <div onClick={() => setSelectedId(123)}>{selectedId}</div>
}

When to use atoms vs props:

Use props when: Parent component owns/controls the state, single level passing, or props are configuration/callbacks
Use atoms when: State needs to be shared across non-parent-child components, multiple levels of drilling, or state is module/feature-scoped

Persisted State with LocalStorage

For state that needs to persist across browser sessions, use atomWithStorage from jotai/utils:

import {atomWithStorage} from "jotai/utils"

// Simple usage - automatically syncs with localStorage
export const rowHeightAtom = atomWithStorage<"small" | "medium" | "large">(
    "agenta:table:row-height", // localStorage key
    "medium", // default value
)

// Usage in components - same as regular atoms
const [rowHeight, setRowHeight] = useAtom(rowHeightAtom)

For storing app/module-scoped data:

// Storage atom holds all app-specific data
const selectedVariantsByAppAtom = atomWithStorage<Record<string, string[]>>(
    "agenta_selected_revisions_v2",
    {},
)

// Derived atom provides scoped access per app
export const selectedVariantsAtom = atom(
    (get) => {
        const appId = get(routerAppIdAtom) || "__global__"
        const all = get(selectedVariantsByAppAtom)
        return all[appId] || []
    },
    (get, set, next: string[]) => {
        const appId = get(routerAppIdAtom) || "__global__"
        const all = get(selectedVariantsByAppAtom)
        set(selectedVariantsByAppAtom, {...all, [appId]: next})
    },
)

For nullable strings, use custom stringStorage:

import {stringStorage} from "@/oss/state/utils/stringStorage"

export const recentAppIdAtom = atomWithStorage<string | null>(
    "agenta:recent-app",
    null,
    stringStorage, // Handles null values properly
)

When to use atomWithStorage:

User preferences (theme, row height, view mode)
Recently used items (recent app, recent filter)
UI state that should persist (sidebar open/closed, panel sizes)
Form drafts or temporary data

Best practices:

Prefix keys with agenta: for consistency (e.g., "agenta:table:row-height")
Use TypeScript types for type safety
Provide sensible defaults
For complex objects, atomWithStorage handles JSON serialization automatically
For nullable strings, use stringStorage helper

Examples in codebase:

web/oss/src/components/EvalRunDetails2/state/rowHeight.ts - User preference
web/oss/src/state/app/atoms/fetcher.ts - Recent app tracking
web/oss/src/components/Playground/state/atoms/core.ts - App-scoped selections

Implementation Strategy

Current Approach: Gradual adoption during regular development
Migration: Update components to follow this structure as they are modified
No Big Bang: Avoid large-scale refactoring
Progressive Enhancement: Easy to implement incrementally

This structure supports:

Clear ownership and responsibility
Easy code review and modification
Identification of reusable patterns
Natural code organization based on usage
Scalable architecture that grows with the application

Data Fetching Best Practices

Primary Pattern: Jotai Atoms with TanStack Query

For data fetching, use atomWithQuery from jotai-tanstack-query. This combines Jotai's reactive state with TanStack Query's caching and synchronization.

When to use atomWithQuery:

Fetching data from APIs
When query depends on other atoms (e.g., projectIdAtom, appIdAtom)
Sharing data across multiple components
Need caching, loading states, and automatic refetching

Basic Pattern:

import {atomWithQuery} from "jotai-tanstack-query"

export const dataQueryAtom = atomWithQuery((get) => {
    const projectId = get(projectIdAtom) // Read dependencies
    
    return {
        queryKey: ["data", projectId], // Include all dependencies
        queryFn: () => fetchData(projectId),
        staleTime: 60_000,
        refetchOnWindowFocus: false,
        enabled: !!projectId, // Conditional fetching
    }
})

// Usage in components
const query = useAtomValue(dataQueryAtom)
const data = query.data
const isLoading = query.isPending

For parameterized queries, use atomFamily:

export const itemQueryAtomFamily = atomFamily((itemId: string) =>
    atomWithQuery((get) => {
        const projectId = get(projectIdAtom)
        return {
            queryKey: ["item", itemId, projectId],
            queryFn: () => fetchItem(itemId),
            enabled: !!itemId && !!projectId,
        }
    })
)

// Usage
const itemQuery = useAtomValue(itemQueryAtomFamily(itemId))

Derived atoms for data transformation:

export const dataAtom = selectAtom(
    dataQueryAtom,
    (res) => res.data ?? [],
    deepEqual
)

Mutations and invalidation:

export const createItemAtom = atom(
    null,
    async (_get, _set, payload) => {
        const res = await createItem(payload)
        await queryClient.invalidateQueries({queryKey: ["items"]})
        return res
    }
)

Key Principles:

Include all reactive dependencies in queryKey
Use enabled for conditional queries
Use selectAtom for derived data
Invalidate queries after mutations
Set appropriate staleTime for caching

Examples in codebase:

web/oss/src/state/profile/selectors/user.ts - Simple query
web/oss/src/state/environment/atoms/fetcher.ts - Multi-dependency query
web/oss/src/state/queries/atoms/fetcher.ts - Atom family with parameters
web/oss/src/state/testset/hooks/useTestset.ts - Hook wrapper pattern

Entity Controller Pattern

For entities requiring CRUD operations with draft state, loading indicators, and cache management, use the Entity Controller Pattern. This provides a unified API that abstracts multiple atoms into a single cohesive interface.

Full documentation: web/oss/src/state/entities/shared/README.md

Quick Decision - Which API to Use:

Need	API	Returns
Full state + actions	`entity.controller(id)`	`[state, dispatch]`
Data only	`entity.selectors.data(id)`	`T \| null`
Loading/error	`entity.selectors.query(id)`	`QueryState<T>`
Dirty indicator	`entity.selectors.isDirty(id)`	`boolean`
Single cell (tables)	`entity.selectors.cell({id, col})`	`unknown`
Dispatch in atoms	`entity.actions.update/discard`	Write atom

Basic Usage:

import {testcase} from "@/oss/state/entities/testcase"

// Full controller - state + dispatch
function TestcaseEditor({testcaseId}: {testcaseId: string}) {
  const [state, dispatch] = useAtom(testcase.controller(testcaseId))

  if (state.isPending) return <Skeleton />
  if (!state.data) return <NotFound />

  return (
    <Input
      value={state.data.input}
      onChange={(e) => dispatch({
        type: "update",
        changes: {input: e.target.value}
      })}
    />
  )
}

// Fine-grained selector - only re-renders on data change
function TestcaseDisplay({testcaseId}: {testcaseId: string}) {
  const data = useAtomValue(testcase.selectors.data(testcaseId))
  if (!data) return null
  return <div>{data.input}</div>
}

Reading Multiple Entities:

// Create a derived atom that subscribes to all selected entities
const useMultipleTestcases = (ids: string[]) => {
  const dataAtom = useMemo(
    () => atom((get) => ids.map(id => get(testcase.selectors.data(id))).filter(Boolean)),
    [ids.join(",")]
  )
  return useAtomValue(dataAtom)
}

Anti-Patterns to Avoid:

// BAD - No reactivity, snapshot read
const globalStore = getDefaultStore()
const data = globalStore.get(testcase.selectors.data(id))

// GOOD - Proper subscription
const data = useAtomValue(testcase.selectors.data(id))

// BAD - Variable shadowing
import {testcase} from "@/oss/state/entities/testcase"
const {testcase, ...rest} = entity  // Shadows import!

// GOOD - Rename destructured variable
const {testcase: testcaseField, ...rest} = entity

Available Controllers:

Entity	Import	Description
Testcase	`testcase` from `@/oss/state/entities/testcase`	Testcase with cell subscriptions + drill-in
Trace Span	`traceSpan` from `@/oss/state/entities/trace`	Trace span with attribute drill-in
Revision	`revision` from `@/oss/state/entities/testset`	Revision with column management
Testset	`testset` from `@/oss/state/entities/testset`	Testset with list/detail queries

Architecture:

┌─────────────────────────────────────────────────────────────────┐
│                       Controller                                 │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐              │
│  │   Query     │  │   Draft     │  │   isDirty   │              │
│  │ (server)    │→ │  (local)    │→ │  (derived)  │              │
│  └─────────────┘  └─────────────┘  └─────────────┘              │
│         ↓               ↓                                        │
│  ┌─────────────────────────────────────────────────────────────┐│
│  │                 Entity Atom (merged)                        ││
│  └─────────────────────────────────────────────────────────────┘│
└─────────────────────────────────────────────────────────────────┘

Query atoms are the single source of truth for server data
Draft atoms store local changes only
Entity atoms merge: query.data + draft → merged entity
Dirty detection compares draft to server data

Legacy: SWR Pattern (avoid for new code)

We previously used SWR with Axios for data fetching. This pattern is still present in older code but should not be used for new features.

❌ Avoid: useEffect for Data Fetching

Don't use useEffect with manual state management for data fetching:

// DON'T DO THIS
useEffect(() => {
    fetchData().then(setData).catch(setError)
}, [])

Use atomWithQuery instead (see above).

Styling Best Practices

Use Tailwind CSS (Preferred)

Always prefer Tailwind utility classes over CSS-in-JS or separate CSS files for styling whenever possible.

✅ Preferred: Tailwind classes

// Good - Uses Tailwind utilities
<main className="flex flex-col grow h-full overflow-hidden items-center justify-center">
    <Card className="max-w-[520px] w-[90%] text-center">
        <Typography.Title level={3} className="!mb-2">
            Unable to establish connection
        </Typography.Title>
    </Card>
</main>

❌ Avoid: CSS-in-JS (react-jss, styled-components)

// Avoid - Creates extra overhead and complexity
const useStyles = createUseStyles((theme: JSSTheme) => ({
    collapseContainer: {
        "& .ant-collapse-header": {
            backgroundColor: `#FAFAFB !important`,
        },
    },
}))

function Component() {
    const classes = useStyles()
    return <div className={classes.collapseContainer}>...</div>
}

❌ Avoid: Inline styles

// Avoid - Not themeable, harder to maintain
<div style={{maxWidth: "520px", width: "90%", textAlign: "center"}}>

When CSS-in-JS is acceptable:

Complex Ant Design component overrides that can't be done with Tailwind
Dynamic theme-dependent styles that require JS calculations
Legacy components (refactor to Tailwind when touching the code)

Tailwind benefits:

No style bloat or unused CSS
Consistent design system
Better performance (no runtime style injection)
Easier to read and maintain
Works seamlessly with Ant Design

Examples in codebase:

web/oss/src/components/CustomWorkflowBanner/index.tsx - Good Tailwind usage
web/oss/src/components/ChatInputs/ChatInputs.tsx - Mixed (being migrated)

React Best Practices

Component Reusability

Before implementing similar functionality in multiple places, consider reusability:

When you notice patterns that could be extracted:

Don't immediately refactor - Jumping straight to abstraction can over-engineer
Ask the developer with context about the potential for reuse
Provide analysis: Show where similar code exists and potential benefits/costs of refactoring

Example prompt when detecting reusability:

I notice this table cell rendering logic is similar to:
- components/EvalRunDetails2/TableCells/MetricCell.tsx
- components/Evaluators/cells/MetricDisplayCell.tsx

Before implementing, would you like me to:
A) Create a reusable component (requires refactoring both existing usages)
B) Proceed with current implementation (can consolidate later if pattern repeats)

The trade-off: (A) takes more time now but improves maintainability; (B) is faster but may create tech debt.

When to extract components:

Used in 3+ places with similar logic
Complex logic that benefits from isolation
Clear, stable interface that won't change often

When NOT to extract:

Only used twice (wait for third usage to confirm pattern)
Requirements are still evolving
Small, simple components (< 20 lines)

Performance Considerations

Critical for evaluations and observability features - these handle large datasets:

Minimize Re-renders
- Use useMemo for expensive computations
- Use React.memo for components that receive stable props
- Avoid inline functions/objects in render (especially in lists)

// ❌ Bad - Creates new function every render
{items.map(item => <Row key={item.id} onClick={() => handleClick(item)} />)}

// ✅ Good - Stable callback
const handleRowClick = useCallback((item) => handleClick(item), [])
{items.map(item => <Row key={item.id} onClick={handleRowClick} item={item} />)}

Optimize Query Updates
- Be mindful of queryKey dependencies - don't include frequently changing values unnecessarily
- Use select option in queries to extract only needed data
- Consider staleTime for data that doesn't change often

// ❌ Bad - Refetches on every UI update
atomWithQuery((get) => ({
    queryKey: ["data", get(currentTimeAtom)], // currentTimeAtom updates every second!
    queryFn: fetchData
}))

// ✅ Good - Only refetches when meaningful dependencies change
atomWithQuery((get) => ({
    queryKey: ["data", get(projectIdAtom), get(filterAtom)],
    queryFn: fetchData,
    staleTime: 60_000 // Cache for 1 minute
}))

Virtualization for Large Lists
- Use virtual scrolling for lists with 100+ items
- Reference: InfiniteVirtualTable component
Debounce/Throttle User Input
- Debounce search inputs, filters
- Throttle scroll handlers, resize handlers

Modular Component Design

Keep components focused and decoupled:

✅ Good: Component owns its internal concerns

// Component only needs IDs, fetches its own data
function UserCard({userId}: {userId: string}) {
    const user = useAtomValue(userQueryAtomFamily(userId))
    return <Card>{user.name}</Card>
}

// Parent doesn't need to know about user data structure
function UserList({userIds}: {userIds: string[]}) {
    return userIds.map(id => <UserCard key={id} userId={id} />)
}

❌ Bad: Parent must know too much

// Parent must fetch and pass everything
function UserCard({
    userName,
    userEmail,
    userAvatar,
    userRole,
    userDepartment
}: {/* many props */}) {
    return <Card>...</Card>
}

// Parent is tightly coupled to UserCard's needs
function UserList({userIds}: {userIds: string[]}) {
    const users = useAtomValue(usersQueryAtom) // Must fetch all data
    return users.map(user => (
        <UserCard
            key={user.id}
            userName={user.name}
            userEmail={user.email}
            userAvatar={user.avatar}
            userRole={user.role}
            userDepartment={user.department}
        />
    ))
}

Principles:

High cohesion: Component contains related logic together
Low coupling: Minimal dependencies on parent/sibling components
Props should be minimal: Pass IDs/keys, not entire data structures when possible
Components fetch their own data: Use atoms with queries for data needs
Single Responsibility: Each component does one thing well

Benefits:

Easier to test in isolation
Can reuse without bringing unnecessary dependencies
Changes to one component don't cascade to others
Clear interfaces and responsibilities

Avoiding Inline Array Props

Passing inline arrays of objects with heavy content such as JSX is considered a bad practice in React. This is because it can lead to unnecessary re-renders and performance issues. When you pass an inline array, a new array is created every time the component renders, causing React to think that the prop has changed even if the content is the same.

For example, in the AccordionTreePanel component, the items prop is passed an inline array of objects with JSX content:

❌ Avoid this pattern:

<AccordionTreePanel
  items={[
    {
      title: "Item 1",
      content: <div>Content 1</div>,
    },
    {
      title: "Item 2",
      content: <div>Content 2</div>,
    },
  ]}
/>

✅ Use this pattern:

import {useMemo} from "react"

const items = useMemo(
    () => [
        {
            title: "Item 1",
            content: <div>Content 1</div>,
        },
        {
            title: "Item 2",
            content: <div>Content 2</div>,
        },
    ],
    [],
)

<AccordionTreePanel items={items} />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributor Guide

Dev Environment Tips

Testing Instructions

PR instructions

Import Aliases Best Practices

Available Aliases

When to Use Each Pattern

Use `@/oss/*` for shared utilities and state

Use `@agenta/oss/src/*` for explicit OSS imports

Never use relative paths for cross-package imports

Examples in the Codebase

Quick Decision Guide

Architecture Overview

Core Principles

State Management

Implementation Strategy

Data Fetching Best Practices

Entity Controller Pattern

❌ Avoid: useEffect for Data Fetching

Styling Best Practices

Use Tailwind CSS (Preferred)

React Best Practices

Component Reusability

Performance Considerations

Modular Component Design

Avoiding Inline Array Props

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

Contributor Guide

Dev Environment Tips

Testing Instructions

PR instructions

Import Aliases Best Practices

Available Aliases

When to Use Each Pattern

Use @/oss/* for shared utilities and state

Use @agenta/oss/src/* for explicit OSS imports

Never use relative paths for cross-package imports

Examples in the Codebase

Quick Decision Guide

Architecture Overview

Core Principles

State Management

Implementation Strategy

Data Fetching Best Practices

Entity Controller Pattern

❌ Avoid: useEffect for Data Fetching

Styling Best Practices

Use Tailwind CSS (Preferred)

React Best Practices

Component Reusability

Performance Considerations

Modular Component Design

Avoiding Inline Array Props

Use `@/oss/*` for shared utilities and state

Use `@agenta/oss/src/*` for explicit OSS imports