Skip to content

Overlap CUDA graph building and processing to minimize GPU idle time and improve tokens per seconds performance. #21788

Overlap CUDA graph building and processing to minimize GPU idle time and improve tokens per seconds performance.

Overlap CUDA graph building and processing to minimize GPU idle time and improve tokens per seconds performance. #21788

Workflow file for this run

name: EditorConfig Checker
on:
workflow_dispatch: # allows manual triggering
inputs:
create_release:
description: 'Create new release'
required: true
type: boolean
push:
branches:
- master
pull_request:
branches:
- master
concurrency:
group: ${{ github.workflow }}-${{ github.head_ref && github.ref || github.run_id }}
cancel-in-progress: true
jobs:
editorconfig:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: editorconfig-checker/action-editorconfig-checker@v2
with:
version: v3.0.3
- run: editorconfig-checker