
Conversation


@kdestin kdestin commented Jan 8, 2025

Description

This pull request adds initial support for using pytest to run Jupyter Notebooks to facilitate sample validation.


Specifically this pull request:

  • Adds infrastructure to enable testing of samples
    • Test resource deployment with Bicep
    • pytest as a frontend for running sample validation
      • Custom plugin that automatically discovers samples as "pytest tests" and executes them with papermill (a minimal sketch follows this list)
      • Custom plugin that only runs samples that have changed in a pull request
  • Fixes some broken evaluate notebooks
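
For context, the notebook-discovery plugin mentioned above can be approximated with pytest's collection hooks. The sketch below is illustrative only: class names like `NotebookFile`/`NotebookItem` and the `.output.ipynb` naming are assumptions rather than the actual plugin's API, and it assumes pytest >= 7 and papermill are installed.

```python
# conftest.py -- a minimal sketch, not the plugin shipped in this PR.
from pathlib import Path

import papermill as pm
import pytest


def pytest_collect_file(file_path: Path, parent):
    # Collect every notebook in the sample tree as a test file.
    if file_path.suffix == ".ipynb":
        return NotebookFile.from_parent(parent, path=file_path)
    return None


class NotebookFile(pytest.File):
    def collect(self):
        # One test item per notebook.
        yield NotebookItem.from_parent(self, name=self.path.name, notebook=self.path)


class NotebookItem(pytest.Item):
    def __init__(self, *, notebook: Path, **kwargs):
        super().__init__(**kwargs)
        self.notebook = notebook

    def runtest(self):
        # Execute the notebook with papermill; any failing cell fails the test.
        pm.execute_notebook(
            input_path=str(self.notebook),
            output_path=str(self.notebook.with_suffix(".output.ipynb")),
            cwd=str(self.notebook.parent),
        )
```

The real plugin additionally supports opting samples out and skips templates such as template.ipynb (see the commit notes below).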

Background

Azure/azureml-examples tests its samples using GitHub Actions as a test runner. Maintaining that prior art revealed some pain points:

  • Difficult to orchestrate validation runs (everything runs in parallel all at once, with limited options for control).
  • Hard to run a non-trivial number of samples locally.
  • The monitoring story isn't ideal: GitHub's UI for Actions isn't optimized for repos with hundreds of workflows.
  • Onboarding new samples into the test suite is manual and often isn't done by contributors (it can be unclear whether untested samples are skipped intentionally).

Additionally, infrastructure deployment is based on a large collection of bash scripts, which can make reasoning about the resources that get deployed difficult.


Using pytest as a test runner for azureai-samples addresses several of those pain points:

  • The testing workflow is local-first: deploy resources with Bicep, then run pytest.
  • Samples (specifically Jupyter Notebooks and Python samples with included tests) are automatically discovered and must be explicitly opted out of testing.
  • Test run orchestration can be controlled (sequential by default, configurable with plugins like pytest-xdist, pytest-retry, pytest-randomly, etc.).
  • Native support for generating test reports in a format widely understood by other tools (JUnit XML); see the example after this list.
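
For illustration, a local run of the suite might look roughly like this. It is a sketch only: the file name run_samples.py and the report path are hypothetical, and "-n auto" assumes pytest-xdist is installed.

```python
# run_samples.py -- hypothetical local entry point, not part of this PR.
import sys

import pytest

# Parallelize across CPUs with pytest-xdist ("-n auto") and emit a JUnit XML
# report that CI dashboards and other tools can consume.
sys.exit(pytest.main(["-n", "auto", "--junitxml=test-results.xml"]))
```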

Using Bicep for deployment should make the infrastructure easier to maintain and reason about (rule of least power).

Checklist

  • I have read the contribution guidelines
  • I have coordinated with the docs team (mldocs@microsoft.com) if this PR deletes files or changes any file names or file extensions.
  • This notebook or file is added to the CODEOWNERS file, pointing to the author or the author's team.

    This commit introduces a custom pytest plugin that forces
    pytest to only collect samples that have changed either:

      * in the working tree compared to HEAD
      * in HEAD compared to main

    We assume that a sample has changed if any file has changed in
    the directory we collected a test from, which is consistent with
    how the contributing guidelines say a sample should be packaged.
    This criterion for detecting change may need to be iterated on later.
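
For illustration, the changed-sample detection described above could be sketched with git and pytest's collection hooks roughly as follows. It assumes pytest is invoked from the repository root with git on PATH; the helper name `_changed_paths` is illustrative, not the plugin's actual API.

```python
# conftest.py (sketch) -- skip samples whose directory has no changed files.
import subprocess
from pathlib import Path

import pytest


def _changed_paths() -> set[Path]:
    """Files changed in the working tree vs HEAD, plus HEAD vs main."""
    changed: set[Path] = set()
    for args in (
        ["git", "diff", "--name-only", "HEAD"],         # working tree vs HEAD
        ["git", "diff", "--name-only", "main...HEAD"],  # HEAD vs merge-base with main
    ):
        out = subprocess.run(args, capture_output=True, text=True, check=True).stdout
        changed.update(Path(line).resolve() for line in out.splitlines() if line)
    return changed


def pytest_collection_modifyitems(config, items):
    changed = _changed_paths()
    skip = pytest.mark.skip(reason="no files changed in this sample's directory")
    for item in items:
        sample_dir = item.path.parent
        # A sample counts as "changed" if any changed file lives under its directory.
        if not any(sample_dir in p.parents for p in changed):
            item.add_marker(skip)
```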
    To prevent pytest from trying to run template.ipynb

    Project deployments sometimes error when they happen concurrently

    Referred to the ARM export of a resource group with a set-up AI project.
@kdestin kdestin requested a review from a team as a code owner January 8, 2025 23:24
@kdestin kdestin force-pushed the testing-infrastructure branch 6 times, most recently from 1fc2ccb to 644654f Compare January 9, 2025 21:28
    Redeploying CognitiveServices/accounts will sometimes fail with an
    error that "publicNetworkAccess" is required.
@kdestin kdestin force-pushed the testing-infrastructure branch 2 times, most recently from bba19b7 to becef6a Compare January 13, 2025 19:36
@kdestin kdestin merged commit 78704bd into Azure-Samples:main Jan 13, 2025
3 checks passed