Skip to content

Conversation

@Pavan-Microsoft
Copy link
Contributor

Purpose

This pull request introduces several infrastructure and documentation updates to improve deployment clarity, environment setup, and configuration flexibility. The most significant changes include the removal of the Key Vault module and associated deployment scripts from the Bicep template, expanded documentation for data processing scripts, and updates to Python version requirements. Additionally, new outputs and parameters have been added to the Bicep template to support integration with AI and storage services.

Infrastructure changes:

  • Removed the Key Vault module and its configuration from infra/main.bicep, simplifying the deployment and secret management approach.
  • Removed deployment scripts for uploading demo data, creating search indexes, and creating SQL users/roles from infra/main.bicep, delegating these tasks to manual or external script execution.
  • Added new Bicep parameters and outputs for Content Understanding API version, backend user managed identity, storage account/container names, and AI Foundry resource IDs to facilitate integration and downstream automation. [1] [2] [3] [4]
  • Updated the DNS zone index and removed unused variables for clarity in infra/main.bicep. [1] [2]

Documentation and setup improvements:

  • Updated documentation in DeploymentGuide.md and CustomizeData.md to instruct users to run new process_sample_data.sh and process_custom_data.sh scripts for data processing, including detailed parameter instructions. [1] [2]
  • Clarified Python version requirements to explicitly support Python 3.9 through 3.11 in all relevant documentation. [1] [2]
  • Enhanced post-deployment instructions in azure.yaml to guide users on processing sample data via Bash scripts.

Dev environment updates:

  • Added the mssql-odbc-driver feature (version 17) to the devcontainer configuration for improved SQL Server connectivity.
  • Updated setup_env.sh to reference the new data processing scripts and ensure correct permissions.

Does this introduce a breaking change?

  • Yes
  • No

Golden Path Validation

  • I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

  • I have validated the deployment process successfully and all services are running as expected with this change.

- Introduced `04_cu_process_custom_data.py` for processing custom data and integrating with Azure services.
- Removed obsolete `azure_credential_utils.py` as its functionality is now integrated elsewhere.
- Updated `content_understanding_client.py` to improve error handling.
- Created `process_custom_data_scripts.sh` for streamlined script execution and dependency management.
- Enhanced `process_data_scripts.sh` to include additional parameters and improved error handling.
- Refactored `run_create_index_scripts.sh` to support Azure authentication and role assignment.
- Deleted `run_create_index_scripts_manual.sh` as its functionality is now covered in the updated script.
- Adjusted `run_process_data_scripts.sh` to reference the new Bicep file for custom data processing.
…ole assignments, and error handling; remove run_process_data_scripts.sh
…mands in Azure YAML and update SQL output directory path in Python script
…rove error handling in bash script for enabling public access
…lt dependencies and streamline parameter handling

- Removed Key Vault related parameters and configurations from Bicep templates.
- Updated Python scripts to accept command line arguments for necessary endpoints and models instead of retrieving them from Key Vault.
- Modified shell scripts to pass new parameters to Python scripts for improved flexibility and clarity.
- Cleaned up unused variables and consolidated logic for better maintainability.
…essing data

- Introduced a new script `process_custom_data.sh` to manage public network access for Azure resources and execute data processing.
- Implemented functions to enable and restore public access for Storage Account, AI Foundry, CU Foundry, and SQL Server.
- Added error handling and logging for network access changes.
- Refactored existing `process_sample_data.sh` to remove deployment output retrieval logic, now handled in `process_custom_data.sh`.
- Removed SQL table creation logic from `run_create_index_scripts.sh` to streamline the process.
…Azure services and Content Understanding API
…SSQL ODBC driver and correct script permissions
@Pavan-Microsoft Pavan-Microsoft marked this pull request as ready for review December 15, 2025 08:15
@Pavan-Microsoft Pavan-Microsoft marked this pull request as draft December 15, 2025 09:31
@Pavan-Microsoft Pavan-Microsoft marked this pull request as ready for review December 15, 2025 10:44
@Avijit-Microsoft Avijit-Microsoft merged commit ea9960e into dev Dec 15, 2025
5 checks passed
@github-actions
Copy link
Contributor

🎉 This PR is included in version 3.17.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants