Huggingface deployer #4119

htahir1 · 2025-11-02T16:43:54Z

Describe changes

I implemented a new Hugging Face deployer component that allows deploying containerized applications to Hugging Face Spaces. This deployer complements the existing Hugging Face model deployer by providing a more general-purpose deployment option for any containerized application.

The implementation includes:

A new HuggingFaceDeployer class that extends ContainerizedDeployer
Configuration options for Space hardware, storage, and visibility
Support for authentication via token, secret, or environment variable
Methods to provision, deprovision, and check deployment status

Pre-requisites

Please ensure you have done the following:

I have read the CONTRIBUTING.md document.
I have added tests to cover my changes.
I have based my new branch on develop and the open PR is targeting develop. If your branch wasn't based on develop read Contribution guide on rebasing branch to develop.
IMPORTANT: I made sure that my changes are reflected properly in the following resources:
- ZenML Docs
- Dashboard: Needs to be communicated to the frontend team.
- Templates: Might need adjustments (that are not reflected in the template tests) in case of non-breaking changes and deprecations.
- Projects: Depending on the version dependencies, different projects might get affected.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Other (add details above)

Implements a new Huggingface deployer that allows deploying ZenML pipelines as Docker-based Huggingface Spaces. This deployer extends ContainerizedDeployer and uses the huggingface_hub API to manage Space lifecycles. Key features: - Create and update Huggingface Spaces with Docker SDK - Support for hardware tier selection (CPU, GPU options) - Support for persistent storage tiers - Automatic Space naming with configurable prefixes - Full deployment lifecycle management (provision, status, deprovision) Implementation includes: - HuggingfaceDeployer: Main deployer class with Space management - HuggingfaceDeployerFlavor: Flavor registration for the stack - HuggingfaceDeployerConfig: Configuration including token and defaults - HuggingfaceDeployerSettings: Per-deployment settings for hardware/storage - HuggingfaceDeploymentMetadata: Metadata tracking for Space deployments The deployer automatically generates Dockerfile and README.md for Spaces, handles authentication via token or environment variables, and provides proper error handling for Space operations.

Addressed several logical issues found during code review: 1. Fixed environment variable name: Changed from HUGGING_FACE_HUB_TOKEN to the correct HF_TOKEN as per huggingface_hub documentation 2. Removed unused 'hardware' variable in _create_readme method 3. Improved SpaceStage status mapping to handle all possible states: - Added support for RUNNING_BUILDING, BUILD_ERROR, RUNTIME_ERROR - Added support for CONFIG_ERROR, NO_APP_FILE, DELETING - Better categorization of error vs pending states 4. Fixed Dockerfile environment variable escaping to properly handle backslashes followed by quotes 5. Added safety check for empty deployment names in _sanitize_space_name to prevent edge case failures 6. Fixed potential None access in log error message by checking if space_url exists before using it These fixes ensure more robust error handling and better compatibility with the Huggingface Spaces API.

Removed over-engineered code to make it minimal and maintainable: - Removed HuggingfaceDeploymentMetadata class, use simple dict instead - Removed _sanitize_space_name complexity, simplified to basic regex - Inlined _create_readme and _create_dockerfile methods - Simplified status mapping from 10+ cases to 3 simple cases - Removed unnecessary exception wrapping and verbose logging - Removed unused imports and helper methods The code is now ~200 lines instead of ~500 lines, much easier to understand and maintain while keeping all core functionality.

Fixed several bugs found during code review: 1. CRITICAL: Fixed Dockerfile env var escaping - values with quotes or backslashes would break the Dockerfile. Now properly escapes both. 2. Fixed empty space name handling - if deployment name only contains special characters, now defaults to 'deployment' instead of empty string 3. Added try/except around hardware and storage API calls to prevent them from crashing the entire provisioning if they fail 4. Changed flavor name from 'huggingface' to 'huggingface-spaces' to avoid confusion with the existing model deployer flavor These fixes make the deployer more robust and handle edge cases properly.

CLAassistant · 2025-11-02T16:44:00Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ htahir1
❌ claude
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

Aligned the new deployer with existing Hugging Face integration patterns: 1. Naming consistency: Changed 'Huggingface' to 'HuggingFace' (capital F) to match existing classes like HuggingFaceModelDeployer 2. Token security: Use SecretField for token to mark it as sensitive, following the same pattern as HuggingFaceModelDeployerConfig 3. Secret support: Added secret_name field to fetch tokens from ZenML secrets, providing flexibility like the model deployer 4. Stack validation: Added validator to ensure either token or secret_name is configured before deployment 5. Token retrieval: Implemented _get_token() method with priority: config.token > secret_name > HF_TOKEN env var All class names, patterns, and conventions now match the existing Hugging Face integration for consistency and maintainability.

The flavor name should be 'huggingface' not 'huggingface-spaces'. Model deployer and deployer are different component types so there's no naming collision.

Fixed all linting issues found by scripts/lint.sh and scripts/docstring.sh: 1. F821 - Added HfApi to TYPE_CHECKING imports to fix undefined name error 2. DAR302 - Removed 'Yields' section from do_get_deployment_state_logs since it only raises an exception 3. DAR401 - Added Exception to Raises section in do_deprovision_deployment to document the re-raised exception All linting checks now pass.

src/zenml/integrations/huggingface/__init__.py

- Created comprehensive documentation for the Hugging Face deployer in docs/book/component-guide/deployers/huggingface.md - Added entry to deployer table of contents (toc.md) - Updated deployers README.md with Hugging Face deployer information - Documented important limitation about Docker image accessibility and private registries - Included configuration examples, settings, and usage instructions

github-actions · 2025-11-02T19:45:42Z

Documentation Link Check Results

✅ Absolute links check passed
✅ Relative links check passed
_{Last checked: 2025-11-03 11:09:42 UTC}

## Major Changes ### 1. Added Deployment Server Entrypoint - Dockerfiles now include proper ENTRYPOINT and CMD instructions - Uses DeploymentEntrypointConfiguration to generate correct startup command - Deployment server now starts automatically with deployment ID parameter ### 2. Implemented Two-Mode Deployment System #### Mode 1: Image Reference (with container registry) - References pre-built Docker image from container registry - Lightweight Dockerfile with FROM, ENV, USER, ENTRYPOINT, CMD - Image must be publicly accessible (documented limitation) #### Mode 2: Full Build (without container registry) - Builds complete image from scratch in Hugging Face Spaces - Uploads source code, requirements.txt, and Dockerfile to Space - Generates full Dockerfile with dependency installation - **Solves private registry authentication problem!** ### 3. Implementation Details - Added _get_entrypoint_and_command() helper method - Added _generate_image_reference_dockerfile() for Mode 1 - Added _generate_full_build_dockerfile() for Mode 2 - Added _get_requirements_for_deployment() to gather dependencies - Modified do_provision_deployment() to detect stack configuration and choose mode - Mode selection based on stack.container_registry presence ### 4. Documentation Updates - Replaced "Important Limitations" section with "Deployment Modes" - Documented both deployment modes with use cases - Added clear workarounds for private registry issues - Provided examples for stack configuration ## Benefits - Fixes missing entrypoint issue - deployments can now start properly - Provides solution for private registry problem via full-build mode - Maintains backward compatibility for users with public registries - Automatic mode selection based on stack configuration

Replace custom _generate_full_build_dockerfile implementation with ZenML's internal PipelineDockerImageBuilder._generate_zenml_pipeline_dockerfile method to: - Eliminate code duplication - Ensure consistency with how ZenML builds Docker images elsewhere - Automatically benefit from future improvements to internal method - Reduce maintenance burden Changes: - Import PipelineDockerImageBuilder and json module - Modify _generate_full_build_dockerfile to call internal method - Create requirements_files in format expected by internal method - Merge environment/secrets into docker_settings.environment - Internal method handles: FROM, WORKDIR, ENV, apt packages, requirements, COPY, USER - Manually append ENTRYPOINT and CMD using json.dumps() for exec form - Update docstring to document use of internal method Benefits: - ~50 lines of code eliminated - Consistent with ZenML patterns (python_package_installer, installer_args, etc.) - Proper handling of docker_settings.local_project_install_command - Correct WORKDIR and permissions setup

Based on testing feedback, made the following critical fixes: ## Code Changes 1. **Fixed ENTRYPOINT/CMD Serialization (Line 217-218)** - Changed from str() to json.dumps() for proper Docker exec form - Before: str(entrypoint) → ['python', ...] (single quotes, invalid) - After: json.dumps(entrypoint) → ["python", ...] (double quotes, valid) - Fixes: "/bin/sh: 1: [python,: not found" error 2. **Fixed Default Port (Line 78, 107)** - Changed app_port default from 7860 to 8000 - 7860 is HF Spaces default, but ZenML server runs on 8000 - Updated both code and documentation 3. **Added Container Registry Requirement to Validator (Lines 102-140)** - Stack validator now requires container registry - Prevents deployment without pre-built image - Clear error message explains requirement 4. **Removed Full Build Mode (Lines 250-376 deleted)** - Deleted _generate_full_build_dockerfile method - Deleted _get_requirements_for_deployment method - Simplified do_provision_deployment to single mode - Removed unused imports (source_utils, PipelineDockerImageBuilder) - Full build mode caused issues: * Uploaded entire codebase (inefficient) * Configuration problems * Recommended to always use container registry ## Documentation Changes 1. **Updated Port Documentation** - Changed default from 7860 to 8000 in docs - Added note that 8000 is ZenML server default 2. **Removed Deployment Modes Section** - Deleted entire two-mode explanation - Simplified to single-mode operation 3. **Added Important Requirements Section** - Container Registry Requirement subsection - Explains why public access is needed - Lists recommended registries (Docker Hub, GHCR) - Example setup with GitHub Container Registry 4. **Added X-Frame-Options Configuration Section** - Documents iframe embedding issue - Provides complete code example - Shows DeploymentSettings with SecureHeadersConfig - Explains: xfo=False disables X-Frame-Options header - Without this, HF Spaces shows blank page ## Summary of Fixes ✅ ENTRYPOINT/CMD serialization (json.dumps) ✅ Default port 7860 → 8000 ✅ Container registry now required by validator ✅ Full build mode removed ✅ Documentation updated with requirements ✅ X-Frame-Options configuration documented These changes address all issues discovered during testing.

strickvl

One thing that's probably not ideal is that (esp in the context of setting these as public by default) we write the secrets to the Dockerfile which then means your HF tokens (and whatever else) is basically leaked / in public the moment you use our deployer. I think HfApi.add_space_secret(repo_id, key, value) and HfApi.add_space_variable(repo_id, key, value) are the preferred ways of handling secrets / env vars with spaces, which we should use instead of injecting them into the Dockerfile.

Otherwise a bunch of smaller comments below.

strickvl · 2025-11-03T08:56:01Z

docs/book/component-guide/deployers/huggingface.md

+
+* Hugging Face Spaces-specific settings:
+
+  * `space_hardware` (default: `None`): Hardware tier for the Space (e.g., `'cpu-basic'`, `'cpu-upgrade'`, `'t4-small'`, `'t4-medium'`, `'a10g-small'`, `'a10g-large'`). If not specified, uses free CPU tier.


Maybe reference this page? https://huggingface.co/docs/hub/spaces-gpus

strickvl · 2025-11-03T08:56:27Z

docs/book/component-guide/deployers/huggingface.md

+
+  * `space_hardware` (default: `None`): Hardware tier for the Space (e.g., `'cpu-basic'`, `'cpu-upgrade'`, `'t4-small'`, `'t4-medium'`, `'a10g-small'`, `'a10g-large'`). If not specified, uses free CPU tier.
+  * `space_storage` (default: `None`): Persistent storage tier for the Space (e.g., `'small'`, `'medium'`, `'large'`). If not specified, no persistent storage is allocated.
+  * `private` (default: `False`): Whether to create the Space as private. Public Spaces are visible to everyone.


Shouldn't we default to private?

@strickvl i dont think so actually... i think most of the time its a demo or something?

Perhaps. Personally I'd always rather something defaulted to private and then allowed me to turn it on / set it to public whenever I want. That way at least you avoid having leaked something you didn't want to.

strickvl · 2025-11-03T09:00:04Z

docs/book/component-guide/deployers/huggingface.md

+- Source code and dependencies uploadable to Hugging Face
+
+**Use when:**
+- You don't want to manage a container registry


earlier in the docs you said that a remote container registry was required

strickvl · 2025-11-03T09:00:38Z

docs/book/component-guide/deployers/README.md

 | [Docker](docker.md)                | `docker`   | Built-in      | Deploys pipelines as locally running Docker containers                                |
 | [GCP Cloud Run](gcp-cloud-run.md)            | `gcp`     | `gcp`         | Deploys pipelines to Google Cloud Run for serverless execution             |
 | [AWS App Runner](aws-app-runner.md)           | `aws`     | `aws`         | Deploys pipelines to AWS App Runner for serverless execution                       |
+| [Hugging Face](huggingface.md)                | `huggingface` | `huggingface` | Deploys pipelines to Hugging Face Spaces as Docker applications            |


src/zenml/integrations/huggingface/flavors/huggingface_deployer_flavor.py

strickvl · 2025-11-03T09:17:39Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                        ),
+                    )
+                except Exception as e:
+                    logger.warning(f"Failed to set hardware {hardware}: {e}")


Should we actually fail here instead of just logging a warning?

strickvl · 2025-11-03T09:18:10Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+        stack: "Stack",
+        environment: Dict[str, str],
+        secrets: Dict[str, str],
+        timeout: int,


We never actually use this... let's use it or get rid of it.

We need to keep it i think for the base deployer

Maybe just log a warning message that it isn't used, in case someone sets it?

im following the lead here, perhaps i can just add add it in the docstring https://github.com/zenml-io/zenml/blob/claude/huggingface-deployer-011CUj51UperMPAytgCheWG1/src/zenml/deployers/local/local_deployer.py#L205

Adding a warning would pollute the logs

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

strickvl · 2025-11-03T09:21:50Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+            The Space ID in format 'username/space-name'.
+        """
+        api = self._get_hf_api()
+        username = api.whoami()["name"]


does this work for an organization? def works for a normal user, but does it work if you want to deploy to the ZenML org on HF Spaces? Not sure it would.

strickvl · 2025-11-03T09:24:47Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                api.space_info(space_id)
+                logger.info(f"Updating existing Space: {space_id}")


I think that if someone changes the visibility of the space (from private to public or vice-versa) then it wouldn't actually make those changes with the way the code is currently written, so we should make sure to handle that.

## Security Issue Previously, environment variables and secrets were written directly into the Dockerfile using ENV instructions. This meant that: - ❌ ALL secrets were exposed in the Dockerfile - ❌ Anyone with access to the Space could view credentials - ❌ Especially dangerous since Spaces are PUBLIC by default (private: False) - ❌ Secrets were permanently in the repository history This was a critical security vulnerability that could leak API keys, database credentials, and other sensitive information. ## Security Fix ### Code Changes (huggingface_deployer.py) 1. **Updated _generate_image_reference_dockerfile (Lines 217-247)** - Removed environment and secrets parameters - No longer generates ENV lines in Dockerfile - Added security note in docstring - Dockerfile now only contains: FROM, USER, ENTRYPOINT, CMD 2. **Updated do_provision_deployment (Lines 312-363)** - Changed method call: removed environment, secrets args - Added secure environment variable handling (lines 337-350): * Uses api.add_space_variable() for each environment variable * Variables stored securely by Hugging Face * Not exposed in Dockerfile or repository - Added secure secrets handling (lines 352-363): * Uses api.add_space_secret() for each secret * Secrets encrypted and never exposed * Safe even with public Spaces ### Documentation Changes (huggingface.md) Added "Secure Secrets and Environment Variables" section (Lines 158-175): - Explains security approach with success hint - Documents use of HF Space Secrets/Variables API - Clarifies that nothing is written to Dockerfile - Emphasizes safety with public Spaces (the default) - Lists security benefits with checkmarks ## Security Improvements ✅ Secrets encrypted and never exposed in repository ✅ Environment variables managed through secure API ✅ No credentials in Dockerfile ✅ Safe to use with public Spaces (default behavior) ✅ No risk of credential leakage even if Space is public ✅ Follows security best practices ## How It Works Now 1. Dockerfile is generated WITHOUT any ENV instructions 2. After uploading Dockerfile, deployer calls: - `api.add_space_variable(repo_id, key, value)` for each env var - `api.add_space_secret(repo_id, key, value)` for each secret 3. Hugging Face injects these at runtime securely 4. No credentials ever appear in repository files This approach is the recommended way to handle secrets/env vars with Hugging Face Spaces according to their API documentation.

- Added link to https://huggingface.co/docs/hub/spaces-gpus for space_hardware setting - Helps users understand available GPU options and pricing - Addresses documentation feedback

Implements 8 improvements from code review: 1. Add organization support for deploying to HF organizations - Added 'organization' config parameter to HuggingFaceDeployerConfig - Updated _get_space_id() to check organization before falling back to username 2. Add space name length validation - Added HF_SPACE_NAME_MAX_LENGTH constant (96 chars) - Validate space name length and raise DeployerError if exceeded 3. Handle space visibility updates - Check if space_info.private != settings.private when updating - Call api.update_repo_visibility() to apply visibility changes 4. Fail on invalid hardware/storage instead of warning - Changed hardware/storage errors to raise DeploymentProvisionError - Added clear error messages with documentation links 5. Use proper error types for 404 detection - Import HfHubHTTPError from huggingface_hub.utils - Check e.response.status_code == 404 instead of string matching 6. Document timeout parameter in deprovision - Added docstring note that timeout is unused (deletion is immediate) 7. Remove unused space_exists variable - Removed space_exists assignments to fix lint error 8. Update documentation terminology - Changed "Docker applications" to "Docker Spaces" in deployers README All changes maintain backward compatibility and improve code quality.

The secret_name parameter was redundant since token is already a SecretField that supports ZenML's secret reference syntax ({{secret.key}}). Changes: - Removed secret_name from HuggingFaceDeployerConfig - Simplified _get_token() to just return config.token or environment variable - Updated validator to only check for token (not token or secret_name) - Updated documentation to show proper secret reference syntax: {{hf_token.token}} - Removed unused Client import (auto-removed by formatter) This simplifies the API and follows ZenML's standard pattern for secret handling.

Added detailed descriptions to all config fields following ZenML standards: - Minimum 30 characters - Action-oriented language - Concrete examples with realistic values - Clear format specifications and constraints Field descriptions added: - token: Authentication, secret syntax, permissions, example - organization: Purpose, example URL, permission requirements - space_hardware: Options with specs, GPU tiers, documentation link - space_storage: Tiers with sizes, persistence behavior - space_prefix: Purpose, naming example, length constraint All descriptions include practical examples and relevant documentation links to help users configure the deployer correctly.

Changed the default value of `private` parameter from False to True to follow security best practices. Private by default prevents accidental exposure of deployment information. Changes: - Set private=True as default in HuggingFaceDeployerSettings - Updated docstring to indicate default is True for security - Updated documentation to reflect new default value - Removed redundant private=True from code example (now default) - Updated security section to mention both private and public Spaces - Clarified that secure secrets handling protects credentials even in public Spaces Users can still explicitly set private=False to make Spaces publicly visible, but the safer default protects users who don't explicitly configure visibility.

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

…yer.py

stefannica · 2025-11-03T10:38:55Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+            owner = api.whoami()["name"]
+
+        # Sanitize deployment name: alphanumeric, hyphens, underscores only
+        sanitized = re.sub(r"[^a-zA-Z0-9\-_]", "-", deployment.name).lower()


I recommend you use at least some characters from the deployment UUID as a suffix, to make these more unique. See the existing deployer implementations for a similar pattern.

Reason: the names are unique at the level of the project

strickvl · 2025-11-03T10:54:47Z

docs/book/component-guide/deployers/huggingface.md


 * Pass the token directly when registering the deployer using the `--token` parameter
-* (recommended) Store the token in a ZenML secret and reference it using the `--secret_name` parameter
+* (recommended) Store the token in a ZenML secret and reference it using secret syntax


Maybe link to the part of the docs where we describe what this is + how to use?

strickvl · 2025-11-03T10:58:09Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                        f"Updating Space visibility: "
+                        f"{'private' if settings.private else 'public'}"
+                    )
+                    api.update_repo_visibility(


From the HF docs: "update_repo_visibility method has been removed. Please use update_repo_settings instead." (https://huggingface.co/docs/huggingface_hub/v1.0.0.rc7/en/concepts/migration#other-deprecated-features)

stefannica · 2025-11-03T11:03:51Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                        private=settings.private,
+                        repo_type="space",
+                    )
+            except Exception:


This exception is too broad. Isn't there a more concrete exception that the HF API throws to tell you that the space you are fetching with api.space_info(space_id) is not found ?

stefannica · 2025-11-03T11:07:14Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                    logger.warning(
+                        f"Failed to set environment variable {key}: {e}"
+                    )


This should be an error. If you can't set the env vars, then the deployment will fail anyway. Better to fail early.

stefannica · 2025-11-03T11:07:43Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                        value=value,
+                    )
+                except Exception as e:
+                    logger.warning(f"Failed to set secret {key}: {e}")


Similarly here: this should be an error. If you can't set the secrets (required to connect to the ZenML server), then the deployment will fail anyway. Better to fail early.

stefannica · 2025-11-03T11:10:30Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+            runtime = api.get_space_runtime(repo_id=space_id)
+
+            # Map Space stage to deployment status
+            if runtime.stage in ["RUNNING", "RUNNING_BUILDING"]:


What is this RUNNING_BUILDING stage ? Feels more like a PENDING deployment state than a RUNNING one. RUNNING should only be returned when the deployment is fully provisioned and ready to be health-checked (i.e. when the health endpoint is available).

stefannica · 2025-11-03T11:11:23Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+            return DeploymentOperationalState(
+                status=status,
+                url=f"https://huggingface.co/spaces/{space_id}",
+                metadata={"space_id": space_id},


You should also store the external state in the metadata, to help with debugging.

stefannica · 2025-11-03T11:15:01Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+            if e.response.status_code == 404:
+                logger.info(f"Space {space_id} already deleted")
+                return None
+            raise DeploymentNotFoundError(


Suggested change

raise DeploymentNotFoundError(

raise DeploymentDeprovisionError(

DeploymentNotFoundError is the same as saying "the resources have been deleted in the back-end", which is not the case here. Raising DeploymentDeprovisionError signals to the core functionality that something is wrong in the back-end and that the user should either check the back-end for errors or force-delete the deployment and risk leaving orphaned resources behind.

stefannica · 2025-11-03T11:17:21Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                from huggingface_hub import SpaceHardware
+
+                try:
+                    api.request_space_hardware(


Have you tried changing the hardware requirements for an existing deployment and then updating ? Doesn't calling this multiple times for the same HF space keep adding more hardware on the same space ? I.e. is this a "replace hardware resources" call or an "add hardware resources" call ?

stefannica · 2025-11-03T11:17:46Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+                from huggingface_hub import SpaceStorage
+
+                try:
+                    api.request_space_storage(


Same here: is this a "replace storage resources" call or an "add storage resources" call ?

stefannica · 2025-11-03T11:19:27Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+            logger.info(f"Setting {len(secrets)} secrets...")
+            for key, value in secrets.items():
+                try:
+                    api.add_space_secret(


Will calling this a second, third etc. time overwrite the previous values ? Maybe it's the function name that just sends the wrong message - adding something to the space feels like an additive operation, not a replacement one.

stefannica · 2025-11-03T11:22:52Z

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py

+        )
+
+        api = self._get_hf_api()
+        space_id = self._get_space_id(deployment)


There's a corner-case here that you haven't addressed: if there's an existing space ID stored in deployment.deployment_metadata["space_id"] that doesn't match the new space ID, for example because the deployment was renamed or the prefix settings changed. When this happens, you could simply call do_deprovision_deployment to clean up the previous space.

claude added 4 commits November 2, 2025 16:41

htahir1 added staging-workspace backend Implementations related to the backend labels Nov 2, 2025

github-actions bot added the internal To filter out internal PRs and issues label Nov 2, 2025

claude added 3 commits November 2, 2025 16:46

Change deployer flavor name to 'huggingface'

546eb96

The flavor name should be 'huggingface' not 'huggingface-spaces'. Model deployer and deployer are different component types so there's no naming collision.

graphite-app bot reviewed Nov 2, 2025

View reviewed changes

src/zenml/integrations/huggingface/__init__.py Show resolved Hide resolved

htahir1 requested review from stefannica and strickvl November 2, 2025 20:59

claude added 3 commits November 3, 2025 07:30

strickvl requested changes Nov 3, 2025

View reviewed changes

claude added 6 commits November 3, 2025 10:06

Add reference to HF Spaces GPU documentation

3d3d317

- Added link to https://huggingface.co/docs/hub/spaces-gpus for space_hardware setting - Helps users understand available GPU options and pricing - Addresses documentation feedback

htahir1 commented Nov 3, 2025

View reviewed changes

src/zenml/integrations/huggingface/deployers/huggingface_deployer.py Outdated Show resolved Hide resolved

Update src/zenml/integrations/huggingface/deployers/huggingface_deplo…

1d84d2e

…yer.py

htahir1 requested a review from strickvl November 3, 2025 10:34

stefannica reviewed Nov 3, 2025

View reviewed changes

strickvl requested changes Nov 3, 2025

View reviewed changes

stefannica requested changes Nov 3, 2025

View reviewed changes


		* Hugging Face Spaces-specific settings:

		* `space_hardware` (default: `None`): Hardware tier for the Space (e.g., `'cpu-basic'`, `'cpu-upgrade'`, `'t4-small'`, `'t4-medium'`, `'a10g-small'`, `'a10g-large'`). If not specified, uses free CPU tier.

	\| [Hugging Face](huggingface.md) \| `huggingface` \| `huggingface` \| Deploys pipelines to Hugging Face Spaces as Docker applications \|
	\| [Hugging Face](huggingface.md) \| `huggingface` \| `huggingface` \| Deploys pipelines to Hugging Face Spaces as Docker Spaces \|

		api.space_info(space_id)
		logger.info(f"Updating existing Space: {space_id}")

	raise DeploymentNotFoundError(
	raise DeploymentDeprovisionError(

Huggingface deployer #4119

Are you sure you want to change the base?

Huggingface deployer #4119

Conversation

htahir1 commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe changes

Pre-requisites

Types of changes

Uh oh!

CLAassistant commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Documentation Link Check Results

Uh oh!

strickvl left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

htahir1 commented Nov 2, 2025 •

edited

Loading

CLAassistant commented Nov 2, 2025 •

edited

Loading

github-actions bot commented Nov 2, 2025 •

edited

Loading