feat(ai): Add prefer_in_cloud option for inference mode #9236

hsubox76 · 2025-09-02T18:07:11Z

Add prefer_in_cloud option - opposite of prefer_on_device.

Note: Used gemini-cli

Had it add tests to generative-model.test.ts instead of on generateContent/generateContentStream/countTokens for all 4 InferenceMode cases because I think the more of the pipeline it tests, the better it is - a lot of errors come from omitting or incorrectly passing params through multiple functions.

Had it extract the logic for on-device vs cloud into a helper function in helpers.ts even though it is only used in generate-content.ts for now, because we'll need to re-use the same logic for countTokens when we implement that (Chrome API should be ready now).

Docs review: Only need to review docs-devsite/ai.md.

This change introduces a new InferenceMode option, prefer_in_cloud. When this mode is selected, the SDK will attempt to use the cloud backend first. If the cloud call fails with a network-related error, it will fall back to the on-device model if available.

changeset-bot · 2025-09-02T18:07:15Z

🦋 Changeset detected

Latest commit: 337d34c

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 2 packages

Name	Type
@firebase/ai	Minor
firebase	Minor

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

google-oss-bot · 2025-09-02T18:16:43Z

Size Report ¹

Affected Products

@firebase/ai
Type Base (a4848b4) Merge (3d413d7) Diff
browser 61.3 kB 62.2 kB +921 B (+1.5%)
main 64.8 kB 65.7 kB +921 B (+1.4%)
module 61.3 kB 62.2 kB +921 B (+1.5%)
firebase
Type Base (a4848b4) Merge (3d413d7) Diff
firebase-ai.js 48.4 kB 49.1 kB +674 B (+1.4%)

Test Logs

https://storage.googleapis.com/firebase-sdk-metric-reports/iAFF3hnDYw.html

google-oss-bot · 2025-09-02T18:35:42Z

Size Analysis Report ¹

Affected Products

`@firebase/ai`

AIError
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.54 kB 6.57 kB +34 B (+0.5%)
size-with-ext-deps 24.1 kB 24.1 kB +34 B (+0.1%)
AIErrorCode
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.54 kB 6.62 kB +76 B (+1.2%)
size-with-ext-deps 24.1 kB 24.2 kB +79 B (+0.3%)
AIModel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.20 kB 8.24 kB +34 B (+0.4%)
size-with-ext-deps 25.8 kB 25.9 kB +34 B (+0.1%)
AnyOfSchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.86 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
ArraySchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.86 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
Backend
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.54 kB 6.57 kB +34 B (+0.5%)
size-with-ext-deps 24.1 kB 24.1 kB +34 B (+0.1%)
BackendType
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.54 kB 6.58 kB +34 B (+0.5%)
size-with-ext-deps 24.1 kB 24.1 kB +34 B (+0.1%)
BlockReason
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.65 kB 6.68 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.3 kB +34 B (+0.1%)
BooleanSchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.87 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)

`ChatSession`

Size

Type	Base (`a4848b4`)	Merge (`3d413d7`)	Diff
size	21.0 kB	21.6 kB	+551 B (+2.6%)
size-with-ext-deps	38.7 kB	39.3 kB	+563 B (+1.5%)

Dependency

Type Base (a4848b4) Merge (3d413d7) Diff

functions

33 dependencies

addHelpers
aggregateResponses
assignRoleToPartsAndValidateSendMessageRequest
chromeAdapterFactory
constructRequest
createEnhancedContentResponse
decodeInstanceIdentifier
factory
formatBlockErrorMessage
formatNewContent
generateContent
generateContentOnCloud
generateContentStream
generateContentStreamOnCloud
generateResponseSequence
getClientHeaders
getFunctionCalls
getHeaders
getInlineDataParts
getResponsePromise
getResponseStream
getText
hadBadFinishReason
hasValidCandidates
makeRequest
mapGenerateContentCandidates
mapGenerateContentRequest
mapGenerateContentResponse
mapPromptFeedback
processGenerateContentResponse
processStream
registerAI
validateChatHistory

34 dependencies

addHelpers
aggregateResponses
assignRoleToPartsAndValidateSendMessageRequest
callCloudOrDevice
chromeAdapterFactory
constructRequest
createEnhancedContentResponse
decodeInstanceIdentifier
factory
formatBlockErrorMessage
formatNewContent
generateContent
generateContentOnCloud
generateContentStream
generateContentStreamOnCloud
generateResponseSequence
getClientHeaders
getFunctionCalls
getHeaders
getInlineDataParts
getResponsePromise
getResponseStream
getText
hadBadFinishReason
hasValidCandidates
makeRequest
mapGenerateContentCandidates
mapGenerateContentRequest
mapGenerateContentResponse
mapPromptFeedback
processGenerateContentResponse
processStream
registerAI
validateChatHistory

+ callCloudOrDevice

variables

24 dependencies

AIErrorCode
AI_TYPE
Availability
BackendType
DEFAULT_API_VERSION
DEFAULT_DOMAIN
DEFAULT_FETCH_TIMEOUT_MS
DEFAULT_LOCATION
FinishReason
HarmSeverity
InferenceMode
LANGUAGE_TAG
PACKAGE_VERSION
POSSIBLE_ROLES
SILENT_ERROR
Task
VALID_PARTS_PER_ROLE
VALID_PART_FIELDS
VALID_PREVIOUS_CONTENT_ROLES
badFinishReasons
logger
name
responseLineRE
version

25 dependencies

AIErrorCode
AI_TYPE
Availability
BackendType
DEFAULT_API_VERSION
DEFAULT_DOMAIN
DEFAULT_FETCH_TIMEOUT_MS
DEFAULT_LOCATION
FinishReason
HarmSeverity
InferenceMode
LANGUAGE_TAG
PACKAGE_VERSION
POSSIBLE_ROLES
SILENT_ERROR
Task
VALID_PARTS_PER_ROLE
VALID_PART_FIELDS
VALID_PREVIOUS_CONTENT_ROLES
badFinishReasons
errorsCausingFallback
logger
name
responseLineRE
version

+ errorsCausingFallback

FinishReason
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.79 kB 6.83 kB +34 B (+0.5%)
size-with-ext-deps 24.4 kB 24.4 kB +34 B (+0.1%)
FunctionCallingMode
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.60 kB 6.63 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.2 kB +34 B (+0.1%)

`GenerativeModel`

Size

Type	Base (`a4848b4`)	Merge (`3d413d7`)	Diff
size	24.7 kB	25.3 kB	+596 B (+2.4%)
size-with-ext-deps	42.5 kB	43.1 kB	+611 B (+1.4%)

Dependency

Type Base (a4848b4) Merge (3d413d7) Diff

functions

38 dependencies

addHelpers
aggregateResponses
assignRoleToPartsAndValidateSendMessageRequest
chromeAdapterFactory
constructRequest
countTokens
countTokensOnCloud
createEnhancedContentResponse
decodeInstanceIdentifier
factory
formatBlockErrorMessage
formatGenerateContentInput
formatNewContent
formatSystemInstruction
generateContent
generateContentOnCloud
generateContentStream
generateContentStreamOnCloud
generateResponseSequence
getClientHeaders
getFunctionCalls
getHeaders
getInlineDataParts
getResponsePromise
getResponseStream
getText
hadBadFinishReason
hasValidCandidates
makeRequest
mapCountTokensRequest
mapGenerateContentCandidates
mapGenerateContentRequest
mapGenerateContentResponse
mapPromptFeedback
processGenerateContentResponse
processStream
registerAI
validateChatHistory

39 dependencies

addHelpers
aggregateResponses
assignRoleToPartsAndValidateSendMessageRequest
callCloudOrDevice
chromeAdapterFactory
constructRequest
countTokens
countTokensOnCloud
createEnhancedContentResponse
decodeInstanceIdentifier
factory
formatBlockErrorMessage
formatGenerateContentInput
formatNewContent
formatSystemInstruction
generateContent
generateContentOnCloud
generateContentStream
generateContentStreamOnCloud
generateResponseSequence
getClientHeaders
getFunctionCalls
getHeaders
getInlineDataParts
getResponsePromise
getResponseStream
getText
hadBadFinishReason
hasValidCandidates
makeRequest
mapCountTokensRequest
mapGenerateContentCandidates
mapGenerateContentRequest
mapGenerateContentResponse
mapPromptFeedback
processGenerateContentResponse
processStream
registerAI
validateChatHistory

+ callCloudOrDevice

variables

24 dependencies

AIErrorCode
AI_TYPE
Availability
BackendType
DEFAULT_API_VERSION
DEFAULT_DOMAIN
DEFAULT_FETCH_TIMEOUT_MS
DEFAULT_LOCATION
FinishReason
HarmSeverity
InferenceMode
LANGUAGE_TAG
PACKAGE_VERSION
POSSIBLE_ROLES
SILENT_ERROR
Task
VALID_PARTS_PER_ROLE
VALID_PART_FIELDS
VALID_PREVIOUS_CONTENT_ROLES
badFinishReasons
logger
name
responseLineRE
version

25 dependencies

AIErrorCode
AI_TYPE
Availability
BackendType
DEFAULT_API_VERSION
DEFAULT_DOMAIN
DEFAULT_FETCH_TIMEOUT_MS
DEFAULT_LOCATION
FinishReason
HarmSeverity
InferenceMode
LANGUAGE_TAG
PACKAGE_VERSION
POSSIBLE_ROLES
SILENT_ERROR
Task
VALID_PARTS_PER_ROLE
VALID_PART_FIELDS
VALID_PREVIOUS_CONTENT_ROLES
badFinishReasons
errorsCausingFallback
logger
name
responseLineRE
version

+ errorsCausingFallback

GoogleAIBackend
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.55 kB 6.58 kB +34 B (+0.5%)
size-with-ext-deps 24.1 kB 24.2 kB +34 B (+0.1%)
HarmBlockMethod
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.60 kB 6.64 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.2 kB +34 B (+0.1%)
HarmBlockThreshold
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.72 kB 6.75 kB +34 B (+0.5%)
size-with-ext-deps 24.3 kB 24.3 kB +34 B (+0.1%)
HarmCategory
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.79 kB 6.83 kB +34 B (+0.5%)
size-with-ext-deps 24.4 kB 24.4 kB +34 B (+0.1%)
HarmProbability
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.62 kB 6.65 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.2 kB +34 B (+0.1%)
HarmSeverity
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.78 kB 6.82 kB +34 B (+0.5%)
size-with-ext-deps 24.4 kB 24.4 kB +34 B (+0.1%)
ImagenAspectRatio
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.66 kB 6.69 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.3 kB +34 B (+0.1%)
ImagenImageFormat
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.82 kB 6.86 kB +34 B (+0.5%)
size-with-ext-deps 24.4 kB 24.4 kB +34 B (+0.1%)
ImagenModel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 13.2 kB 13.2 kB +34 B (+0.3%)
size-with-ext-deps 30.8 kB 30.9 kB +34 B (+0.1%)
ImagenPersonFilterLevel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.64 kB 6.67 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.2 kB +34 B (+0.1%)
ImagenSafetyFilterLevel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.71 kB 6.75 kB +34 B (+0.5%)
size-with-ext-deps 24.3 kB 24.3 kB +34 B (+0.1%)
InferenceMode
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.55 kB 6.58 kB +34 B (+0.5%)
size-with-ext-deps 24.1 kB 24.1 kB +34 B (+0.1%)
IntegerSchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.87 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
LiveGenerativeModel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 12.6 kB 12.7 kB +34 B (+0.3%)
size-with-ext-deps 30.3 kB 30.3 kB +34 B (+0.1%)
LiveResponseType
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.66 kB 6.69 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.3 kB +34 B (+0.1%)
LiveSession
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 9.05 kB 9.08 kB +34 B (+0.4%)
size-with-ext-deps 26.6 kB 26.7 kB +34 B (+0.1%)
Modality
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.67 kB 6.70 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.3 kB +34 B (+0.1%)
NumberSchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.87 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
ObjectSchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.87 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
POSSIBLE_ROLES
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.59 kB 6.63 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.2 kB +34 B (+0.1%)
ResponseModality
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.60 kB 6.63 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.2 kB +34 B (+0.1%)
Schema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.86 kB 8.89 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
SchemaType
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.65 kB 6.69 kB +34 B (+0.5%)
size-with-ext-deps 24.2 kB 24.3 kB +34 B (+0.1%)
StringSchema
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 8.87 kB 8.90 kB +34 B (+0.4%)
size-with-ext-deps 26.5 kB 26.5 kB +34 B (+0.1%)
VertexAIBackend
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 6.55 kB 6.58 kB +34 B (+0.5%)
size-with-ext-deps 24.1 kB 24.2 kB +34 B (+0.1%)
getAI
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 7.00 kB 7.04 kB +34 B (+0.5%)
size-with-ext-deps 31.7 kB 31.7 kB +34 B (+0.1%)

`getGenerativeModel`

Size

Type	Base (`a4848b4`)	Merge (`3d413d7`)	Diff
size	25.1 kB	25.7 kB	+596 B (+2.4%)
size-with-ext-deps	42.8 kB	43.5 kB	+611 B (+1.4%)

Dependency

Type Base (a4848b4) Merge (3d413d7) Diff

functions

39 dependencies

addHelpers
aggregateResponses
assignRoleToPartsAndValidateSendMessageRequest
chromeAdapterFactory
constructRequest
countTokens
countTokensOnCloud
createEnhancedContentResponse
decodeInstanceIdentifier
factory
formatBlockErrorMessage
formatGenerateContentInput
formatNewContent
formatSystemInstruction
generateContent
generateContentOnCloud
generateContentStream
generateContentStreamOnCloud
generateResponseSequence
getClientHeaders
getFunctionCalls
getGenerativeModel
getHeaders
getInlineDataParts
getResponsePromise
getResponseStream
getText
hadBadFinishReason
hasValidCandidates
makeRequest
mapCountTokensRequest
mapGenerateContentCandidates
mapGenerateContentRequest
mapGenerateContentResponse
mapPromptFeedback
processGenerateContentResponse
processStream
registerAI
validateChatHistory

40 dependencies

addHelpers
aggregateResponses
assignRoleToPartsAndValidateSendMessageRequest
callCloudOrDevice
chromeAdapterFactory
constructRequest
countTokens
countTokensOnCloud
createEnhancedContentResponse
decodeInstanceIdentifier
factory
formatBlockErrorMessage
formatGenerateContentInput
formatNewContent
formatSystemInstruction
generateContent
generateContentOnCloud
generateContentStream
generateContentStreamOnCloud
generateResponseSequence
getClientHeaders
getFunctionCalls
getGenerativeModel
getHeaders
getInlineDataParts
getResponsePromise
getResponseStream
getText
hadBadFinishReason
hasValidCandidates
makeRequest
mapCountTokensRequest
mapGenerateContentCandidates
mapGenerateContentRequest
mapGenerateContentResponse
mapPromptFeedback
processGenerateContentResponse
processStream
registerAI
validateChatHistory

+ callCloudOrDevice

variables

25 dependencies

AIErrorCode
AI_TYPE
Availability
BackendType
DEFAULT_API_VERSION
DEFAULT_DOMAIN
DEFAULT_FETCH_TIMEOUT_MS
DEFAULT_HYBRID_IN_CLOUD_MODEL
DEFAULT_LOCATION
FinishReason
HarmSeverity
InferenceMode
LANGUAGE_TAG
PACKAGE_VERSION
POSSIBLE_ROLES
SILENT_ERROR
Task
VALID_PARTS_PER_ROLE
VALID_PART_FIELDS
VALID_PREVIOUS_CONTENT_ROLES
badFinishReasons
logger
name
responseLineRE
version

26 dependencies

AIErrorCode
AI_TYPE
Availability
BackendType
DEFAULT_API_VERSION
DEFAULT_DOMAIN
DEFAULT_FETCH_TIMEOUT_MS
DEFAULT_HYBRID_IN_CLOUD_MODEL
DEFAULT_LOCATION
FinishReason
HarmSeverity
InferenceMode
LANGUAGE_TAG
PACKAGE_VERSION
POSSIBLE_ROLES
SILENT_ERROR
Task
VALID_PARTS_PER_ROLE
VALID_PART_FIELDS
VALID_PREVIOUS_CONTENT_ROLES
badFinishReasons
errorsCausingFallback
logger
name
responseLineRE
version

+ errorsCausingFallback

getImagenModel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 13.3 kB 13.3 kB +34 B (+0.3%)
size-with-ext-deps 31.0 kB 31.0 kB +34 B (+0.1%)
getLiveGenerativeModel
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 15.1 kB 15.1 kB +34 B (+0.2%)
size-with-ext-deps 32.8 kB 32.8 kB +34 B (+0.1%)
startAudioConversation
Size
Type Base (a4848b4) Merge (3d413d7) Diff
size 12.6 kB 12.7 kB +34 B (+0.3%)
size-with-ext-deps 30.5 kB 30.5 kB +34 B (+0.1%)

Test Logs

https://storage.googleapis.com/firebase-sdk-metric-reports/9azZ4SL09K.html

This commit adds a new test suite to verify that the GenerativeModel's methods correctly dispatch requests to either the on-device or cloud backends based on the selected InferenceMode. It covers generateContent, generateContentStream, and countTokens.

packages/ai/src/requests/hybrid-helpers.ts

.changeset/feat-prefer-in-cloud.md

packages/ai/src/types/enums.ts

Co-authored-by: Daniel La Rocque <[email protected]>

…rebase-js-sdk into feat/prefer-in-cloud

Co-authored-by: Daniel La Rocque <[email protected]>

…rebase-js-sdk into feat/prefer-in-cloud

.changeset/feat-prefer-in-cloud.md

rachelsaunders · 2025-09-08T19:40:32Z

docs-devsite/ai.md


 <b>(EXPERIMENTAL)</b> Determines whether inference happens on-device or in-cloud.

+<b>PREFER\_ON\_DEVICE:</b> Attempt to make inference calls on-device. If on-device inference is not available, it will fall back to cloud. <br/> <b>ONLY\_ON\_DEVICE:</b> Only attempt to make inference calls on-device. It will not fall back to cloud. If on-device inference is not available, inference methods will throw. <br/> <b>ONLY\_IN\_CLOUD:</b> Only attempt to make inference calls to the cloud. It will not fall back to on-device. <br/> <b>PREFER\_IN\_CLOUD:</b> Attempt to make inference calls to the cloud. If not available, it will fall back to on-device.


Suggested change

PREFER\_ON\_DEVICE: Attempt to make inference calls on-device. If on-device inference is not available, it will fall back to cloud. ONLY\_ON\_DEVICE: Only attempt to make inference calls on-device. It will not fall back to cloud. If on-device inference is not available, inference methods will throw. ONLY\_IN\_CLOUD: Only attempt to make inference calls to the cloud. It will not fall back to on-device. PREFER\_IN\_CLOUD: Attempt to make inference calls to the cloud. If not available, it will fall back to on-device.

PREFER\_ON\_DEVICE: Attempt to make inference calls using an on-device model. If on-device inference is not available, the SDK will fall back to using a cloud-hosted model. ONLY\_ON\_DEVICE: Only attempt to make inference calls using an on-device model. The SDK will not fall back to a cloud-hosted model. If on-device inference is not available, inference methods will throw. ONLY\_IN\_CLOUD: Only attempt to make inference calls using a cloud-hosted model. The SDK will not fall back to an on-device model. PREFER\_IN\_CLOUD: Attempt to make inference calls to a cloud-hosted model. If not available, the SDK will fall back to an on-device model.

Addressed in source code comment and re-generated.

Co-authored-by: rachelsaunders <[email protected]>

…rebase-js-sdk into feat/prefer-in-cloud

Co-authored-by: Daniel La Rocque <[email protected]> Co-authored-by: rachelsaunders <[email protected]>

* Add FDC Integration Tests (#8703) * Upgrade two more `download-artifact` v3 actions to v4 in Firestore workflow (#8706) There were a pair of `download-artifact` GitHub actions that were still set to `v3` instead of `v4`. The `v3` actions were unable to find the build archive in the test-changed-firestore workflow. This PR upgrades those to `v4` actions. * Version Packages (#8705) v11.2.0 Release v11.2.0 Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * chore(deps): replace dependency npm-run-all with npm-run-all2 5.0.0 (#8112) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * Add App Check token to FirebaseServerApp (#8651) FirebaseServerApp now accepts an optional App Check token at initialization. The product SDKs will look for this token, and if it's present, the SDKs will use this value in lieu of calling getToken on App Check. This change affects the following SDKs: Auth, Cloud Functions, Data Connect, Firestore, Realtime Database, Vertex AI * Update rollup-plugin-terser dep (#8709) * chore(deps): update actions/github-script action to v7 (#8711) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * Fireperf web vitals (#8644) * Add support for capturing web vitals metrics in Firebase performance for Web (Largest Contentful Paint, Interaction to Next Paint, Cumulative Layout Shift) * Modifies export to use sendBeacon instead of fetch API, and shifts the upload time to the first time the page is hidden or unloaded. * Update a lot of dependencies (#8716) * chore(config): migrate config renovate.json (#8723) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * chore(deps): update actions/setup-node action to v4 (#8712) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * chore(deps): update dependency firebase-tools to v13 [security] (#8607) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * Have renovate ignore karma-webpack (#8725) * Workflow CI: Update CHROME_VALIDATED_VERSION for Auth tests (#8722) Update our CI's logged `CHROME_VALIDATED_VERSION` to the currently tested version. We haven't updated it in a while. Keeping this up to date should reduce support burden later. * chore(deps): update dependency @rollup/plugin-json to v6 (#8727) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * Update dev dependency firebase-tools 13.6.0 -> 13.29.2 * Discard earliest heartbeat once there are 30 heartbeats (#8724) * Delete earliest heartbeats only once there are 30 heartbeats * Add changeset * Use max heartbeat const in tests * Fix test name in all caps * Fix test names * Formatting * Fix negative offset of reconnect delay (#8718) (#8719) RTDB Fix negative offset of reconnect delay. Fixes #8718. * Swap Performance's owner from jposuna to visumickey (#8737) * Give `check-version` job pull request write permissions (#8743) * Add pull_request write permissions * fix typo * Skip flaky auth recaptcha test (#8753) * Update dependencies in packages and repo-scripts (#8729) * Add ActionCodeSettings.linkDomain and deprecate ActionCodeSettings.dynamicLinkDomain (#8738) * Add linkDomain field to ActionCodeSettings (#8428) * Add linkDomain field to ActionCodeSettings * Update API reports * Update error message for ERROR_INVALID_HOSTING_LINK_DOMAIN to include that default hosting domains cannot be used. * Use constants for test values --------- Co-authored-by: NhienLam <[email protected]> * FDL Deprecation & Hosting link Integration Test (#8603) * Implement Integration test for passwordless email sign-in via firebase-hosting links. * Add license to new test file created' did not match any files * Remove unwanted _ * Add support for setting custom hosting link domain in test app (#8614) * Add afterEach method and apply formatting to the hosting links integration test (#8615) * Mark ActionCodeSettings.dynamicLinkDomain as deprecated * Add changeset * Update API reports * Address review comments * Fix changeset * Update refdocs and address review comments * Update deprecated comment * Add FDL deprecation FAQ link * Fix linkDomain link in refdocs --------- Co-authored-by: NhienLam <[email protected]> Co-authored-by: mansisampat <[email protected]> * FIX: sort strings in UTF-8 encoded byte order (#8691) * Added more granular tracking for web frameworks * fix "TextEncoder undefined" issue in some platforms (#8765) * Version Packages (#8766) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * Adds a "firebase" changeset to release PR as needed (#8770) * Use Node's `TextEncoder` in E2E JSDOM Environment (#8783) * Update Firebase index.d.ts type definition file reference (#8543) Signed-off-by: Emmanuel Ferdman <[email protected]> * Revert the UTF-8 encoding in string sorting (#8782) * Version Packages (#8786) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * Emulator Idempotency: Firestore (#8780) Update `connectFirestoreEmulator` to support its invocation more than once. If the Firestore instance is already in use, and `connectFirestoreEmulator` is invoked with the same configuration, then the invocation will now succeed instead of assert. The implementation takes the Data Connect implementation as inspiration. Data Connect stores the parameters passed to `connectDataConnectEmulator` on the instance of Data Connect itself, so that they can be quickly checked to see if subsequent invocations match. This PR implements a similar storage and compare process with the optional `emulatorOptions` parameter (host and port are already stored). This PR unlocks support for SSR frameworks which render the page numerous times with the same instances of Firestore. Before this PR customers were required to guard against calling `connectFirestoreEmulator` in their SSR logic, which added to code complexity. Now the Firebase SDK does that guarding logic so that our users' apps don't have to. * Emulator Idempotency: Database (#8769) Update the `connectDatabaseEmulator` function to support its invocation more than once. If the Database instance is already in use, and `connectDatabaseEmulator` is invoked with the same configuration, then the invocation will now succeed instead of assert. This unlocks support for web frameworks which may render the page numerous times with the same instances of RTDB. Before this PR customers needed to add extra code to guard against calling `connectDatabaseEmulator` in their SSR logic. Now we do that guarding logic on their behalf which should simplify our customer's apps. Fixes #6824. * SSR related improvements for RC client SDK. (#8699) SSR related improvements for RC client SDK. * Exposes an initialFetchResponse arg that pre-hydrates the client SDK state * Adds an argument that allows setting an alternate template ID to fetch from * Splits storage impls into a simple in-memory version that can be run in SSR contexts * Adds some basic tests for the API functions, which were previously untested * Emulator Idempotency: Auth (#8750) Update the `connectAuthEmulator` function to support its invocation more than once. If the Auth instance is already in use, and `connectAuthEmulator` is invoked with the same configuration, then the invocation will now succeed instead of assert. This unlocks support for web frameworks which may render the page numerous times with the same instances of auth. Before this PR customers needed to add extra code to guard against calling `connectAuthEmulator` in their SSR logic. Now we do that guarding logic on their behalf which should simplify our customer's apps. Fixes #6824. * Upgrade dependencies (#8781) * first batch * @rollup/plugin-node-resolve 13 -> 16 * Publish VertexAI API's tagged with `@beta` (#8788) * Filter empty text parts when streaming (#8736) * Filter empty text parts when streaming * Add changeset * Remove unused filterEmptyTextParts() * Move logic into a function * final * dont throw if mock response file isn't found * update responses version to 6 * Only ignore empty text parts in the aggregated response. * review fixes * Throw in `aggregateResponses` if `newPart` has no properties * Change API Documenter (BETA) annotation to (Public Preview) (#8789) * Change (BETA) annotation to (Public Preview) * format * Minor bump * Create VertexAI Node CJS and ESM bundles (#8728) * Create Node CJS and ESM bundles * Add changeset * Run formatter * register version with node * Catch errors in debug token exchange logic (#8792) * Throw error if mock response file not found (#8739) * Fix vectorvalue docs (#8453) * Fix vectorvalue docs * Use @link vector * Fix vector link with code tag * Re-enable auth redirect tests (#8803) * Re-enable redirect tests * Fix lint * Add Image Generation support using Imagen (#8683) * WIP * Introduce VertexAIModel base class, add documentation, and respond to other comments * Generate devsite docs * revert dataconnect change * formatting * WIP * Introduce VertexAIModel base class, add documentation, and respond to other comments * Generate devsite docs * revert dataconnect change * formatting * Fix ImagenRequestConfig comment * Improve tests * Formatting * Add ImagenGenerationResponse * Formatting * Update to match API changes * fixes * format * Update API reports * add changeset * update * format * Update API reports * remove public docs * Nest imageFormat in outputOptions * remove unecessary console.log * Log warning if compressionQuality outside of range * format * Imagen Documentation (#8776) * Revert "remove public docs" This reverts commit f095563. * update docs * update docs * format * minor fixes * format * mark all APIs beta instead of public * review fixes * format * Replace (BETA) with (Public Preview) in docs * fix links * another pass * format * review fixes * Cleanup * update changeset --------- Co-authored-by: dlarocque <[email protected]> * Add support for token-based usage metrics. (#8757) * Add support for token-based usage metrics. Token measurement is broken down by modaliy, with separate counters for image, audio, etc. Tests are in version 6.*, so this change also includes bumping update_responses.sh * Use right name for unspecified enum * Update API reports * Make promptTokensDetails an array * Update API reports * Add missing documentation * Add changeset entry * Fix formatting errors * Update API reports * Add missing refdoc * Update API reports * Use correct formatting for Modality documentation. * Update API reports * Update docs * Update packages/vertexai/src/types/enums.ts Co-authored-by: rachelsaunders <[email protected]> * Update md files * Revert changes to api-extractor.json --------- Co-authored-by: rachelsaunders <[email protected]> * Add Vertex and DataConnect E2E smoke tests (#8805) * Update dev script dependency @changesets/types to v6 (#8806) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * Bump cross-spawn from 7.0.3 to 7.0.6 in /e2e (#8645) Bumps [cross-spawn](https://github.com/moxystudio/node-cross-spawn) from 7.0.3 to 7.0.6. - [Changelog](https://github.com/moxystudio/node-cross-spawn/blob/master/CHANGELOG.md) - [Commits](moxystudio/node-cross-spawn@v7.0.3...v7.0.6) --- updated-dependencies: - dependency-name: cross-spawn dependency-type: indirect ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Revert "Bump cross-spawn from 7.0.3 to 7.0.6 in /e2e (#8645)" (#8811) This reverts commit 117279e. * App Hosting JS SDK autoinit (#8483) Adds a `postinstall` step to `@firebase/util` which hardcodes autoinit defaults into `@firebase/util/dist/postinstall.(m)js` for both CJS and ESM variants [go/firestack-autoinit](http://goto.google.com/firestack-autoinit). Hardcoding is important since environment variable substitution doesn't dive into a developer's `node_modules` folder. This gives us a framework agnostic way to inject configuration and is a more robust solution than the _experimental_ autoinit methods provided by Web Frameworks [go/firebase-api-client-autoinit](https://goto.google.com/firebase-api-client-autoinit). Once this lands, we'll backport to Hosting and Functions and aim to deprecate the other autoinit methods. * Added non-null properties to database.ThenableReference (#8800) * Version Packages (#8794) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * DataConnect: Update requests to point to v1 backend endpoints instead of v1beta (#8820) * update API requests to point to v1 instead of v1beta * update changelog * Remove auto-commit API report workflow (#8808) * Add missing `BlockReason` and `FinishReason` enum values (#8741) * Convert enum comments to JSDoc comments (#8742) * Add `systemInstruction`, `tools`, and `generationConfig` to `CountTokensRequest` (#8827) * Update yarn.lock in e2e (#8831) * Fix transport service retry logic condition (Fixes #8813) (#8814) Modify the retry mechanism to stop when remaining tries is less than or equal to zero, improving the robustness of the retry handling. Fixes #8813 * Give `check-changeset` job pull request write permissions (#8833) * Fix: sort strings in UTF-8 encoded byte order with lazy encoding (#8787) * fix golden files test (#8843) * fix golden files test * update responses version --------- Co-authored-by: David Motsonashvili <[email protected]> * Surface all App Check errors in public getToken() method (#8842) * Grant Create PR workflow pull-request write perms. (#8846) Our staging release workflow failed to create a Version Packages PR. Grant the PR creation workflow `pull-request` `write` permissions. * Grant Create PR content write permissions. (#8848) Our staging release workflow failed to merge branch. Grant the PR creation workflow `content` `write` permissions to fix this. * Version Packages (#8849) - v11.5.0 Release version 11.5.0. Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Deprecate `GroundingAttribution` (#8832) * Stream full logs of test-all firestore job (#8851) * Set github actions to use commit hashes (#8852) * Implement Partial Errors for FDC (#8821) * Add `X-Firebase-AppId` header to VertexAI requests (#8809) * Update dependency webpack to v5.94.0 [SECURITY] (#8734) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * Update dependency express to v4.20.0 [SECURITY] (#8759) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> * karma.base.js: replace "BROWSER" with "BROWSERS" in log messages, to reflect the actual name of the environment variable (see #8491) (#8875) * Auth cookie persistence (#8839) Adding `Persistence.COOKIE` a new persistence method backed by cookies. The `browserCookiePersistence` implementation is designed to be used in conjunction with middleware that ensures both your front and backend authentication state remains synchronized. * firestore: fix `undefined` document snapshot data after "clear site data" (#8871) * Use base browsers variable in Firestore karma config (#8876) * [chore] Update some dependencies (#8868) * Version Packages (#8878) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Fix 'window is not defined' error when calling clearIndexedDbPersistence * Fix Data Connect Types (#8898) * test(vertexai): update mock responses to v8 (#8904) * Fix: display WebChannel error message (#8907) * test(vertexai): add `backendName` param to mock response getters (#8906) `backendName` can be either `googleAI` or `vertexAI`. This can be passed to `getMockResponse` or `getMockResponseStreaming` to specify whether the mock response lookup should be done in the set of vertexAI or googleAI mock files. Modified the `convert-mocks.ts` script to read mock responses from the 'developerapi' directory, and add an export to the generated file for the new lookup object with those mock responses. * fix: remove `null` value inclusion from `!=` and `not-in` filter results (#8915) * Add assertion IDs that will be included in production log statements for fail and hardAsserts (#8313) * docs(vertexai): remove HTML `<code>` tags (#8900) We wrap links that reference public APIs with `<code>` tags so that they can be rendered as code in devsite. Unfortunately, VSCode's hover feature can not render these links at all. Since there are no other alternatives for creating code-formatted links, we should remove the problematic HTML `<code>` tags. * test(vertexai): update developerapi mock response dir to googleai (#8921) * Fix languageCode parameter in action_code_url (#8912) * Fix languageCode parameter in action_code_url * Add changeset * test(vertexai): update mock responses to v10 (#8959) * Version Packages (#8956) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Improve the integration test coverage for online vs offline comparisons. (#8975) * Improve the integration test coverage for online vs offline comparisons. * prettier. * Add missing cases. * fix(vertexai): add missing quote to chat role error message (#8979) * Auto Enable SSL for Firebase Studio (#8980) * Add support for running the emulators in Cloud Workstation (#8968) * Add Cookie Support For Firebase Studio (#8986) * Fix Auth Port for Firebase Studio (#8998) * Change automaticDataCollectionEnabled to default to true. (#8999) * Version Packages (#8997) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Roll back #8999 (#9007) * Revert "Change automaticDataCollectionEnabled to default to true. (#8999)" This reverts commit 3789b5a. * Revert #8999 - backend is not ready * Version Packages (#9008) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * fix(vertexai): pass `GenerativeModel`'s `BaseParams` to `ChatSession` (#8972) startChat wasn't passing GenerativeModel's BaseParams to ChatSession. So, if a model had a generationConfig, it would never be passed to ChatSession. * feat(vertexai): Gemini multimodal output (#8922) Adds new ResponseModality enum that allows users to specify which modalities should be included in a response. Since we provide a text() accessor, a similar inlineDataParts() accessor was added to return all InlineDataPart[] in the first candidate. * test: Clean up vscode launch file and add vertexai debug config (#9000) * Fix encoding of CSI keys in Safari (#8993) * Restore #8999 (#9010) * Revert "Roll back #8999 (#9007)" This reverts commit 51e7b48. * Changeset * feat(ai): Migrate to `AI` and add GoogleAI support (#8931) * Add Emulator Overlay (#8977) * Rename `@firebase/vertexai` package to `@firebase/ai` (#9011) * Add Emulator Overlay (#8977) (#9031) Co-authored-by: Maneesh Tewani <[email protected]> * add temporary release branch (#9032) * Allow custom branch for release (#9033) * add temporary release branch * restore Vertex token for temp publish (#9036) * Version Packages (manual PR) (#9034) * Fixed scroll behavior (#9043) * Version Packages (manual PR) (#9044) * Remove redundant changeset * Update e2e tests to reflect AI package and new API (#9017) * Version Packages (#9016) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Revert "Fixed scroll behavior (#8977)" (#9053) * Revert "Fixed scroll behavior (#9043)" This reverts commit 9bcd1ea. * Revert "Add Emulator Overlay (#8977) (#9031)" This reverts commit 8593fa0. * Undid revert of util * Create dirty-crews-cross.md * Removed unnecessary markdown file * Updated to only show banner when calling connect*Emulator (#9055) * Version Packages (#9054) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Close webchannel instances on terminate (#9041) Explicitly close any unclosed WebChannel instances on Firestore#terminate() * Reorder reporters to prevent log message duplication in CI (#9040) * docs(ai): Fix 'occured' typo in error codes (#9021) * feat(ai): Add `HarmBlockThreshold.OFF` (#9050) * feat(ai): Schema `title`, `maximum`, `minimum`, and `propertyOrdering` (#9047) * feat(ai): Add support for `minItems` and `maxItems` to `Schema` (#9026) * Build out e2e subdirectories (#9065) * Added test project to fdc (#8924) * Version Packages (#9073) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Release version 11.9.0. * test(ai): add integration tests (#8853) * Update issue template to rename VertexAI > AI (#9074) * ci: Fix Firefox error and streamline cross-browser tests (#9015) * Fixed issue where requestSts wasn't including the Studio cookie in it (#9075) * Fix issue where we ping an endpoint that hasn't been implemented (#9059) * Updated firebase auth changeset (#9088) * build(all): Version Packages v11.9.1 (#9089) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> v11.9.1 release. * test(ai): Upgrade mock responses to v14 (#9091) * Feat/google3 typings support (#9085) * fix(ai): Add deprecation tag to `totalBillableCharacters` (#9090) * test(ai): Add `gemini-2.5-flash` to integration tests (#9110) Added gemini-2.5-flash to our integration tests now that it's publicly available. The token counts differ slightly between 2.0-flash and 2.5-flash so I introduced conditionals when checking token counts. * firestore: minor refactor of listener registration of "versionchange" indexedb events (#9087) * Fixed issue where storage wasn't passing cookies when using firebase Studio (#9111) * fix: Add "react-native" entry point to @firebase/app (#9112) * Feat(Firestore) JSON serialization of types to improve SSR support. (#8926) Support the ability to resume `onSnapshot` listeners in the CSR phase based on serialized `DataSnapshot`s and `QuerySnapshot`s built in the SSR phase. Allow Firestore result types to be serialized with `toJSON` and then deserialized with `fromJSON` methods on the objects. `DocumentSnapshot` and `QuerySnapshot` deserialization methods will be standalone, tree-shakable functions `dataSnapshotFromJSON` and `querySnapshotFromJSON`. * chore(Firestore) add changeset for PR #8926 (#9115) Add the missing changset for PR #8926. * Update Fireperf logging to use sendBeacon only if the payload is under the 64KB limit (#9120) * Update Fireperf logging to use sendBeacon only if the payload is under the 64KB limit for most browsers. - For the flush, attempt to use sendBeacon with a low number of events incase sendBeacon is also used by other libraries. * Add changeset and fix format * Add additional comments * Put max flush size behind remote config flag * Version Packages (#9126) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Increase unit test coverage for facebook.ts to 100% (#9129) * Increase unit test coverage for facebook.ts to 100% * Increase unit test coverage for github.ts to 100% (#9130) * Increase unit test coverage for github.ts to 100% * test: Use inline source maps for webpack test output (#9109) * Increase unit test coverage for saml.ts to 100% (#9131) * Increase unit test coverage for saml.ts to 100% * firestore: misc.ts: further improved performance of UTF-8 string comparison logic * fix(ai): typings for functionDeclaration.parameters. (#8948) * feat(ai): add support for grounding with google search (#9068) * feat!(ai): Remove `vertexai` import alias (#9081) * docs(firestore): add an example to the detailed comment in compareUtf8Strings() (#9154) * feat(ai): Add support for Thinking Budget (#9156) * feat!(ai): remove `VertexAI` APIs (#9063) * fix!(ai): convert TS enums exports in Firebase AI into const variables (#9137) * fix(firestore): revert fix for `undefined` document snapshot data after "clear site data" (#9162) This reverts #8871 because it had unexpected negative customer impacts (#9056) and caused CI failures. * build!: update build targets to ES2020 (#9101) * (feat) initializeServerApp support for App Hosting auto init (#9151) Implement Auto Init for `initializeServerApp`. Auto init was previously implemented for `initializeApp` in #8483. This PR adds the same functionality to the `initializeServerApp` API surface. Fixes #8863 * feat!(ai): Remove `GroundingAttribution` (#9079) * feat!(ai): add support for `anyOf` schemas (#9042) * chore!: update engines.node to minimum of 20 (#9128) * fix(ci): Update e2e tests to import from ai instead of vertexai (#9164) * chore: Add missing changeset for 9101 (#9167) * Revert "firestore: minor refactor of listener registration of "versionchange" indexedb events (#9087) (#9168) * Revert "firestore: minor refactor of listener registration of "versionchange" indexedb events (#9087)" This reverts commit f73e08b. * Create hungry-icons-dream.md * docs: fix doc typos found by g3 presubmit (#9166) * Version Packages (#9165) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Fixes a bug where the out-of-the-box core web vital metrics would throw an error when the target element name exceeds the max length of a custom attribute value. (#9178) * fix(app-check): Prevent redundant exchangeToken calls in debug mode (#9187) * Update BrowsersList DB (#9188) * Firebase AI Hybrid Inference Implementation (#9029) Co-authored-by: Daniel La Rocque <[email protected]> Co-authored-by: Erik Eldridge <[email protected]> Co-authored-by: Christina Holland <[email protected]> * Version Packages (#9195) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Backfill fixes for hybrid docs (#9197) * Fix Firebase Functions Emulator usage on Firebase Studio (#9204) * Used valid path for pingserver * Added test * Create three-balloons-collect.md * Fixed fromUrl * Removed only * Fixed typings * fix(firestore): fix empty message reject inside transaction body (#9177) * Add limitedUseToken option to AI SDK (#9201) * [Firebase AI] Add thought summary and signature support (#9192) * fix(ai): Only include unexpected `prediction` element in error message (#9222) * fix(ai): Only include unexpected prediction element in error message. If the `responseJson` from the Predict request has an image, the entire image base64 will be included in the error message. This can make the error message >400kB (!!), which could cause performance issues in apps. I ran into this when testing #9216, since the `safetyAttributes` would be included in a response and trigger this error, but the `predictions` array in `responseJson` would have images. I noticed that when the sample app rendered the `error.message`, it began running into serious performance issues. * fix(ai): Add `'includeSafetyAttributes'` to Predict request payloads (#9216) * fix(ai): keep window-specific code out of Node bundle (#9225) * feat(ai): add support for the Live API (#9224) * Version Packages (#9230) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * remove accidental factory export (#9232) * Version Packages (#9233) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * fix(ai): Fix test script broken by #9232 (#9235) * Update CHANGELOG.md (#9237) * feat(ai): Add prefer_in_cloud option for inference mode (#9236) Co-authored-by: Daniel La Rocque <[email protected]> Co-authored-by: rachelsaunders <[email protected]> * Remote Config Realtime for Web (#9221) * feat(ai): Add code execution (#9249) * chore(firestore): Update closure-net dependency to the latest release (#9242) * Update closure-net dependency to the latest release (6f48f578d3e80fe7a85e530a5d95b9351433d135) * Change documentation tags for hybrid inference from EXPERIMENTAL to public preview (#9253) * docs: Fix docgen script to clean out package temp folders first (#9250) * Version Packages (#9258) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Expose setUserProperties from internal analytics instance (#9263) * feat(ai): add support for URL context (#9254) * docs(ai): tag code execution with "beta" tag (public preview) (#9274) * fix(ai): Handle empty parts when streaming (#9262) * feat(ai): Add method to send function responses in a live session (#9272) * Update ai model name used for e2e tests (#9282) * docs(ai): promote gemini developer api and imagen to GA (#9255) * Version Packages (#9300) release v12.4.0 For JS SDK release v12.4.0 Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * remove emulator files that were readded * Update util.api.md --------- Signed-off-by: Emmanuel Ferdman <[email protected]> Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: Maneesh Tewani <[email protected]> Co-authored-by: DellaBitta <[email protected]> Co-authored-by: Google Open Source Bot <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: DellaBitta <[email protected]> Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> Co-authored-by: Christina Holland <[email protected]> Co-authored-by: Bryan Atkinson <[email protected]> Co-authored-by: Daniel La Rocque <[email protected]> Co-authored-by: Alexander <[email protected]> Co-authored-by: jposuna <[email protected]> Co-authored-by: Nhien (Ricky) Lam <[email protected]> Co-authored-by: NhienLam <[email protected]> Co-authored-by: mansisampat <[email protected]> Co-authored-by: Mila <[email protected]> Co-authored-by: Stephen Rosa <[email protected]> Co-authored-by: Emmanuel Ferdman <[email protected]> Co-authored-by: Kevin Elko <[email protected]> Co-authored-by: dlarocque <[email protected]> Co-authored-by: Rodrigo Lazo <[email protected]> Co-authored-by: rachelsaunders <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: James Daniels <[email protected]> Co-authored-by: Rich Hodgkins <[email protected]> Co-authored-by: Leo <[email protected]> Co-authored-by: David Motsonashvili <[email protected]> Co-authored-by: David Motsonashvili <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Denver Coneybeare <[email protected]> Co-authored-by: Pavan Shankar <[email protected]> Co-authored-by: Ehsan <[email protected]> Co-authored-by: eranc-google <[email protected]> Co-authored-by: Raymond Lam <[email protected]> Co-authored-by: Raymond Lam <[email protected]> Co-authored-by: Nicole <[email protected]> Co-authored-by: gsiddh <[email protected]> Co-authored-by: Erik Eldridge <[email protected]> Co-authored-by: cherylEnkidu <[email protected]> Co-authored-by: Andrew Heard <[email protected]> Co-authored-by: Daymon <[email protected]> Co-authored-by: Tushar Khandelwal <[email protected]>

hsubox76 added 5 commits September 2, 2025 13:59

fix tests

6619ed4

stop publicly exposing mode

cadfd09

fix ts errors

36deac4

format

0aadd2e

hsubox76 marked this pull request as ready for review September 3, 2025 16:36

hsubox76 requested review from a team as code owners September 3, 2025 16:36

hsubox76 added 2 commits September 3, 2025 09:37

undo grammar fix

f929d6c

fix prefer_on_cloud fallback criteria

8c9c1c2

dlarocque reviewed Sep 3, 2025

View reviewed changes

hsubox76 and others added 9 commits September 3, 2025 12:09

Apply suggestions from code review

34094c8

Co-authored-by: Daniel La Rocque <[email protected]>

Add doc comments on InferenceModes

7c4f837

Merge branch 'feat/prefer-in-cloud' of https://github.com/firebase/fi…

274e5e8

…rebase-js-sdk into feat/prefer-in-cloud

Update .changeset/feat-prefer-in-cloud.md

db53108

Co-authored-by: Daniel La Rocque <[email protected]>

fix ERROR casing and formatting

a1af166

Merge branch 'feat/prefer-in-cloud' of https://github.com/firebase/fi…

6f87cb4

…rebase-js-sdk into feat/prefer-in-cloud

fix expected error case in tests

7fd62c4

Move InferenceMode doc comment to be on top of type

d4e843e

move docs to where they will be seen

15908c7

rachelsaunders requested changes Sep 8, 2025

View reviewed changes

hsubox76 and others added 3 commits September 8, 2025 12:52

Update .changeset/feat-prefer-in-cloud.md

d165c42

Co-authored-by: rachelsaunders <[email protected]>

Address comment

4324ddd

Merge branch 'feat/prefer-in-cloud' of https://github.com/firebase/fi…

337d34c

…rebase-js-sdk into feat/prefer-in-cloud

rachelsaunders approved these changes Sep 8, 2025

View reviewed changes

dlarocque approved these changes Sep 9, 2025

View reviewed changes

hsubox76 merged commit 06ab5c4 into main Sep 9, 2025
37 of 38 checks passed

hsubox76 deleted the feat/prefer-in-cloud branch September 9, 2025 18:14

google-oss-bot mentioned this pull request Sep 17, 2025

Version Packages #9258

Merged

mansisampat pushed a commit that referenced this pull request Sep 25, 2025

feat(ai): Add prefer_in_cloud option for inference mode (#9236)

3207246

Co-authored-by: Daniel La Rocque <[email protected]> Co-authored-by: rachelsaunders <[email protected]>

firebase locked and limited conversation to collaborators Oct 10, 2025


		<b>(EXPERIMENTAL)</b> Determines whether inference happens on-device or in-cloud.

		<b>PREFER\_ON\_DEVICE:</b> Attempt to make inference calls on-device. If on-device inference is not available, it will fall back to cloud. <br/> <b>ONLY\_ON\_DEVICE:</b> Only attempt to make inference calls on-device. It will not fall back to cloud. If on-device inference is not available, inference methods will throw. <br/> <b>ONLY\_IN\_CLOUD:</b> Only attempt to make inference calls to the cloud. It will not fall back to on-device. <br/> <b>PREFER\_IN\_CLOUD:</b> Attempt to make inference calls to the cloud. If not available, it will fall back to on-device.

feat(ai): Add prefer_in_cloud option for inference mode #9236

feat(ai): Add prefer_in_cloud option for inference mode #9236

Uh oh!

Conversation

hsubox76 commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

changeset-bot bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

google-oss-bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Size Report 1

Affected Products

@firebase/ai

firebase

Test Logs

Uh oh!

google-oss-bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Size Analysis Report 1

Affected Products

@firebase/ai

AIError

AIErrorCode

AIModel

AnyOfSchema

ArraySchema

Backend

BackendType

BlockReason

BooleanSchema

ChatSession

FinishReason

FunctionCallingMode

GenerativeModel

GoogleAIBackend

HarmBlockMethod

HarmBlockThreshold

HarmCategory

HarmProbability

HarmSeverity

ImagenAspectRatio

ImagenImageFormat

ImagenModel

ImagenPersonFilterLevel

ImagenSafetyFilterLevel

InferenceMode

IntegerSchema

LiveGenerativeModel

LiveResponseType

LiveSession

Modality

NumberSchema

ObjectSchema

POSSIBLE_ROLES

ResponseModality

Schema

SchemaType

StringSchema

VertexAIBackend

getAI

getGenerativeModel

getImagenModel

getLiveGenerativeModel

startAudioConversation

Test Logs

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rachelsaunders Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

hsubox76 Sep 8, 2025

Choose a reason for hiding this comment

hsubox76 commented Sep 2, 2025 •

edited

Loading

changeset-bot bot commented Sep 2, 2025 •

edited

Loading

google-oss-bot commented Sep 2, 2025 •

edited

Loading

Size Report ¹

`@firebase/ai`

`firebase`

google-oss-bot commented Sep 2, 2025 •

edited

Loading

Size Analysis Report ¹

`@firebase/ai`

`AIError`

`AIErrorCode`

`AIModel`

`AnyOfSchema`

`ArraySchema`

`Backend`

`BackendType`

`BlockReason`

`BooleanSchema`

`ChatSession`

`FinishReason`

`FunctionCallingMode`

`GenerativeModel`

`GoogleAIBackend`

`HarmBlockMethod`

`HarmBlockThreshold`

`HarmCategory`

`HarmProbability`

`HarmSeverity`

`ImagenAspectRatio`

`ImagenImageFormat`

`ImagenModel`

`ImagenPersonFilterLevel`

`ImagenSafetyFilterLevel`

`InferenceMode`

`IntegerSchema`

`LiveGenerativeModel`

`LiveResponseType`

`LiveSession`

`Modality`

`NumberSchema`

`ObjectSchema`

`POSSIBLE_ROLES`

`ResponseModality`

`Schema`

`SchemaType`

`StringSchema`

`VertexAIBackend`

`getAI`

`getGenerativeModel`

`getImagenModel`

`getLiveGenerativeModel`

`startAudioConversation`