react-native-vision-camera-mlkit


A React Native Vision Camera plugin that exposes high-performance Google ML Kit frame processor features such as text recognition (OCR), face detection, barcode scanning, pose detection, and more.

The example app is intentionally heavy and demo-focused. For integration details, follow the documentation below.

Requirements

Install Vision Camera (React Native):

npm i react-native-vision-camera
cd ios && pod install

Install Worklets Core:

npm i react-native-worklets-core
# or
yarn add react-native-worklets-core

Add the Babel plugin in babel.config.js:

module.exports = {
  plugins: [['react-native-worklets-core/plugin']],
};

For Expo, follow the Vision Camera guide: react-native-vision-camera.com/docs/guides

Installation

npm install react-native-vision-camera-mlkit
# or
yarn add react-native-vision-camera-mlkit

cd ios && pod install

Selective ML Kit Model Installation

By default, all ML Kit features are enabled. You can selectively include only the models you need to reduce binary size.

Android (Gradle)

In your app's android/build.gradle (root project), add:

ext["react-native-vision-camera-mlkit"] = [
  mlkit: [
    textRecognition: true,
    textRecognitionChinese: false,
    textRecognitionDevanagari: false,
    textRecognitionJapanese: false,
    textRecognitionKorean: false,
    faceDetection: false,
    faceMeshDetection: false,
    poseDetection: false,
    poseDetectionAccurate: false,
    selfieSegmentation: false,
    subjectSegmentation: false,
    documentScanner: false,
    barcodeScanning: true,
    imageLabeling: false,
    objectDetection: false,
    digitalInkRecognition: false,
  ]
]

iOS (Podfile)

In your ios/Podfile, add a configuration hash before target:

$VisionCameraMLKit = {
  'textRecognition' => true,
  'textRecognitionChinese' => false,
  'textRecognitionDevanagari' => false,
  'textRecognitionJapanese' => false,
  'textRecognitionKorean' => false,
  'faceDetection' => false,
  'poseDetection' => false,
  'poseDetectionAccurate' => false,
  'selfieSegmentation' => false,
  'barcodeScanning' => true,
  'imageLabeling' => false,
  'objectDetection' => false,
  'digitalInkRecognition' => false,
}

Android-only keys: faceMeshDetection, subjectSegmentation, documentScanner.
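
After trimming the native configuration, you can verify at runtime which models actually shipped in the binary. A minimal sketch using getAvailableFeatures (documented under Feature Utilities below):

import { getAvailableFeatures } from 'react-native-vision-camera-mlkit';

// Log the ML Kit features compiled into this build; useful after disabling
// models in Gradle/Podfile to confirm the configuration was picked up.
console.log('Enabled ML Kit features:', getAvailableFeatures());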

Usage

Text Recognition (Frame Processor)

import {
  Camera,
  runAsync,
  runAtTargetFps,
  useCameraDevice,
  useFrameProcessor,
} from 'react-native-vision-camera';
import { useTextRecognition } from 'react-native-vision-camera-mlkit';

function TextScannerScreen() {
  const device = useCameraDevice('back');

  const { textRecognition } = useTextRecognition({
    language: 'LATIN',
    scaleFactor: 1,
    invertColors: false,
  });

  const frameProcessor = useFrameProcessor(
    (frame) => {
      'worklet';

      // Throttle to ~10 FPS and run recognition off the frame processor thread.
      runAtTargetFps(10, () => {
        'worklet';
        runAsync(frame, () => {
          'worklet';
          const result = textRecognition(frame, {
            outputOrientation: 'portrait',
          });
          console.log(result.text);
        });
      });
    },
    [textRecognition]
  );

  if (device == null) return null;

  return (
    <Camera
      style={{ flex: 1 }}
      device={device}
      isActive
      frameProcessor={frameProcessor}
    />
  );
}

TextRecognitionOptions:

  • language?: 'LATIN' | 'CHINESE' | 'DEVANAGARI' | 'JAPANESE' | 'KOREAN'
  • scaleFactor?: number (0.9-1.0)
  • invertColors?: boolean
  • frameProcessInterval?: number (deprecated, use runAtTargetFps)

TextRecognitionArguments:

  • outputOrientation?: 'portrait' | 'portrait-upside-down' | 'landscape-left' | 'landscape-right' (iOS only)
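
To work with individual detections rather than the concatenated text, iterate the result's block/line hierarchy. A minimal sketch, assuming the result mirrors ML Kit's Text structure (blocks containing lines, each carrying a text field); check the package's exported TypeScript types for the exact shape:

// Hypothetical result shape based on ML Kit's Text hierarchy; the plugin's
// actual types may expose extra fields (e.g. bounding frames, corner points).
type RecognizedText = {
  text: string;
  blocks: { text: string; lines: { text: string }[] }[];
};

// Flatten a recognition result into one string per detected line.
function collectLines(result: RecognizedText): string[] {
  return result.blocks.flatMap((block) => block.lines.map((line) => line.text));
}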

Image Processing (Static Images)

Use processImageTextRecognition to analyze a file path or URI without the camera (for example, images picked from the gallery).

import { processImageTextRecognition } from 'react-native-vision-camera-mlkit';

const result = await processImageTextRecognition(imageUri, {
  language: 'LATIN',
  orientation: 'portrait',
  invertColors: false,
});

console.log(result.blocks);

TextRecognitionImageOptions:

  • language?: 'LATIN' | 'CHINESE' | 'DEVANAGARI' | 'JAPANESE' | 'KOREAN'
  • orientation?: 'portrait' | 'portrait-upside-down' | 'landscape-left' | 'landscape-right'
  • invertColors?: boolean

The native bridge normalizes URIs (file:// is removed on iOS and added on Android if missing). Supported formats: JPEG, PNG, WebP.
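
A typical gallery flow, sketched here with react-native-image-picker (an assumption; any picker that returns a file URI works the same way):

import { launchImageLibrary } from 'react-native-image-picker';
import { processImageTextRecognition } from 'react-native-vision-camera-mlkit';

// Pick a photo from the gallery and run on-device text recognition on it.
async function recognizeFromGallery() {
  const { assets } = await launchImageLibrary({ mediaType: 'photo' });
  const uri = assets?.[0]?.uri;
  if (uri == null) return;

  const result = await processImageTextRecognition(uri, { language: 'LATIN' });
  console.log(result.blocks);
}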

Feature Utilities

The package also exposes helpers from the plugin factory:

import {
  getFeatureErrorMessage,
  isFeatureAvailable,
  assertFeatureAvailable,
  getAvailableFeatures,
} from 'react-native-vision-camera-mlkit';

  • getAvailableFeatures(): MLKitFeature[]
  • isFeatureAvailable(feature: MLKitFeature): boolean
  • assertFeatureAvailable(feature: MLKitFeature): void
  • getFeatureErrorMessage(feature: MLKitFeature): string
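
For example, you can probe a feature before mounting a camera screen and surface the configuration hint instead of crashing. A minimal sketch (the TEXT_RECOGNITION key is confirmed under Error Handling below; other key names may differ):

import {
  isFeatureAvailable,
  getFeatureErrorMessage,
  MLKIT_FEATURE_KEYS,
} from 'react-native-vision-camera-mlkit';

// Soft check: warn (and fall back in your UI) instead of throwing when the
// model was excluded from the Gradle/Podfile configuration.
if (!isFeatureAvailable(MLKIT_FEATURE_KEYS.TEXT_RECOGNITION)) {
  console.warn(getFeatureErrorMessage(MLKIT_FEATURE_KEYS.TEXT_RECOGNITION));
}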

Error Handling

Frame processors throw a setup error when the feature is not enabled in Gradle/Podfile. For static image processing, the following error strings are exported:

  • IMAGE_NOT_FOUND_ERROR
  • INVALID_URI_ERROR
  • IMAGE_PROCESSING_FAILED_ERROR
  • UNSUPPORTED_IMAGE_FORMAT_ERROR
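
These can be matched against a rejected promise. A sketch, assuming the native side rejects with these strings in the error message:

import {
  processImageTextRecognition,
  IMAGE_NOT_FOUND_ERROR,
  UNSUPPORTED_IMAGE_FORMAT_ERROR,
} from 'react-native-vision-camera-mlkit';

// Wrap static image recognition and translate known failures into
// user-friendly warnings instead of unhandled rejections.
async function safeRecognize(uri: string) {
  try {
    return await processImageTextRecognition(uri, { language: 'LATIN' });
  } catch (e) {
    const message = e instanceof Error ? e.message : String(e);
    if (message.includes(IMAGE_NOT_FOUND_ERROR)) {
      console.warn('Image not found:', uri);
    } else if (message.includes(UNSUPPORTED_IMAGE_FORMAT_ERROR)) {
      console.warn('Unsupported format; use JPEG, PNG, or WebP.');
    } else {
      console.warn('Text recognition failed:', message);
    }
    return null;
  }
}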

Use the feature helpers to provide user-friendly configuration hints:

import {
  assertFeatureAvailable,
  MLKIT_FEATURE_KEYS,
} from 'react-native-vision-camera-mlkit';

assertFeatureAvailable(MLKIT_FEATURE_KEYS.TEXT_RECOGNITION);

Performance

  • Follow the Vision Camera performance guide.
  • Prefer runAsync(...) for heavy ML work to keep the frame processor responsive.
  • Use runAtTargetFps(...) to throttle processing instead of frameProcessInterval.

iOS Orientation Notes (Text Recognition)

iOS camera sensors are fixed in landscape orientation. The frame buffer stays landscape-shaped even when the UI rotates, so ML Kit needs an explicit orientation hint to rotate text correctly. On iOS, pass outputOrientation to textRecognition(frame, { outputOrientation }) so ML Kit can map the buffer to upright text. Android handles rotation automatically.
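
If your UI can rotate, you can derive the hint from the window dimensions. A rough sketch (an assumption, not the plugin's API); it only distinguishes portrait from landscape, so add a device orientation listener if you need to tell landscape-left from landscape-right:

import { Dimensions, Platform } from 'react-native';

type OutputOrientation =
  | 'portrait'
  | 'portrait-upside-down'
  | 'landscape-left'
  | 'landscape-right';

// Rough heuristic: assume the buffer should be rotated to match the current
// window orientation. Cannot distinguish the two landscape (or portrait)
// variants on its own.
function guessOutputOrientation(): OutputOrientation | undefined {
  if (Platform.OS !== 'ios') return undefined; // Android rotates automatically.
  const { width, height } = Dimensions.get('window');
  return height >= width ? 'portrait' : 'landscape-left';
}

Compute the value on the JS side (for example, during render) and capture it in the frame processor closure; React Native modules such as Dimensions are not available inside a worklet.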

⚠️ iOS Simulator (Apple Silicon) – Heads-up

On Apple Silicon Macs, building for the iOS Simulator (arm64) may fail after installing this package.

This is a known limitation of Google ML Kit, which does not currently ship an arm64-simulator slice for some iOS frameworks. The library works correctly on physical iOS devices and on the iOS Simulator when running under Rosetta.

Google ML Kit Vision Features Roadmap

#    Feature                         Status        Platform
0    Text recognition v2             complete      Android, iOS
1    Face detection                  in-progress   Android, iOS
2    Face mesh detection             in-progress   Android
3    Pose detection                  in-progress   Android, iOS
4    Selfie segmentation             in-progress   Android, iOS
5    Subject segmentation            in-progress   Android
6    Document scanner                in-progress   Android
7    Barcode scanning                in-progress   Android, iOS
8    Image labeling                  in-progress   Android, iOS
9    Object detection and tracking   in-progress   Android, iOS
10   Digital ink recognition         in-progress   Android, iOS

Sponsor on GitHub

If this project helps you, please consider sponsoring its development.

react-native-vision-camera-mlkit is provided as-is and maintained in my free time. If you’re integrating this library into a production app, consider funding the project.
