Capacitor ML Kit Document Scanner Plugin

Unofficial Capacitor plugin for ML Kit Document Scanner.¹

This plugin allows you to use Google's ML Kit Document Scanner in your Capacitor applications. It provides an easy way to add a document scanning feature, allowing users to scan documents and receive them as JPEG images and/or PDF files.

Demo

A demo application showcasing the capabilities of this plugin can be found here: https://github.com/therealabdi2/demo-mlkit-doc-scanner

Installation

npm install capacitor-mlkit-doc-scanner
npx cap sync

Android

Add the ML Kit Document Scanner dependency to your app's build.gradle file (android/app/build.gradle):

dependencies {
    // ...other dependencies
    implementation 'com.google.android.gms:play-services-mlkit-document-scanner:16.0.0-beta1'
}

Ensure your project's minSdkVersion is 21 or higher in android/app/build.gradle. The ML Kit Document Scanner also requires a minimal device total RAM of 1.7GB.

Supported Platforms

Android
iOS (Not available)
Web (Not available - native feature)

scanDocument(...)

scanDocument(options?: ScanOptions | undefined) => Promise<ScanResult>

Starts the document scanning process.

Param	Type	Description
`options`	`ScanOptions`	Configuration options for the scanner.

Returns: Promise<ScanResult>

Interfaces

ScanResult

Result of a document scan operation.

Prop	Type	Description
`scannedImages`	`string[]`	An array of URIs for the scanned image pages (JPEG). Present if 'JPEG' or 'JPEG_PDF' was requested in resultFormats.
`pdf`	`PdfInfo`	Information about the generated PDF. Present if 'PDF' or 'JPEG_PDF' was requested in resultFormats.

PdfInfo

Information about a generated PDF document.

Prop	Type	Description
`uri`	`string`	The URI of the generated PDF file.
`pageCount`	`number`	The number of pages in the PDF.

ScanOptions

Options for the document scanner.

Prop	Type	Description	Default
`galleryImportAllowed`	`boolean`	Whether to allow importing from the photo gallery.	`false`
`pageLimit`	`number`	The maximum number of pages that can be scanned.	`10`
`resultFormats`	`'JPEG' \| 'PDF' \| 'JPEG_PDF'`	The desired result formats. Can be 'JPEG', 'PDF', or 'JPEG_PDF'.	`'JPEG_PDF'`
`scannerMode`	`'FULL' \| 'BASE' \| 'BASE_WITH_FILTER'`	The scanner mode. BASE: Basic editing capabilities (crop, rotate, reorder pages, etc.). BASE_WITH_FILTER: Adds image filters (grayscale, auto image enhancement, etc.) to the BASE mode. FULL: Adds ML-enabled image cleaning capabilities (erase stains, fingers, etc.) to the BASE_WITH_FILTER mode. This mode will also allow future major features to be automatically added along with Google Play services updates, while the other two modes will maintain their current feature sets and only receive minor refinements.	`"FULL"`

Usage Example

import { MlkitDocScanner } from 'capacitor-mlkit-doc-scanner';

async function startScan() {
  try {
    const result = await MlkitDocScanner.scanDocument({
      galleryImportAllowed: true,
      pageLimit: 5,
      resultFormats: 'JPEG_PDF',
      scannerMode: 'FULL',
    });

    console.log('Scan successful:', result);

    if (result.scannedImages && result.scannedImages.length > 0) {
      console.log('First image URI:', result.scannedImages[0]);
      // Handle image URIs (e.g., display them)
    }

    if (result.pdf) {
      console.log('PDF URI:', result.pdf.uri);
      console.log('PDF Page Count:', result.pdf.pageCount);
      // Handle PDF URI (e.g., open or upload it)
    }
  } catch (error) {
    console.error('Scan failed:', error);
  }
}

Important Notes

The ML Kit Document Scanner models, scanning logic, and UI flow are dynamically downloaded by Google Play services. Users might have to wait for these to download before the first use.
This API requires Android API level 21 or above.
It also requires a minimal device total RAM of 1.7GB. If lower, it returns an MlKitException with error code UNSUPPORTED when calling the API (this plugin will reject the promise).
Consider that generating document files takes time and requires processing power, so only request the output formats (JPEG, or PDF, or both) you actually need via the resultFormats option.

Contributing

Contributions are welcome! Please feel free to submit issues and pull requests to https://github.com/therealabdi2/capacitor-mlkit-doc-scanner-plugin.

License

This plugin is licensed under the MIT License.

This project is not affiliated with, endorsed by, sponsored by, or approved by Google LLC or any of their affiliates or subsidiaries. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.github/workflows		.github/workflows
.vscode		.vscode
android		android
src		src
.eslintignore		.eslintignore
.gitignore		.gitignore
.npmignore		.npmignore
.prettierignore		.prettierignore
.prettierrc.js		.prettierrc.js
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
rollup.config.mjs		rollup.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Capacitor ML Kit Document Scanner Plugin

Demo

Installation

Android

Supported Platforms

API

scanDocument(...)

Interfaces

ScanResult

PdfInfo

ScanOptions

Usage Example

Important Notes

Contributing

License

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Capacitor ML Kit Document Scanner Plugin

Demo

Installation

Android

Supported Platforms

API

scanDocument(...)

Interfaces

ScanResult

PdfInfo

ScanOptions

Usage Example

Important Notes

Contributing

License

Footnotes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages