@@ -18,29 +18,29 @@ In-Database Machine Learning for DuckDB
 
 ---
 
-Infera is DuckDB extension that allows you use machine learning (ML) models directly in SQL queries to perform inference
-on data stored in DuckDB tables.
+Infera is a DuckDB extension that allows you to use machine learning (ML) models directly in SQL queries to perform
+inference on data stored in DuckDB tables.
 It is developed in Rust and uses [Tract](https://github.com/snipsco/tract) as the backend inference engine.
 Infera supports loading and running models in [ONNX](https://onnx.ai/) format.
-Check out the [ONNX Model Zoo](https://huggingface.co/onnxmodelzoo) repositors on Hugging Face for a large
+Check out the [ONNX Model Zoo](https://huggingface.co/onnxmodelzoo) repository on Hugging Face for a large
 collection of ready-to-use models that can be used with Infera.
 
 ### Motivation
 
-In a conventional data science workflow, when data is stored in a database, it is not normally possible to use ML models
-directly on the data.
-Users need to move the data out of the database first (for example, export it to a CSV file), load the data into a
+In a conventional data science workflow, when data is stored in a database, it is not typically possible to use ML
+models directly on the data.
+Users need to move the data out of the database first (for example, export it to a CSV file) and load the data into a
 Python or R environment, run the model there, and then import the results back into the database.
 This process is time-consuming and inefficient.
-Infera aims to solve this problem by letting users to run ML models directly in SQL queries inside the database.
+Infera aims to solve this problem by letting users run ML models directly in SQL queries inside the database.
 It simplifies the workflow and speeds up the process for users, and eliminates the need for moving data around.
 
 ### Features
 
-- Adds ML inference as first-class citizens in SQL queries.
+- Adds ML inference as a first-class citizen in SQL queries.
 - Supports loading and using local as well as remote models.
 - Supports using ML models in ONNX format with a simple and flexible API.
-- Supports performing inference on table columns or raw BLOB (tensor) data.
+- Supports performing inference on table columns or raw tensor data.
 - Supports both single-value and multi-value model outputs.
 - Supports autoloading all models from a specified directory.
 - Thread-safe, fast, and memory-efficient.
@@ -55,7 +55,7 @@ See the [ROADMAP.md](ROADMAP.md) for the list of implemented and planned feature
 
 ### Quickstart
 
-1. Clone the repository and build Infera extension from source:
+1. Clone the repository and build the Infera extension from source:
 
 ```bash
 git clone --recursive https://github.com/CogitatorTech/infera.git
@@ -94,9 +94,11 @@ select infera_unload_model('linear_model');
 select infera_get_version();
 ````
 
+[](https://asciinema.org/a/745806)
+
 > [!NOTE]
 > After building from source, the Infera binary will be `build/release/extension/infera/infera.duckdb_extension`.
-> You can load it using the `load 'build/release/extension/infera/infera.duckdb_extension';` in DuckDB shell.
+> You can load it using the `load 'build/release/extension/infera/infera.duckdb_extension';` in the DuckDB shell.
 > Note that the extension binary will only work with the DuckDB version that it was built against.
 > At the moment, Infera is not available as
 > a [DuckDB community extension](https://duckdb.org/community_extensions/list_of_extensions).