diff --git a/community_extensions/extensions/cache_httpfs.md b/community_extensions/extensions/cache_httpfs.md index 20314bb9da..0def1f0f39 100644 --- a/community_extensions/extensions/cache_httpfs.md +++ b/community_extensions/extensions/cache_httpfs.md @@ -35,8 +35,8 @@ docs: - Exposes function to get cache size and cleanup cache - Provides an option to disable / enable cache, which could act as a drop-in replacement for httpfs -extension_star_count: 105 -extension_star_count_pretty: 105 +extension_star_count: 106 +extension_star_count_pretty: 106 extension_download_count: 5441 extension_download_count_pretty: 5.4k image: '/images/community_extensions/social_preview/preview_community_extension_cache_httpfs.png' diff --git a/community_extensions/extensions/duckpgq.md b/community_extensions/extensions/duckpgq.md index 00c816214e..24eb838e1b 100644 --- a/community_extensions/extensions/duckpgq.md +++ b/community_extensions/extensions/duckpgq.md @@ -60,8 +60,8 @@ docs: *Disclaimer:* As this extension is part of an ongoing research project by the Database Architectures group at CWI, some features may still be under development. We appreciate your understanding and patience as we continue to improve it. -extension_star_count: 297 -extension_star_count_pretty: 297 +extension_star_count: 298 +extension_star_count_pretty: 298 extension_download_count: null extension_download_count_pretty: n/a image: '/images/community_extensions/social_preview/preview_community_extension_duckpgq.png' diff --git a/community_extensions/extensions/gsheets.md b/community_extensions/extensions/gsheets.md index 336f81dd5b..0778176031 100644 --- a/community_extensions/extensions/gsheets.md +++ b/community_extensions/extensions/gsheets.md @@ -67,8 +67,8 @@ docs: The DuckDB GSheets Extension allows reading and writing of data in Google Sheets from DuckDB. For detailed setup and usage instructions, visit the docs at [duckdb-gsheets.com](https://duckdb-gsheets.com). -extension_star_count: 303 -extension_star_count_pretty: 303 +extension_star_count: 304 +extension_star_count_pretty: 304 extension_download_count: 3661 extension_download_count_pretty: 3.7k image: '/images/community_extensions/social_preview/preview_community_extension_gsheets.png' diff --git a/community_extensions/extensions/miniplot.md b/community_extensions/extensions/miniplot.md index 2f7f0f9f43..17b2585925 100644 --- a/community_extensions/extensions/miniplot.md +++ b/community_extensions/extensions/miniplot.md @@ -108,6 +108,7 @@ LOAD {{ page.extension.name }}; | line_chart | scalar | NULL | NULL | | | miniplot | scalar | NULL | NULL | | | miniplot_openssl_version | scalar | NULL | NULL | | +| scatter_3d_chart | scalar | NULL | NULL | | | scatter_chart | scalar | NULL | NULL | | diff --git a/community_extensions/extensions/mlpack.md b/community_extensions/extensions/mlpack.md index 57df1257ba..0bd715d16b 100644 --- a/community_extensions/extensions/mlpack.md +++ b/community_extensions/extensions/mlpack.md @@ -8,7 +8,7 @@ excerpt: | extension: name: mlpack description: Machine learning library in C++ - version: 0.0.1 + version: 0.0.2 language: C++ build: cmake license: MIT @@ -19,8 +19,46 @@ extension: repo: github: eddelbuettel/duckdb-mlpack - ref: c2d0c83b24501fa6912765ccdb03685f021d2b86 + ref: 64ce9296ac4189f2c2883928f8e937b8954a8f79 +docs: + hello_world: | + -- Perform adaBoost (using weak learner 'Perceptron' by default) + -- Read 'features' into 'X', 'labels' into 'Y', use optional parameters + -- from 'Z', and prepare model storage in 'M' + CREATE TABLE X AS SELECT * FROM read_csv("https://eddelbuettel.github.io/duckdb-mlpack/data/iris.csv"); + CREATE TABLE Y AS SELECT * FROM read_csv("https://eddelbuettel.github.io/duckdb-mlpack/data/iris_labels.csv"); + CREATE TABLE Z (name VARCHAR, value VARCHAR); + INSERT INTO Z VALUES ('iterations', '50'), ('tolerance', '1e-7'); + CREATE TABLE M (json VARCHAR); + + -- Train model for 'Y' on 'X' using parameters 'Z', store in 'M' + CREATE TEMP TABLE A AS SELECT * FROM mlpack_adaboost("X", "Y", "Z", "M"); + + -- Count by predicted group + SELECT COUNT(*) as n, predicted FROM A GROUP BY predicted; + + -- Model 'M' can be used to predict + CREATE TABLE N (x1 DOUBLE, x2 DOUBLE, x3 DOUBLE, x4 DOUBLE); + -- inserting approximate column mean values + INSERT INTO N VALUES (5.843, 3.054, 3.759, 1.199); + -- inserting approximate column mean values, min values, max values + INSERT INTO N VALUES (5.843, 3.054, 3.759, 1.199), (4.3, 2.0, 1.0, 0.1), (7.9, 4.4, 6.9, 2.5); + -- and this predict one element each + SELECT * FROM mlpack_adaboost_pred("N", "M"); + + EOF + + extended_description: | + The mlpack extension allows to fit (or train) and predict (or classify) from the models implemented, currently adaBoost and (regularized) linear regression. + The format is the same for both: four tables, say, "X", "Y", "Z" and "M" provide input for, respectively, features "X", labels "Y", optional parameters varying by model in "Z" as well as an output table "M" for the JSON-serialized model. + Following a model fit (or training), a prediction (or classification) can be made using "M" and new + predictor values as shown in the example. + + The implementation is still stressing the 'minimal' part of 'a MVP demo': currently only Linux is supported. + It should be considered experimental, interfaces may change while we work out how to automate interface generation from the mlpack-side. + + For more, see the [repo](https://github.com/eddelbuettel/duckdb-mlpack). extension_star_count: 1 extension_star_count_pretty: 1 @@ -53,7 +91,8 @@ LOAD {{ page.extension.name }}; | function_name | function_type | description | comment | examples | |-------------------------------|---------------|-------------|---------|----------| -| mlpack_adaboost | table | NULL | NULL | | +| mlpack_adaboost_pred | table | NULL | NULL | | +| mlpack_adaboost_train | table | NULL | NULL | | | mlpack_linear_regression_fit | table | NULL | NULL | | | mlpack_linear_regression_pred | table | NULL | NULL | | | mlpack_table | table | NULL | NULL | | diff --git a/community_extensions/extensions/rusty_sheet.md b/community_extensions/extensions/rusty_sheet.md index 3524291176..742e8942bf 100644 --- a/community_extensions/extensions/rusty_sheet.md +++ b/community_extensions/extensions/rusty_sheet.md @@ -64,8 +64,8 @@ docs: The DuckDB rusty-sheet extension that enables reading Excel, WPS and OpenDocument spreadsheet files directly within SQL queries. This extension provides seamless integration for analyzing spreadsheet data using DuckDB's powerful SQL engine. For detailed setup and usage instructions, visit the docs at [rusty-sheet](https://github.com/redraiment/rusty-sheet). -extension_star_count: 39 -extension_star_count_pretty: 39 +extension_star_count: 40 +extension_star_count_pretty: 40 extension_download_count: 381 extension_download_count_pretty: 381 image: '/images/community_extensions/social_preview/preview_community_extension_rusty_sheet.png'