Skip to content

Releases: rapidsai/cudf

v26.02.01

05 Feb 16:33
v26.02.01
5b9658c

Choose a tag to compare

What's Changed

🐛 Bug Fixes

Full Changelog: v26.02.00...v26.02.01

v26.02.00

04 Feb 17:39
v26.02.00
9782a26

Choose a tag to compare

What's Changed

🚨 Breaking Changes

  • Avoid counting nulls and creating null mask in groupby aggregation MERGE_M2 by @ttnghia in #20716
  • Remove cudf::get_current_device_resource by @bdice in #20688
  • Avoid creating null mask in groupby aggregation M2 by @ttnghia in #20726
  • Remove deprecated left semi- and anti- join APIs by @shrshi in #20668
  • Inline and simplify some column methods by @vyasr in #20819
  • Enable copy-on-write in cudf.pandas by @vyasr in #20401
  • [FEA] Improve Null-Aware Operator Support in AST-Codegen by @lamarrr in #20206
  • Remove legacy hash-combine logic and unify hashing with row hasher by @PointKernel in #20796
  • Remove deprecated .from_pandas constructors by @mroeschke in #20925
  • Remove deprecated Series.data by @mroeschke in #20914
  • Remove all base attributes from ColumnBase by @vyasr in #20961
  • Fix handling of unquoted strings in the CSV reader by @vuule in #20996

🐛 Bug Fixes

📖 Documentation

  • Add note to developer guide about null values being undefined by @bdice in #20645
  • [DOC] Add cudf-polars to the example build command by @Matt711 in #20763
  • Clarify internal API header placement guidelines for details headers by @PointKernel in #20985
  • Clarify deprecation message for cudf::round by @nirandaperera in #20809
  • Require nvcc 12.9 in contributing guide by @bdice in #21186

🚀 New Features

Read more

v25.12.00

10 Dec 22:34
v25.12.00
580975b

Choose a tag to compare

What's Changed

🚨 Breaking Changes

🐛 Bug Fixes

Read more

v25.10.00

08 Oct 16:55

Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

Read more

[NIGHTLY] v25.12.00

25 Sep 18:27

Choose a tag to compare

[NIGHTLY] v25.12.00 Pre-release
Pre-release

🔗 Links

🚨 Breaking Changes

  • Change .str.starts/endswith with tuple argument to match any pattern instead of pairwise matching (#20249) @mroeschke
  • Remove DataFrame.apply_chunks, Groupby.apply_grouped (#20194) @mroeschke
  • [cudf-polars] CUDA stream (#20154) @madsbk
  • Remove compatibility with nvCOMP versions before 5.0 (#20140) @vuule
  • Rewrite JNI functions to use JNI_TRY/JNI_CATCH (#19053) @ttnghia

🐛 Bug Fixes

📖 Documentation

🚀 New Features

  • Implement ARGMIN and ARGMAX aggregations for reduction (#20207) @ttnghia
  • Add remaining memory resources (#20197) @vyasr
  • Add memory resources to scalars (#20196) @vyasr
  • Skip decompression of pruned parquet pages (#20192) @mhaseeb123
  • Add memory resources to replace, json, and hashing (#20150) @vyasr
  • Support decimal literals in cudf-polars (#20147) @Matt711
  • Add pylibcudf is_valid_reduce_aggregation API (#20145) @davidwendt
  • Add memory resources to I/O modules (#20136) @vyasr
  • Add memory resources to reduce, column, column_factories, and contiguous_split (#20135) @vyasr
  • Passthrough unary ops through Parquet predicate pushdown (#20127) @mhaseeb123
  • Add memory resource to all strings modules (#20123) @vyasr
  • Add memory resources to all nvtext APIs (#20119) @vyasr
  • Add an example to inspect parquet files and dump row group and page level metadata information (#20117) @mhaseeb123
  • Allow multiple calls to cudf::initialize and cudf::deinitialize (#20111) @vuule
  • Remove rounding from cudf java (#20110) @pmattione-nvidia
  • Add memory resources to groupby, datetime, and lists modules (#20102) @vyasr
  • Add memory resources to search, reshape, and partitioning module (#20101) @vyasr
  • Add memory resources to rolling, sorting, and quantiles modules (#20099) @vyasr
  • Add memory resources to binaryop, copying, and stream_compaction (#20059) @vyasr
  • Add memory resources to unary, transform, and filling modules (#20054) @vyasr
  • Support cum_sum(...).over(...) expressions in cudf-polars (#19908) @Matt711
  • Support forward/backward filling null values in a grouped window context (#19907) @Matt711
  • [FEA] Implement JIT Filter for read_parquet (#19831) @lamarrr
  • Add an example to demonstrate the use of next-gen parquet reader to read a parquet file with highly selective filters (#19469) @mhaseeb123
  • Rewrite JNI functions to use JNI_TRY/JNI_CATCH (#19053) @ttnghia
  • Add support for maintain_order param in joins (#17698) @Matt711

🛠️ Improvements

Read more

v25.08.00

06 Aug 17:51
6cea374

Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

Read more

[NIGHTLY] v25.10.00

09 Oct 14:02
fa0f083

Choose a tag to compare

[NIGHTLY] v25.10.00 Pre-release
Pre-release

🔗 Links

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

Read more

v25.06.00

05 Jun 17:31
v25.06.00
84c4350

Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

  • Add CLI argument to enable RMM async memory resource in PDS-H (#18899) @pentschev
  • Scan a headerless CSV file with column names provided (#18816) @Matt711
  • Add fast paths for DataFrame.to_cupy (#18801) @Matt711
  • Require numba-cuda>=0.11.0 (#18770) @brandon-b-miller
  • Create a pylibcudf Column from a python iterable (#18768) @Matt711
  • Support ConditianalJoin via broadcasting in cudf-polars streaming engine (#18723) @rjzamora
  • Experimental PQ reader utility to calculate total rows in input row groups (#18716) @mhaseeb123
  • Extend explain_query to support printing the logical plan (pre lowered plan) (#18708) @Matt711
  • Reuse libcudf dependencies for Java JNI build when they are available (#18682) @ttnghia
  • Add alloc_size member function to cudf::column and cudf::table (#18639) @davidwendt
  • Print the physical cudf-polars plan in pdsh.py (#18635) @rjzamora
  • String Transform Examples (#18616) @lamarrr
  • Add streaming support for group_by -> n_unique to cudf-polars (#18606) @rjzamora
  • Export cudf compiler flags and definitions (#18604) @ttnghia
  • Implement BIT_COUNT unary operation (#18589) @ttnghia
  • Expose column chunk metadata in read_parquet_metadata() (#18579) @mhaseeb123
  • Add APIs to check ORC and Parquet compression support at runtime (#18578) @vuule
  • Add Distinct support to the cudf-polars streaming executor (#18576) @rjzamora
  • Add support for large list host Arrow data conversion (#18562) @vyasr
  • Implement BITWISE_AGG aggregations (bitwise AND, OR and XOR) for sort-based groupby and reduction (#18551) @ttnghia
  • Implement row group pruning with bloom filters in experimental PQ reader (#18545) @mhaseeb123
  • Implement row group pruning with stats in experimental PQ reader (#18543) @mhaseeb123
  • [JNI] Expose row-wise sha1 api (#18540) @warrickhe
  • Add Sort + head/tail support to streaming cudf-polars executor (#18538) @rjzamora
  • Add multi-partition MapFunction support to cudf-polars (#18523) @rjzamora
  • Adds support for writing raw UTF-8 characters (without escaping) in the JSON writer (#18508) @Matt711
  • Support reading from device buffers in the pylibcudf IO APIs (#18496) @Matt711
  • Support multi-partition Select operations with aggregations (#18492) @rjzamora
  • Implemented String Output & User-data Support for Transforms (#18490) @lamarrr
  • Add a utility to bulk set multiple null masks (#18489) @mhaseeb123
  • High level interface for experimental PQ reader and implementation of metadata APIs (#18480) @mhaseeb123
  • Added pylibcudf.utilities.is_ptds_enabled (#18467) @TomAugspurger
  • Add a public API for copying a table_view to device array (#18450) @Matt711
  • Support cudf-polars cast_time_unit (#18442) @brandon-b-miller
  • Support creating a pylibcudf Column from a host array (#18425) @Matt711
  • Move parquet schema types and structs to public headers (#18424) @mhaseeb123
  • Add optional dtype argument to Scalar.from_any (#18415) @Matt711
  • Expose cudf::chunked_pack in pylibcudf (#18411) @wence-
  • Add support for long string columns in cudf::contiguous_split (#18393) @nvdbaranec
  • Implemented String Input support for Transforms and Removed jit::column_device_view (#18378) @lamarrr
  • Automatically dispatch between host and device decompression/compression based on the number of buffers (#18363) @vuule
  • Expose join hash table load factor (#18361) @PointKernel
  • Skip decoding of pages marked as pruned in PQ reader (#18347) @mhaseeb123
  • Sort-based inner join for high-multiplic...
Read more

v25.04.00

09 Apr 18:14
6bc4206

Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

  • Use pyarrow 15 in oldest dependency CI jobs (#18409) @bdice
  • Bump librdkafka to 2.8.0 (#18370) @raydouglass
  • fix(rattler): ignore libzlib run dependency to avoid pandoc collision (#18368) @gforsyth
  • Fix zstd build interface include definition (#18366) @trxcllnt
  • test: Install pytest-env and hypothesis in test_narwhals.sh (#18337) @MarcoGorelli
  • Remove unused group_range_rolling_window API (#18313) @wence-
  • Cache column view creation from arrow types (#18302) @vyasr
  • Split Narwhals cudf.pandas tests failures into to fix and to skip (#18267) @mroeschke
  • Support BinOp, min, and max Aggregations in cudf-polars parallel ...
Read more

v25.02.02

03 Mar 18:22
v25.02.02
8139f3c

Choose a tag to compare

🚨 Breaking Changes

  • Expose stream-ordering in scalar and avro APIs (#17766) @shrshi
  • Add seed parameter to hash_character_ngrams (#17643) @davidwendt
  • Performance improvements and simplifications for fixed size row-based rolling windows (#17623) @wence-
  • Refactor distinct hash join to handle multiple probes with the same build table (#17609) @PointKernel
  • Deprecate cudf::grouped_time_range_rolling_window (#17589) @wence-
  • Remove "legacy" Dask DataFrame support from Dask cuDF (#17558) @rjzamora
  • Return empty result for segmented_reduce if input and offsets are both empty (#17437) @davidwendt
  • Rework minhash APIs for deprecation cycle (#17421) @davidwendt
  • Change indices for dictionary column to signed integer type (#17390) @davidwendt

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Read more