Skip to content

Releases: googleapis/python-bigquery-dataframes

v1.19.0

24 Sep 02:27
a95493d
Compare
Choose a tag to compare

1.19.0 (2024-09-24)

Features

  • Add ml.model_selection.KFold class (#1001) (952cab9)
  • Support bool and bytes types in describe(include='all') (#994) (cc48f58)
  • Support ingress settings in remote_function (#1011) (8e9919b)

Bug Fixes

Performance Improvements

  • Join op discards child ordering in unordered mode (#923) (1b5b0ee)

Dependencies

v1.18.0

18 Sep 20:21
d2fb49f
Compare
Choose a tag to compare

1.18.0 (2024-09-18)

Features

  • Add "include" param to describe for string types (#973) (deac6d2)
  • Add subset parameter to DataFrame.dropna to select which columns to consider (#981) (f7c03dc)

Bug Fixes

  • DataFrameGroupby.agg now works with unnamed tuples (#985) (0f047b4)
  • Fix a bug that raises exception when re-indexing columns with their original order (#988) (596b03b)
  • Make the Series.apply outcome assignable to the original dataframe in partial ordering mode (#874) (c94ead9)

Dependencies

  • Limit ibis-framework version to 9.2.0 (#989) (06c1b33)
  • Update to ibis-framework 9.x and newer sqlglot (#827) (89ea44f)

v1.17.0

11 Sep 18:34
d42d674
Compare
Choose a tag to compare

1.17.0 (2024-09-11)

Features

  • Add __version__ alias to bigframes.pandas (#967) (9ce10b4)
  • Add Gemini 1.5 stable models support (#945) (c1cde19)
  • Allow setting table labels in to_gbq (#941) (cccc6ca)
  • Define list accessor for bigframes Series (#946) (8e8279d)
  • Enable read_csv() to process other files (#940) (3b35860)
  • Include the bigframes package version alongside the feedback link in error messages (#936) (7b59b6d)

Bug Fixes

  • Astype Decimal to Int64 conversion. (#957) (27764a6)
  • Make read_gbq_function work for multi-param functions (#947) (c750be6)
  • Support read_gbq_function for axis=1 application (#950) (86e54b1)

Documentation

  • Add docstring returns section to Options (#937) (a2640a2)
  • Update title of pypi notebook example to reflect use of the PyPI public dataset (#952) (cd62e60)

v1.16.0

04 Sep 20:53
6fdb6b1
Compare
Choose a tag to compare

1.16.0 (2024-09-04)

Features

  • Add DataFrame.struct.explode to add struct subfields to a DataFrame (#916) (ad2f75e)
  • Implement bigframes.bigquery.json_extract_array (#910) (575a29e)
  • Recover struct column from exploded Series (#904) (7dd304c)

Bug Fixes

  • Fix issue with iterating on >10gb dataframes (#949) (2b0f0fa)
  • Improve Series.replace for dict input (#907) (4208044)
  • NullIndex in ML model.predict error (#917) (612271d)
  • Struct field non-nullable type issue. (#914) (149d5ff)
  • Unordered mode errors in ml train_test_split (#925) (85d7c21)

Performance Improvements

Dependencies

  • Re-introduce support for numpy 1.24.x (#931) (3d71913)
  • Update minimum support to Pandas 1.5.3 and Pyarrow 10.0.1 (#903) (7ed3962)

Documentation

  • Add Claude3 ML and RemoteFunc notebooks (#930) (cfd16c1)
  • Create sample notebook to manipulate struct and array data (#883) (3031903)
  • Update struct examples. (#953) (d632cd0)
  • Use unstack() from BigQuery DataFrames instead of pandas in the PyPI sample notebook (#890) (d1883cc)

v1.15.0

20 Aug 18:44
e43e0e5
Compare
Choose a tag to compare

1.15.0 (2024-08-20)

Features

  • Add llm.TextEmbeddingGenerator to support new embedding models (#905) (6bc6a41)
  • Add ml.llm.Claude3TextGenerator model (#901) (7050038)

Documentation

  • Add columns for "requires ordering/index" to supported APIs summary (#892) (d2fc51a)
  • Remove duplicate description for kms_key_name (#898) (1053d56)
  • Update embedding model notebooks (#906) (d9b8ef5)

v1.14.0

14 Aug 02:21
ae07274
Compare
Choose a tag to compare

1.14.0 (2024-08-14)

Features

  • Implement bigframes.bigquery.json_extract (#868) (3dbf84b)
  • Implement Series.str.__getitem__ (#897) (e027b7e)

Bug Fixes

  • Fix caching from generating row numbers in partial ordering mode (#872) (52b7786)

Performance Improvements

  • Generate SQL with fewer CTEs (#877) (eb60804)
  • Speed up compilation by reducing redundant type normalization (#896) (e0b11bc)

Documentation

v1.13.0

05 Aug 22:43
5317327
Compare
Choose a tag to compare

1.13.0 (2024-08-05)

Features

  • df.apply(axis=1) to support remote function with mutiple params (#851) (2158818)
  • Allow windowing in 'partial' ordering mode (#861) (ca26fe5)
  • Create a separate OrderingModePartialPreviewWarning for more fine-grained warning filters (#879) (8753bdd)

Bug Fixes

  • Fix issue with invalid sql generated by ml distance functions (#865) (9959fc8)

Documentation

  • Create sample notebook using ordering_mode="partial" (#880) (c415eb9)
  • Update streaming notebook (#875) (e9b0557)

v1.12.0

31 Jul 22:11
8e00fe2
Compare
Choose a tag to compare

1.12.0 (2024-07-31)

Features

  • Add bigframes-mode label to query jobs (#832) (c9eaff0)
  • Add config option to set partial ordering mode (#855) (823c0ce)
  • Add stratify param support to ml.model_selection.train_test_split method (#815) (27f8631)
  • Add streaming.StreamingDataFrame class (#864) (a7d7197)
  • Allow DataFrame.join for self-join on Null index (#860) (e950533)
  • Support remote function cleanup with session.close (#818) (ed06436)
  • Support to_csv/parquet/json to local files/objects (#858) (d0ab9cc)

Bug Fixes

  • Fewer relation joins from df self-operations (#823) (0d24f73)
  • Fix 'sql' property for null index (#844) (1b6a556)
  • Fix unordered mode using ordered path to print frame (#839) (93785cb)
  • Reduce redundant remote_function deployments (#856) (cbf2d42)

Documentation

  • Add partner attribution steps to integrations sample notebook (#835) (d7b333f)
  • Make get_global_session/close_session/reset_session appears in the docs (#847) (01d6bbb)

v1.11.1

09 Jul 01:28
ee2b660
Compare
Choose a tag to compare

1.11.1 (2024-07-08)

Documentation

  • Remove session and connection in llm notebook (#821) (74170da)
  • Remove the experimental flask icon from the public docs (#820) (067ff17)

v1.11.0

01 Jul 19:57
6d947a2
Compare
Choose a tag to compare

1.11.0 (2024-07-01)

Features

  • Add .agg support for size (#792) (87e6018)
  • Add bigframes.bigquery.json_set (#782) (1b613e0)
  • Add bigframes.streaming.to_pubsub method to create continuous query that writes to Pub/Sub (#801) (b47f32d)
  • Add DataFrame.to_arrow to create Arrow Table from DataFrame (#807) (1e3feda)
  • Add PolynomialFeatures support to to_gbq and pipelines (#805) (57d98b9)
  • Add Series.peek to preview data efficiently (#727) (580e1b9)
  • Expose gcf memory param in remote_function (#803) (014765c)
  • More informative error when query plan too complex (#811) (136dc24)

Bug Fixes

  • Include internally required packages in remote_function hash (#799) (4b8fc15)

Documentation

  • Document dtype limitation on row processing remote_function (#800) (487dff6)