Releases: googleapis/python-bigquery-dataframes

v0.14.1

16 Nov 04:26
5ab5059

0.14.1 (2023-11-16)

Bug Fixes

  • Correctly handle null values when initializing fingerprint ordering (#210) (8324f13)

Documentation

  • Add an example notebook about line graphs (#197) (f957b27)

v0.14.0

15 Nov 00:26
e2b7df5

Pre-release

0.14.0 (2023-11-14)

Warning: This release has a known bug that makes row ordering unstable.

Features

  • Add 'cross' join support (#176) (765446a)
  • Add 'index', 'pad', 'nearest' interpolate methods (#162) (6a28403)
  • Add series.sample (identical to existing dataframe.sample) (#187) (37914a4)
  • Add unordered sql compilation (#156) (58f420c)
  • Log most recent API calls as recent-bigframes-api-xx labels on BigQuery jobs (#145) (4ea33b7)
  • Read_gbq creates order deterministically without table copy (#191) (8ab81de)
  • Support date_series.astype("string[pyarrow]") to cast DATE to STRING (#186) (aee0e8e)
  • Support series.at[row_label] = scalar (#173) (0c8bd33)
  • Temporary resources no longer use BigQuery Sessions (#194) (4a02cac)
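
A few of the features above follow pandas semantics, so a minimal sketch in plain pandas can show what a cross join, 'index' interpolation, and Series.sample do. This is a stand-in under that assumption: it runs locally, does not touch BigQuery, and all data and variable names are made up for illustration. Whether BigQuery DataFrames matches pandas in every detail is not confirmed by these notes.

```python
import pandas as pd

# pandas stand-in (assumption): illustrates the semantics only, not bigframes itself.
left = pd.DataFrame({"color": ["red", "blue"]})
right = pd.DataFrame({"size": ["S", "M", "L"]})

# A cross join pairs every left row with every right row (2 x 3 = 6 rows).
pairs = left.merge(right, how="cross")

# 'index' interpolation weights the fill by the index labels, not by position.
# The gap at label 1 sits one tenth of the way from label 0 to label 10.
readings = pd.Series([0.0, None, 10.0], index=[0, 1, 10])
by_index = readings.interpolate(method="index")

# Series.sample draws rows at random; random_state makes the draw repeatable.
picked = readings.sample(n=2, random_state=0)
```

With positional (linear) interpolation the missing value would be the midpoint of its neighbors; with 'index' it follows the spacing of the labels instead.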

Bug Fixes

  • All sort operations are now stable (#195) (3a2761f)
  • Default to 7 days expiration for read_csv, read_json, read_parquet (#193) (03606cd)
  • Deprecate the remote_service_type in llm model (#180) (a8a409a)
  • For reset_index on unnamed multiindex, always use level_[n] label (#182) (f95000d)
  • Match pandas behavior when assigning listlike to empty dfs (#172) (c1d1f42)
  • Use anonymous dataset instead of session dataset for temp tables (#181) (800d44e)
  • Use random table for read_pandas (#192) (741c75e)
  • Use random table when loading data for read_csv, read_json, read_parquet (#175) (9d2e6dc)

Documentation

  • Add code samples for read_gbq_function using community UDFs (#188) (7506eab)
  • Add docstring code samples for Series.apply and DataFrame.map (#185) (c816d84)
  • Add llm kmeans notebook as an included example (#177) (d49ae42)
  • Use head() to get top n results, not to preview results (#190) (87f84c9)

v0.13.0

07 Nov 16:57
8b6b1c6

0.13.0 (2023-11-07)

Features

  • to_gbq without a destination table writes to a temporary table (#158) (e1817c9)
  • Add DataFrame.__iter__, DataFrame.iterrows, DataFrame.itertuples, and DataFrame.keys methods (#164) (c065071)
  • Add Series.__iter__ method (#164) (c065071)
  • Add interpolate() to series and dataframe (#157) (b9cb55c)
  • Support 32k text-generation and multilingual embedding models (#161) (5f0ea37)
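
The iteration methods listed above are named after their pandas counterparts. As a rough illustration, assuming they behave like pandas, the sketch below uses plain pandas with invented data; it does not call BigQuery DataFrames.

```python
import pandas as pd

# pandas stand-in (assumption): shows the iteration protocol the method names refer to.
df = pd.DataFrame({"name": ["a", "b"], "score": [1, 2]})

# Iterating a DataFrame (or calling keys()) yields its column labels.
columns_via_iter = list(df)
columns_via_keys = list(df.keys())

# iterrows yields (index label, row as Series) pairs.
row_labels = [label for label, _row in df.iterrows()]

# itertuples yields one named tuple per row; index=False omits the index field.
rows = [tuple(t) for t in df.itertuples(index=False)]

# Iterating a Series yields its values.
scores = list(df["score"])
```

Row-wise iteration pulls data to the client, so on large tables it is usually better to express the work as column operations where possible.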

Bug Fixes

  • Update default temp table expiration to 7 days (#174) (4ff26cd)

v0.12.0

01 Nov 21:49
bf1ec89

0.12.0 (2023-11-01)

Features

  • Add DataFrame.melt (#113) (4e4409c)
  • Add DataFrame.to_pandas_batches() to download large DataFrame objects (#136) (3afd4a3)
  • Add bigframes.options.compute.maximum_bytes_billed option that sets maximum bytes billed on query jobs (#133) (63c7919)
  • Add pandas.qcut (#104) (8e44518)
  • Add pd.get_dummies (#149) (d8baad5)
  • Add unstack to series, add level param (#115) (5edcd19)
  • Implement operator @ for DataFrame.dot (#139) (79a638e)
  • Populate ibis version in user agent (#140) (c639a36)
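
Several of the additions above (melt, qcut, get_dummies, the @ operator) share names with pandas functions. The following is a small pandas-only sketch of what those operations compute, assuming pandas-like behavior; the data is invented and nothing here runs against BigQuery.

```python
import pandas as pd

# pandas stand-in (assumption): illustrates the operations, not bigframes itself.
wide = pd.DataFrame({"id": [1, 2], "q1": [10, 30], "q2": [20, 40]})

# melt reshapes wide columns into (variable, value) rows.
long = wide.melt(id_vars="id", var_name="quarter", value_name="sales")

# qcut bins values into quantile buckets; labels=False returns bucket numbers.
buckets = pd.qcut(long["sales"], q=2, labels=False)

# get_dummies one-hot encodes a categorical column.
encoded = pd.get_dummies(long["quarter"])

# @ is shorthand for DataFrame.dot (matrix multiplication).
a = pd.DataFrame([[1, 2], [3, 4]])
b = pd.DataFrame([[5, 6], [7, 8]])
product = a @ b
```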

Bug Fixes

  • Don't override the global logging config (#138) (2ddbf74)
  • Fix bug with column names under repeated column assignment (#150) (29032d0)
  • Resolve plotly rendering issue by using ipython html for job pro… (#134) (39df43e)
  • Use indexee's session for loc listlike cases (#152) (27c5725)

Documentation

  • Add arithmetic df sample code (#153) (ac44ccd)
  • Fix indentation on read_gbq_function code sample (#163) (0801d96)
  • Link to ML.EVALUATE BQML page for score() methods (#137) (45c617f)

v0.11.0

26 Oct 02:39
c3b24b5

0.11.0 (2023-10-26)

Features

  • Add back reset_session as an alias for close_session (#124) (694a85a)
  • Change query parameter to query_or_table in read_gbq (#127) (f9bb3c4)

Bug Fixes

  • Expose bigframes.pandas.reset_session as a public API (#128) (b17e1f4)
  • Use series's own session in series.reindex listlike case (#135) (95bff3f)

Documentation

  • Add runnable code samples for DataFrames I/O methods and property (#129) (6fea8ef)
  • Add runnable code samples for reading methods (#125) (a669919)

v0.10.0

19 Oct 15:43
782a215

0.10.0 (2023-10-19)

Features

  • Implement DataFrame.dot for matrix multiplication (#67) (29dd414)

v0.9.0

18 Oct 17:38
e2788a8

0.9.0 (2023-10-18)

⚠ BREAKING CHANGES

  • rename bigframes.pandas.reset_session to close_session (#101)
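
Because the function name changed in this release (and 0.11.0 later restored reset_session as an alias), code that must run across versions could look up whichever name is available. The helper below is a hypothetical sketch, not part of the library: end_session and the fake modules are invented for illustration, and the only assumption about bigframes.pandas is that it exposes one of the two names as a zero-argument callable.

```python
from types import SimpleNamespace


def end_session(bpd_module):
    """Call close_session if the module has it, else fall back to reset_session.

    `bpd_module` is expected to be `bigframes.pandas` (or any object with one
    of those two attributes). This helper is hypothetical, not a library API.
    """
    closer = getattr(bpd_module, "close_session", None)
    if closer is None:
        closer = getattr(bpd_module, "reset_session", None)
    if closer is None:
        raise AttributeError("module has neither close_session nor reset_session")
    return closer()


# Stand-ins for an older and a newer module, so the sketch runs without bigframes.
old_style = SimpleNamespace(reset_session=lambda: "reset")
new_style = SimpleNamespace(close_session=lambda: "closed")

old_result = end_session(old_style)
new_result = end_session(new_style)
```

In real use the call would be end_session(bigframes.pandas) after importing the package. Pinning a minimum version and calling close_session directly is the simpler option when older releases do not need to be supported.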

Features

  • Add bigframes.options.bigquery.application_name for partner attribution (#117) (52d64ff)
  • Add AtIndexer getitems (#107) (752b01f)
  • Rename bigframes.pandas.reset_session to close_session (#101) (36693bf)
  • Send BigQuery cancel request when canceling bigframes process (#103) (e325fbb)
  • Support external packages in remote_function (#98) (ec10c4a)
  • Use ArrowDtype for STRUCT columns in to_pandas (#85) (9238fad)

Bug Fixes

  • Support multiindex for three loc getitem overloads (#113) (68e3cd3)

Performance Improvements

  • If primary keys are defined, read_gbq avoids copying table data (#112) (e6c0cd1)

Documentation

  • Add documentation for Series.struct.field and Series.struct.explode (#114) (a6dab9c)
  • Add open-source link in API doc (#106) (db51fe3)
  • Update ML overview API doc (#105) (1b3f3a5)

v0.8.0

12 Oct 17:40
1ff2755

0.8.0 (2023-10-12)

⚠ BREAKING CHANGES

  • The default behavior of to_parquet is changing from no compression to 'snappy' compression.

Features

  • Support compression in to_parquet (a8c286f)

Bug Fixes

  • Create session dataset for remote functions only when needed (#94) (1d385be)

v0.7.0

11 Oct 17:59
d39134d

0.7.0 (2023-10-11)

Features

  • Add aliases for several series properties (#80) (c0efec8)
  • Add equals methods to series/dataframe (#76) (636a209)
  • Add iat and iloc accessing by tuples of integers (#90) (228aeba)
  • Add level param to DataFrame.stack (#88) (97b8bec)
  • Allow df.drop to take an index object (#68) (740c451)
  • Use default session connection (#87) (4ae4ef9)

Bug Fixes

Documentation

  • Add more preprocessing models into the docs menu. (#97) (1592315)

v0.6.0

04 Oct 15:48
bd692d8

0.6.0 (2023-10-04)

Features

  • Add df.unstack (#63) (4a84714)
  • Add idxmin, idxmax to series, dataframe (#74) (781307e)
  • Add ml.preprocessing.KBinsDiscretizer (#81) (24c6256)
  • Add multi-column dataframe merge (#73) (c9fa85c)
  • Add update and align methods to dataframe (#57) (bf050cf)
  • Support STRUCT data type with Series.struct.field to extract child fields (#71) (17afac9)

Bug Fixes

  • Avoid 403 response too large to return error with read_gbq and large query results (#77) (8f3b5b2)
  • Change return type of Series.loc[scalar] (#40) (fff3d45)
  • Fix df/series.iloc by list with multiindex (#79) (971d091)