Releases · Blosc/python-blosc2

17 Jul 15:41

lshaw8317

v3.6.0

91d023a

Blosc2 v3.6.0

Changes in Blosc2 3.6.0

Expose the oindex C-level functionality in Blosc2 for NDArray.
Implement fancy indexing which closely matches NumPy functionality, using
ndindex library. Includes a fast path for 1D arrays, based on Zarr's implementation.
A major refactoring of slicing for lazy expressions using ndindex. We have also
added support for slices with non-unit steps for reduction expressions, which has introduced
improvements that could be incorporated into other lazy expression machinery in the future.
More complex slicing is now supported.
Minor bug fixes to ensure that Blosc2 indexing does not introduce dummy dimensions when NumPy does not,
and a more comprehensive squeeze function which squeezes specified dimensions.

Assets 3

03 Jul 08:54

FrancescAlted

v3.5.1

432ab1f

Release 3.5.1

Changes from 3.5.0 to 3.5.1

Reduced memory usage when computing slices of lazy expressions.
This is a significant improvement for large arrays (up to 20x less).
Also, we have added a fast path for slices that are small and fit in
memory, which can be up to 20x faster than the previous implementation.
See PR #430.
blosc2.concatenate() has been renamed to blosc2.concat().
This is in line with the Array API.
The old name is still available for backward compatibility, but it will
be removed in a future release.
Improve mode handling for concatenating to disk. See PR #428.
Useful for concatenating arrays that are stored in disk, and allows
specifying the mode to use when concatenating.

Assets 3

24 Jun 15:31

FrancescAlted

v3.5.0

ede0199

Release 3.5.0

Changes from 3.4.0 to 3.5.0

New blosc2.stack() function for stacking multiple arrays along a new axis.
Useful for creating multi-dimensional arrays from multiple 1D arrays.
See PR #427. Thanks to Luke Shaw for the implementation!
Blog: https://www.blosc.org/posts/blosc2-new-concatenate/#stacking-arrays
New blosc2.expand_dims() function for expanding the dimensions of an array.
This is useful for adding a new axis to an array, similar to NumPy's np.expand_dims().
See PR #427. Thanks to Luke Shaw for the implementation!

Assets 3

13 Jun 13:24

lshaw8317

v3.4.0

eaa5be5

v3.4.0

Summary

This release adds significant new functionality in the form of concatenate. We support general concatenation of ndarrays, and offer an optimised path with significant speedups for the case of concatenating arrays with compatible chunk and blockshapes. In addition, there are bug fixes and more functionality for slicing of lazyexprs, and the possibility to jit compile user-defined functions which operate on pandas objects using the blosc2 engine.

What's Changed

Enable slice lazy by @lshaw8317 in #417
Add support for new pandas UDF engine by @datapythonista in #418
Make behaviour of compute consistent for slicing by @lshaw8317 in #419
Update pre-commit hooks by @pre-commit-ci in #422
Concatenate by @FrancescAlted in #423

Full Changelog: v3.3.4...v3.4.0

Contributors

FrancescAlted, datapythonista, and 2 other contributors

Assets 3

22 May 11:04

lshaw8317

v3.3.4

acd5ccf

Blosc2 v3.3.4

This is a bugfix release, with some minor optimizations. We further improved the
correct chaining of string lazy expressions (to allow operands with more
diverse data types). In addition, both indexing and where expressions are now
supported within string lazy expressions. Finally, casting rules have
been improved to be more consistent with NumPy. In summary:

Expand possibilities for chaining string-based lazy expressions to incorporate
data types which do not have shape attribute, e.g. int, float etc.
See #406 and PR #411.
Enable slicing within string-based lazy expressions. See PR #414.
Improved casting for string-based lazy expressions.
Documentation improvements, see PR #410.
Compatibility fixes for working with h5py files.

Assets 3

14 May 16:00

FrancescAlted

v3.3.3

6393397

Release 3.3.3

Changes from 3.3.2 to 3.3.3

Expand possibilities for chaining string-based lazy expressions to include
main operand types (LazyExpr and NDArray). Still have to incorporate other
data types (which do not have shape attribute, e.g. int, float etc.).
See #406.
Fix indexing for lazy expressions, and allow use of None in getitem.
See PR #402.
Fix incorrect appending of dim to computed reductions. See PR #404.
Fix blosc2.linspace() for incompatible num/shape. See PR #408.
Add support for NumPy dtypes that are n-dimensional (e.g.
np.dtype(("<i4,>f4", (10,))),).
New MAX_DIM constant for the maximum number of dimensions supported.
This is useful for checking if a given array is too large to be handled.
More refinements on guessing cache sizes for Linux.
Update to C-Blosc2 2.17.2.dev. Now, we are forcing the flush of modified
pages only in write mode for mmap files. This fixes mmap issues on Windows.
Thanks to @JanSellner for the implementation.

Contributors

JanSellner

Assets 3

01 May 05:39

FrancescAlted

v3.3.2

916307e

Release 3.3.2

Changes from 3.3.1 to 3.3.2

Fixed a bug in the determination of chunk shape for the NDArray constructor.
This was causing problems when creating NDArray instances with a CPU that
was reporting a L3 cache size close (or exceeding) 2 GB. See PR #392.
Fixed a bug preventing the correct chaining of string lazy expressions for
logical operators (&, |, ^...). See PR #391.
More performance optimization for blosc2.permute_dims. Thanks to
Ricardo Sales Piquer (@ricardosp4) for the implementation.
Now, storage defaults (blosc2.storage_dflts) are honored, even if no
storage= param is used in constructors.
We are distributing Python 3.10 wheels now.

Contributors

ricardosp4

Assets 3

20 Apr 12:56

FrancescAlted

v3.3.1

12fd438

Release 3.3.1

Changes from 3.3.0 to 3.3.1

In our effort to better adapt to better adapt to the array API
(https://data-apis.org/array-api/latest/), we have introduced
permute_dims() and matrix_transpose() functions, and the .T property.
This replaces to previous transpose() function, which is now deprecated.
See PR #384. Thanks to Ricardo Sales Piquer (@ricardosp4).
Constructors like arange(), linspace() and fromiter() now
use far less memory when creating large arrays. As an example, a 5 TB
array of 8-byte floats now uses less than 200 MB of memory instead of
170 GB previously. See PR #387.
Now, when opening a lazy expression with blosc2.open(), and there is
a missing operand, the open still works, but the dtype and shape
attributes are None. This is useful for lazy expressions that have
lost some operands, but you still want to open them for inspection.
See PR #385.
Added an example of getting a slice out of a C2Array.

Contributors

ricardosp4

Assets 3

08 Apr 12:03

FrancescAlted

v3.3.0

57b1d87

Release 3.3.0

Changes from 3.2.1 to 3.3.0

New blosc2.transpose() function for transposing 2D NDArray instances
natively. See PR #375 and docs at
https://www.blosc.org/python-blosc2/reference/autofiles/operations_with_arrays/blosc2.transpose.html#blosc2.transpose
See also our new blog about this: https://www.blosc.org/posts/transpose-compressed-matrices/
Thanks to Ricardo Sales Piquer (@ricardosp4) for the implementation.
New fast path for NDArray.slice() for getting slices that are aligned with
underlying chunks. This is a common operation when working with NDArray
instances, and now it is up to 40x faster in our benchmarks (see PR #380).
Returned NDArray object in NDarray.slice() now defaults to original
codec/clevel/filters. The previous behavior was to use the default
codec/clevel/filters. See PR #378. Thanks to Luke Shaw (@lshaw8317).
Several English edits in the documentation. Thanks to Luke Shaw (@lshaw8317)
for his help in this area.

Contributors

lshaw8317 and ricardosp4

Assets 3

26 Mar 17:35

FrancescAlted

v3.2.1

91aba7a

Release 3.2.1

Changes from 3.2.0 to 3.2.1

The array containers are now using the __array_interface__ protocol to
expose the data in the array. This allows for better interoperability with
other libraries that support the __array_interface__ protocol, like NumPy,
CuPy, etc. Now, the range of functions that can be used within the blosc2.jit
decorator is way larger, and essentially all NumPy functions should work now.

See examples at: https://github.com/Blosc/python-blosc2/blob/main/examples/ndarray/jit-numpy-funcs.py
See benchmarks at: https://github.com/Blosc/python-blosc2/blob/main/bench/ndarray/jit-numpy-funcs.py
The performance of constructors like arange(), linspace() and fromiter()
has been improved. Now, they can be up to 3x faster, specially with large
arrays.
C-Blosc2 updated to 2.17.1. This fixes various UB as well as compiler warnings.

Assets 3

Uh oh!

Releases: Blosc/python-blosc2

Blosc2 v3.6.0

Uh oh!

Release 3.5.1

Changes from 3.5.0 to 3.5.1

Uh oh!

Release 3.5.0

Changes from 3.4.0 to 3.5.0

Uh oh!

v3.4.0

Summary

What's Changed

Contributors

Uh oh!

Blosc2 v3.3.4

Uh oh!

Release 3.3.3

Changes from 3.3.2 to 3.3.3

Contributors

Uh oh!

Release 3.3.2

Changes from 3.3.1 to 3.3.2

Contributors

Uh oh!

Release 3.3.1

Changes from 3.3.0 to 3.3.1

Contributors

Uh oh!

Release 3.3.0

Changes from 3.2.1 to 3.3.0

Contributors

Uh oh!

Release 3.2.1

Changes from 3.2.0 to 3.2.1

Uh oh!