-
Notifications
You must be signed in to change notification settings - Fork 167
Description
Please describe your wishes and possible alternatives to achieve the desired result.
Hi,
this issue solely serves as a discussion basis for now because I think that scipy sparse matrices and pretty much any other implementation that I know (correct me if I'm wrong !!!) default to 0 as the implicit sparse value.
For ehrapy it would be very useful to also support NaN as the default value for sparse matrices. 0s have a meaning in EHR data. So do NaNs, but this is a much harder problem to solve and is up to the data collectors (we should eventually differentiate between informed and uninformed NaNs - but this is not relevant here).
Adding support for this would probably be a monumental effort that would require adding support in Scipy sparse arrays and adapting implementations in scanpy. Before doing anything, I'd like to hear what people think.
Discussed in the past with @ivirshup at the Theislab retreat.