DEPR: float_precision in read_csv

We have "high" (the default), "legacy", and "round_trip".  When it was introduced in #8044, using the "high" precision apparently came with a performance penalty, but that changed by 2017 (https://github.com/pandas-dev/pandas/issues/17154#issuecomment-319917647) so the default was changed from "legacy" to "high" in #36228.  I can't think of any reason why anyone would use "legacy".

I'm not aware of anyone who uses this parameter at all.  Let's deprecate it and simplify the code+API.

<b>Update</b>: I patched the code to always use precision="high" to see whether it broke any tests.  Aside from a test specifically asserting that "legacy" is _inaccurate_, the only test that broke was `test_precise_conversion` (4 cases out of 42) where we parse `1.700000000000000177635684` to `1.7`.  I'm fine with this level of rounding (though I think using `fast_float` might improve it to `1.7000000000000002` which is what pyarrow gives).  I'd also be OK with saying "round_trip" level precision is only for the python engine (though that engine also gives 1.7)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DEPR: float_precision in read_csv #64395

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

DEPR: float_precision in read_csv #64395

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions