fix: removes string type for large numbers as it's the wrong level of abstraction #4588

baywet · 2025-05-15T17:18:22Z

… abstraction Signed-off-by: Vincent Biret <[email protected]>

handrews

JSON does not provide for reliably transmitting integers outside of the range [-(253)+1, (253)-1], and does not provide any guarantee at all for reliably transmitting fixed point decimal. All of these need to continue to allow strings, otherwise interoperability is impossible.

baywet · 2025-05-15T17:42:40Z

@karenetheridge let me know if that rationale is enough for you and I can close this PR

karenetheridge · 2025-05-15T19:58:14Z

@handrews I don't see anything in https://ecma-international.org/wp-content/uploads/ECMA-404_2nd_edition_december_2017.pdf about restrictions for valid number ranges.

JSON itself is just text. A number can be of any length. How a particular architecture chooses to represent that internally is up to it. Obviously a "native" number has size limitations, and different languages have different options for dealing with that. e.g. Rust has the BigInt type which can represent numbers larger than its native int32, int64 etc. An implementation can choose to represent its numbers internally however it likes; if it encodes to JSON that encoder would need to know how to handle any custom types, but this is no different from needing to understand how to encode/decode objects from a custom representation if there isn't a native object type in that language, etc.

Remember that the document model we're using is at a layer of abstraction above the implementation itself. As long as the implementation is consistent and clear about how it represents the data types, and can encode/decode to JSON (or other interoperable data formats like YAML or CSV) consistently, then from the OpenAPI/JSON Schema perspective we're all good.

karenetheridge

thanks @baywet!

handrews · 2025-05-15T21:01:37Z

@karenetheridge the interoperability limits are documented in RFC8259 §6 (emphasis added):

This specification allows implementations to set limits on the range and precision of numbers accepted. Since software that implements IEEE 754 binary64 (double precision) numbers [IEEE754] is generally available and widely used, good interoperability can be achieved by implementations that expect no more precision or range than these provide, in the sense that implementations will approximate JSON numbers within the expected precision. A JSON number such as 1E400 or 3.141592653589793238462643383279 may indicate potential interoperability problems, since it suggests that the software that created it expects receiving software to have greater capabilities for numeric magnitude and precision than is widely available.

Note that when such software is used, numbers that are integers and are in the range [-(2**53)+1, (2**53)-1] are interoperable in the sense that implementations will agree exactly on their numeric values.

handrews · 2025-05-15T21:03:42Z

I'd also like to point out that we've been telling people to encode numbers as strings for better interoperability for at least as long as I've been around the project, and yanking that now seems likely to be confusing and frustrating for anyone who followed our advice. Which is advice you can find around the internet, particularly in the context of financial work where numeric interpretation interoperability is of critical importance.

ralfhandl

JavaScript-based implementations parse JSON numbers into binary64 numbers in memory, which cannot represent all values of int64 or decimal128 numbers.

Which is why these "long" numbers are represented as JSON strings.

This fact is documented here for the corresponding formats.

I would be fine with removing "number" as a "base type" because interoperable senders will always use strings on the wire.

Removing "string" would be confusing for consumers of OpenAPI descriptions because they will see these formats used as for example

monetaryAmount:
  type: string
  format: decimal

ralfhandl · 2025-05-16T06:45:58Z

Maybe we should rename the field/column "JSON Data Type" to "JSON representation".

As @karenetheridge pointed out JSON is a textual format for representing and exchanging data, and accepted practice for representing large numbers in JSON without loss of precision is to use JSON strings.

baywet · 2025-05-20T16:33:27Z

@karenetheridge based on #4585 (comment) can I close this PR then? or do you have changes to suggest?

karenetheridge · 2025-05-20T16:40:23Z

After a good conversation with @hudlow I have realized I was wrong here (in my comments in #4585).

To summarize, some languages like javascript have difficulty parsing JSON with very large integers into numbers (it would seem that there is no JSON decoder that knows how to use and parse values into custom types to represent these values). So for javascript users, the normal way they'd handle this is to represent the number as a string, and then parse it into an integer on the application side. And then we'd need something in the schema itself to indicate to the application that this value is intended to be a number -- and the obvious way of doing that is with the 'format' keyword, which could be used as either just an annotation, or an assertion as well.

So, we would want the format to allow strings for these values. That doesn't mean that the numeric keywords (maximum, minimum etc) should interpret these values -- to JSON Schema it's still just a string, but at the application level it should be interpreted as a number.

Conclusion: allowing these formats to be type: [string, number] or type: [string, integer] is acceptable.

I'd also suggest that all precision-based number formats be allowed to be strings (e.g. int64), in order to accomodate architectures with smaller integer sizes (e.g. 32 bit architectures that would have difficulty with an int64).

tldr: I think this PR can be closed without merging, and perhaps we need an article on the learn site talking about handling numbers with some recommendations for various languages?

handrews · 2025-05-20T18:00:36Z

@karenetheridge agreed on allowing all to be strings. This is important when working with serialization formats (XML and urlencoded being the most obvious) where everything is serialized as a string anyway. Also, I just prefer the consistency.

BTW, the correct JSON base type(s) are number, string, as integer is not a JSON type.

fix: removes string type for large numbers as it's the wrong level of…

1d9b0b6

… abstraction Signed-off-by: Vincent Biret <[email protected]>

baywet mentioned this pull request May 15, 2025

feat: adds uint16, 32 and 64 formats #4585

Merged

handrews requested changes May 15, 2025

View reviewed changes

karenetheridge approved these changes May 15, 2025

View reviewed changes

ralfhandl requested changes May 16, 2025

View reviewed changes

baywet closed this May 20, 2025

baywet deleted the fix/string-type branch May 20, 2025 17:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: removes string type for large numbers as it's the wrong level of abstraction #4588

fix: removes string type for large numbers as it's the wrong level of abstraction #4588

baywet commented May 15, 2025

handrews left a comment

baywet commented May 15, 2025

karenetheridge commented May 15, 2025

karenetheridge left a comment

handrews commented May 15, 2025 •

edited

Loading

handrews commented May 15, 2025

ralfhandl left a comment

ralfhandl commented May 16, 2025

baywet commented May 20, 2025

karenetheridge commented May 20, 2025 •

edited

Loading

handrews commented May 20, 2025

fix: removes string type for large numbers as it's the wrong level of abstraction #4588

fix: removes string type for large numbers as it's the wrong level of abstraction #4588

Conversation

baywet commented May 15, 2025

handrews left a comment

Choose a reason for hiding this comment

baywet commented May 15, 2025

karenetheridge commented May 15, 2025

karenetheridge left a comment

Choose a reason for hiding this comment

handrews commented May 15, 2025 • edited Loading

handrews commented May 15, 2025

ralfhandl left a comment

Choose a reason for hiding this comment

ralfhandl commented May 16, 2025

baywet commented May 20, 2025

karenetheridge commented May 20, 2025 • edited Loading

handrews commented May 20, 2025

handrews commented May 15, 2025 •

edited

Loading

karenetheridge commented May 20, 2025 •

edited

Loading