Revival of JEP 198 (Light-Weight JSON API)?
Remi Forax
forax at univ-mlv.fr
Thu Apr 23 20:18:09 UTC 2020
----- Mail original -----
> De: "cay horstmann" <cay.horstmann at gmail.com>
> À: "discuss" <discuss at openjdk.java.net>
> Envoyé: Jeudi 23 Avril 2020 20:53:02
> Objet: Re: Revival of JEP 198 (Light-Weight JSON API)?
> A JavaScript number is a double-precision floating-point number. JSON
> cannot currently serialize BigInt.
>
> https://www.ecma-international.org/ecma-262/#sec-json.stringify
> https://www.ecma-international.org/ecma-262/#sec-terms-and-definitions-number-value
The problem is that at least Jackson can use BigInteger flawlessly, so it only works if you have Java at both ends or don't use JSON.stringify() but this is a use case that exist.
Said differently, the ECMA-404 which defines JSON is not a proper subset of ECMA-262 (3rd edition).
>
> Cheers,
>
> Cay
regards,
Rémi
>
> Il 23/04/20 11:05, forax at univ-mlv.fr ha scritto:
>> ----- Mail original -----
>>> De: "John Rose" <john.r.rose at oracle.com>
>>> À: "Brian Goetz" <brian.goetz at oracle.com>
>>> Cc: "Remi Forax" <forax at univ-mlv.fr>, "discuss" <discuss at openjdk.java.net>
>>> Envoyé: Samedi 18 Avril 2020 09:53:58
>>> Objet: Re: Revival of JEP 198 (Light-Weight JSON API)?
>>
>>> Representing a JSON tree as a tree of Valhalla values (yes, they
>>> can come in trees!) is very much worth investigating IMO. The nearest
>>> approximation before Valhalla would be to design a JSON node model
>>> which consists only of (or mainly of) value-based classes. Or, perhaps
>>> Valhalla-style cursors (iterators which say x=x.next() instead of x.next())
>>> can express a walk over a mutable tree, whose structure is hidden (and
>>> thus can be materialized lazily or streamily). The nearest approximation
>>> to that would be an iterator-style API, where one object updates itself
>>> in place.
>>>
>>> One other thought: Sometimes that parsing and unparsing logic
>>> for a syntax like JSON or XML can be expressed as operating on a
>>> linear event-stream, instead of a nested tree model. You can build
>>> the latter on top of the former, and event streams have a simpler
>>> object structure than tree-like models. It might be safer to start
>>> with a parser/unparser API that runs on event streams, and then
>>> build node models (with cursors, pattern matching, etc.) on top
>>> later when the tools for those appear. …Not that I’m longing
>>> to program an event-based API, but it’s better than nothing, and
>>> not lost effort either.
>>
>> It's what the usual stream APIs do, at least what jackson and the json api from
>> Java EE does.
>> The stream is just a tokenization of the text format
>> [1, 3, { "foo": null } ] -> START_ARRAY NUMBER NUMBER START_OBJECT FIELD_NAME
>> NULL END_OBJECT END_ARRAY
>>
>> From that you can construct an API like ASM when you call a method visit for
>> each value token (NUMBER, STRING, NULL, TRUE, etc) and a method visit that ask
>> for a sub visitor for START_ARRAY and START_OBJECT.
>>
>> The next step, providing a tree/node API is harder and i think stupid mostly
>> because JSON NUMBER is not typed so you have no idea if you should represent a
>> number (as an int, a long, a float, a double, a BigInteger or a DigDecimal*,
>> ??). You can introduce your own JsonNumber has an inline type but you will
>> incompatible with any existing Java values. It's better to have a type
>> representation that provide the type the user want for a number.
>>
>> That why most APIs do not provide a generic tree API but a data binding API that
>> takes a tree of types (a bean/record/interface tree) and a stream of JSON
>> tokens and construct the corresponding Java representation (creating instances
>> Bean/Record/Proxy). This tree of type can also be seen as a tokenization of the
>> type informations (START_OBJECT_TYPE, START_ARRAY_TYPE, NUMBER_TYPE, etc).
>> Annotations on the tree of types are used to derive an adapter (or several with
>> the concept of jackson views) to the visitor that will filter/map the field
>> name and the value before creating the instances.
>>
>> So IMO, a tree of instance of Valhalla inline types, a tree of nodes, is not
>> useful but inline types can still be useful to simplify the visitor interface
>> (instead of visitNumberInt/visitNumberLong/visitNumberDouble, etc, have a sole
>> visitNumber(JsonNumber) with the inline type only existing on the stack to
>> transfer the information from the reader to the deserializer (and vice-versa).
>>
>> regards,
>> Rémi
>> * ECMA 404 doesn't define a size for the numbers !
More information about the discuss
mailing list