It's a shame they couldn't use yaml, instead. I compared them and yaml uses abou...

AdrienBrault · on June 14, 2023

I think YAML actually uses more tokens than JSON without indents, especially with deep data. For example "," being a single token makes JSON quite compact.

You can compare JSON and YAML on https://platform.openai.com/tokenizer

IshKebab · on June 14, 2023

I would imagine JSON is easier for an LLM to understand (and for humans!) because it doesn't rely on indentation and confusing syntax for lists, strings, etc.

nasir · on June 14, 2023

It's a lot more straightforward to use JSON programmatically than YAML.

golergka · on June 15, 2023

If you are using any kind of type checking instead of blindly trusting generated JSON, it's exactly the same amount of work.
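A minimal sketch of what that checking might look like in Python. The `parse_point` name and the `{"x": ..., "y": ...}` payload shape are made up for illustration. Only the `json.loads` line is format-specific; the checks run on plain dicts and would look the same after a YAML parser.

```python
import json

def parse_point(text):
    """Parse and check a hypothetical {"x": number, "y": number} payload."""
    payload = json.loads(text)  # the only format-specific line
    if not isinstance(payload, dict):
        raise ValueError("expected a JSON object")
    for field in ("x", "y"):
        value = payload.get(field)
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise ValueError(f"field {field!r} must be a number")
    return payload["x"], payload["y"]
```

Malformed input also ends up as a `ValueError` here, since `json.JSONDecodeError` is a subclass of it.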

TeMPOraL · on June 14, 2023

It really shouldn't be, though. I.e. not unless you're parsing or emitting it ad-hoc, for example by assuming that an expression like:

  "{" + $someKey + ":" + $someValue + "}"

produces valid JSON. It does - sometimes - and then it's indeed easier to work with. It'll also blow up in your face. Using JSON the right way - via a proper parser and serializer - should be identical to using YAML or any other equivalent format.
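To make the failure mode concrete, here is a rough Python sketch. The function names and sample values are invented for illustration. The concatenation version mirrors the expression above and only works when both pieces happen to be JSON-encoded already; the serializer takes care of quoting and escaping.

```python
import json

def concat_json(key, value):
    # Ad-hoc emission, mirroring the string-concatenation expression.
    return "{" + key + ":" + value + "}"

def serialize_json(key, value):
    # Proper serializer: quoting and escaping are handled by the library.
    return json.dumps({key: value})

def is_valid_json(text):
    # True if the text parses as JSON, False otherwise.
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False

# Works, but only because both inputs were pre-quoted by hand.
ok = concat_json('"name"', '"Ada"')

# Breaks as soon as the value contains an unescaped double quote.
broken = concat_json('"quote"', '"she said "hi""')

# The serializer escapes the inner quotes, so the result round-trips.
safe = serialize_json("quote", 'she said "hi"')
```

Here `is_valid_json(ok)` and `is_valid_json(safe)` hold, while `is_valid_json(broken)` does not.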

riwsky · on June 15, 2023

Even if the APIs for both were equally simple, modules for manipulating JSON are way more likely to be available in the stdlib of whatever language you’re using.

blamy · on June 15, 2023

JSON can be minified.
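For instance, with Python's standard `json` module (the sample data is made up), passing compact separators drops all optional whitespace. This only shows the character count shrinking; how many tokens it saves depends on the tokenizer.

```python
import json

# Hypothetical sample data, just to have something nested.
data = {"users": [{"id": 1, "tags": ["a", "b"]}, {"id": 2, "tags": []}]}

# Human-readable form with newlines and indentation.
pretty = json.dumps(data, indent=2)

# Minified form: no spaces after "," or ":".
minified = json.dumps(data, separators=(",", ":"))

print(len(pretty), len(minified))
print(minified)
```

Both strings parse back to the same data, so minifying loses nothing but whitespace.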