Use Grisu2 algorithm in `String::num_scientific` to fix serializing by aaronfranke · Pull Request #98750 · godotengine/godot

aaronfranke · 2024-11-02T04:11:06Z

Supersedes PR #96676, PR #86951, and fixes #78204, fixes #99103, fixes #99763, see also #100414.

This PR replaces the algorithm in String::num_scientific with Grisu2 to serialize numbers with more precision. The implementation was copied from simdjson here: https://github.com/simdjson/simdjson/blob/master/src/to_chars.cpp and adjusted slightly to match the existing behavior of String::num_scientific.

What: Grisu2 is an algorithm for serializing floats in scientific notation with enough precision to ensure they can be read back exactly, while also using the minimum number of digits, which keeps the output compact and human-readable. It uses integer operations and a table of pre-computed powers of ten, so it is extremely fast.
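To make the "read back exactly with as few digits as possible" property concrete, here is a minimal sketch. It is not Grisu2 and not the code in this PR: it is a brute-force stand-in that assumes a correctly rounding `snprintf`/`strtod` and simply tries more digits until the text parses back to the same double. The helper name `shortest_round_trip` is hypothetical.

```cpp
#include <cstdio>
#include <cstdlib>
#include <string>

// Brute-force stand-in for a shortest round-trip printer (NOT Grisu2).
// Tries 1..17 significant digits and stops at the first string that
// parses back to exactly the same double. Intended for finite values.
std::string shortest_round_trip(double value) {
    char buf[64];
    for (int digits = 1; digits <= 17; ++digits) {
        std::snprintf(buf, sizeof(buf), "%.*g", digits, value);
        if (std::strtod(buf, nullptr) == value) {
            break; // buf already holds the shortest exact representation.
        }
    }
    return std::string(buf);
}
```

Grisu2 aims for the same kind of output without the repeated format-and-parse loop, which is where the speed advantage described above comes from.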

Why: We need to serialize with more precision to ensure that a serialized number can be deserialized into the same number. For example, for the number 123456789, the closest 32-bit float is 123456792. In master this is serialized as 1.23457e8, which becomes 123457000, over 200 off from the closest 32-bit float. With this PR, a 32-bit float is serialized with 9 digits as 123456790, which is read back as exactly 123456792. 32-bit floats have 6 reliable digits, but up to 9 are needed to serialize to decimal in order to read back with full precision.
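The 6-digit versus 9-digit behavior can be checked with a small sketch. This uses the C library's `snprintf` and `strtof` rather than Godot's `String::num_scientific`, and the helper name `reparse_with_digits` is hypothetical, so treat it as an illustration of the arithmetic rather than of the engine code.

```cpp
#include <cstdio>
#include <cstdlib>

// Format a 32-bit float with the given number of significant digits,
// parse the text back, and return the recovered float.
float reparse_with_digits(float value, int digits) {
    char buf[64];
    // Floats are promoted to double when passed to printf-style functions.
    std::snprintf(buf, sizeof(buf), "%.*g", digits, static_cast<double>(value));
    return std::strtof(buf, nullptr);
}
```

For `123456789.0f` (stored as 123456792), `reparse_with_digits(f, 6)` yields 123457000, while `reparse_with_digits(f, 9)` returns the original value.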

For an example with 64-bit floats, I have included 1.234567898765432123456789e30 in the test cases. The closest 64-bit float is 1.23456789876543218850569440461e30 (it first differs at the 8, which used to be a 2). This gets serialized as 1.2345678987654322e+30, which is deserialized to exactly 1.23456789876543218850569440461e30. 64-bit floats have 15 reliable digits, but up to 17 are needed to serialize to decimal in order to read back with full precision.

Note that the code in Variant writer for Vector2/Vector3/etc has been adjusted to work with both 32-bit and 64-bit floats, so it will correctly serialize the numbers for builds with either precision level.

Note that the docs have special code that always uses the 32-bit version, since we don't need high precision in the docs.

Note that I kept the existing behavior where num_scientific does not have a trailing .0, but the code I grabbed from simdjson included that, so I removed it. It would be easy to add that back in. However, I also separately re-added the trailing .0 for the documentation to ensure the docs are generated with .0 like before.

fire · 2024-11-02T04:17:43Z

Is it worth modifying json to native and json from native?

aaronfranke · 2024-11-02T05:17:49Z

@fire What do you mean?

fire · 2024-11-04T22:35:57Z

I was curious why you renamed rtos_fixed to serialize_real. Replacing methods creates a lot of patch churn.

aaronfranke · 2024-11-05T00:37:02Z

@fire I can undo the name change if it's not desired, but I think this is a clearer name.

fire · 2024-11-05T01:25:02Z

I have no opinion on the name change. It's not that important.

bruvzg

I'm definitely in favor of using the same code for float serialization/print, and the implementation looks good. sprintf is too implementation dependent and unreliable.

> I was curious why you renamed rtos_fixed to serialize_real.

It's an internal method, so it doesn't matter. But I like serialize_real more.

clayjohn · 2024-11-06T22:23:09Z

> I'm definitely in favor of using the same code for float serialization/print, and the implementation looks good. sprintf is too implementation dependent and unreliable.

Should we try to remove sprintf from other places? Notably, we still use it in String::num, and it causes similar problems there.

arkology · 2024-11-07T06:06:16Z

Does this PR solve this issue?
UPD: And maybe this?

aaronfranke · 2024-11-07T06:28:30Z

@arkology This PR only affects String::num_scientific; it does not change places that currently use non-scientific numbers. However, now that this function is better, it opens the opportunity to use it in more places in future PRs.

nikitalita · 2024-11-12T14:10:23Z

This also fixes #99103

clayjohn · 2025-05-14T16:04:50Z

Discussed in the core meeting today. Attendees were generally happy with the implementation and agree that more accurate serialization is worth pursuing. Overall, we are happy to move forward with this PR.

Before merging, it would be good to have @akien-mga's opinion on the third party code and the diff noise.

akien-mga

Looks good to me overall.

A bit concerned about the impact this may have on user projects when it comes to git noise on upgrade. We'll want to make sure users get a suggestion to run the tool we have to re-save all scenes to apply those changes all at once. (CC @KoBeWi )

Repiteo · 2025-05-22T17:23:17Z

Thanks!

KoBeWi · 2025-05-22T18:12:30Z

> We'll want to make sure users get a suggestion to run the tool we have to re-save all scenes to apply those changes all at once.

We could recommend it in the migration guide (I think there is one?).

cridenour · 2025-07-14T23:06:14Z

Since this change, we're getting hundreds of diffs in the AABB of our ArrayMesh resources, without a re-import. Likely a @tool script is coming across them and they are re-saved, but with new "precision".

clayjohn · 2025-07-14T23:17:59Z

> Since this change, we're getting hundreds of diffs in the AABB of our ArrayMesh resources, without a re-import. Likely a @tool script is coming across them and they are re-saved, but with new "precision".

Are you getting the diffs only once, or every time that you save?

We expect that you would get a diff in many scenes the first time that you save after updating. But you should only get the diff once.

cridenour · 2025-07-14T23:35:13Z

> Are you getting the diffs only once, or every time that you save?

Multiple times, but looking at the commit log, it looks like I'm "fighting" with other people on the project as some of these digits go back and forth between us.

Ivorforce · 2025-07-14T23:41:54Z

> > Are you getting the diffs only once, or every time that you save?
>
> Multiple times, but looking at the commit log, it looks like I'm "fighting" with other people on the project as some of these digits go back and forth between us.

Are you both on the same version of Godot? Are you using different systems?

cridenour · 2025-07-14T23:52:29Z

Same version of Godot (custom build) - Windows 10 and Windows 11. Surprisingly my Windows 10 and MacOS builds come up with the same number.

Is it possible this is only happening (sometimes) for AABB because they are getting recomputed as part of the load?

clayjohn · 2025-07-15T00:36:55Z

> Same version of Godot (custom build) - Windows 10 and Windows 11. Surprisingly my Windows 10 and MacOS builds come up with the same number.
>
> Is it possible this is only happening (sometimes) for AABB because they are getting recomputed as part of the load?

It's hard to say. The Grisu2 algorithm is very stable and shouldn't be introducing any floating point issues. But it is possible that the increased precision is exposing some existing floating point precision/determinism issues in your game.

cridenour · 2025-09-17T13:48:38Z

core/variant/variant_parser.cpp

-				}
+			const double value = p_variant.operator double();
+			String s;
+			// Hack to avoid garbage digits when the underlying float is 32-bit.


So I found that this "Hack" is also necessary on all the Vectors, AABBs, etc. when compiling with precision=double. That or the current grisu2 implementation isn't fully stable for double precision.

aaronfranke · 2025-09-17

This hack works by checking whether the value is equal to the closest 32-bit float value. If you are seeing places where this is needed for vectors with precision=double, then the underlying value isn't a double somewhere; it's not a bug in Grisu2 itself. We could look for those places and try to fix them, but it's also possible that the underlying value you're encountering is intended to be a 32-bit float, in which case we would indeed need to apply this hack there if we want to avoid unnecessary digits being written to scene files and version control.
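The check described in this comment can be sketched as follows. This is an illustration under assumptions, not the code in core/variant/variant_parser.cpp: the names `is_exactly_float` and `serialize_real_sketch` are hypothetical, and the digit search uses `snprintf` in place of Grisu2.

```cpp
#include <cmath>
#include <cstdio>
#include <cstdlib>
#include <limits>
#include <string>

// Hypothetical helper: true when the double holds exactly a 32-bit float value.
bool is_exactly_float(double value) {
    // Guard the range first so the narrowing cast is always well defined.
    if (!(std::fabs(value) <= std::numeric_limits<float>::max())) {
        return false;
    }
    return static_cast<double>(static_cast<float>(value)) == value;
}

// Hypothetical serializer: use float-level precision when the value is
// really a float, so no extra "garbage" digits are written.
std::string serialize_real_sketch(double value) {
    char buf[64];
    if (is_exactly_float(value)) {
        const float as_float = static_cast<float>(value);
        for (int digits = 1; digits <= 9; ++digits) {
            std::snprintf(buf, sizeof(buf), "%.*g", digits, value);
            if (std::strtof(buf, nullptr) == as_float) {
                break; // Shortest text that reads back as the same float.
            }
        }
    } else {
        for (int digits = 1; digits <= 17; ++digits) {
            std::snprintf(buf, sizeof(buf), "%.*g", digits, value);
            if (std::strtod(buf, nullptr) == value) {
                break; // Shortest text that reads back as the same double.
            }
        }
    }
    return std::string(buf);
}
```

With this sketch, a value such as `static_cast<double>(0.1f)` is written as `0.1`. Serializing the same value at full double precision instead gives 17 digits, which is the kind of unnecessary output the hack is meant to avoid.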

cridenour · 2025-09-17

There are two places where I see constant git changes. The first is when saving a mesh to a file on import: the AABB changes between computers, even without a re-import. I haven't been able to follow that code path, but it's possible that the value is getting downcast at some point.

The second place, surprisingly, is transforms in a saved scene. Many of the node transforms will sometimes change in the scene file despite the nodes not being moved. Usually it's just one digit of change.

Given we use double precision for runtime physics and rendering and our individual scenes and meshes don't need the full precision, I'll likely just always cast down to single precision (when it matches) in our variant writer.

aaronfranke · 2025-09-17

I opened this PR to apply this to all float serialization: #110616

aaronfranke requested review from a team as code owners November 2, 2024 04:11

aaronfranke added this to the 4.4 milestone Nov 2, 2024

aaronfranke added bug topic:core labels Nov 2, 2024

This was referenced Nov 2, 2024

var_to_str rounds floats, losing massive precision in the process #78204

Closed

Add digits argument to String::num_scientific and fix serializing #96676

Closed

aaronfranke mentioned this pull request Nov 2, 2024

Prevent String::num_scientific from giving different precision levels depending on compiler #86951

Closed

bruvzg reviewed Nov 6, 2024

core/string/ustring.h

thirdparty/README.md

thirdparty/grisu2/godot.patch

aaronfranke force-pushed the grisu branch 2 times, most recently from dc17bc5 to 850a082 Compare November 6, 2024 12:10

clayjohn mentioned this pull request Nov 12, 2024

Loss of float precision when using save as on a text resource or scene #99103

Closed

akien-mga changed the title ~~Use Grisu2 algorithm in String::num_scientific to fix serializing~~ Use Grisu2 algorithm in String::num_scientific to fix serializing Nov 13, 2024

aaronfranke force-pushed the grisu branch from 850a082 to 03153ce Compare November 14, 2024 09:57

aaronfranke force-pushed the grisu branch from ea89974 to 35e4b63 Compare May 14, 2025 15:28

akien-mga self-requested a review May 14, 2025 18:52

aaronfranke force-pushed the grisu branch from 35e4b63 to 5281868 Compare May 21, 2025 10:47

Repiteo modified the milestones: 4.x, 4.5 May 22, 2025

akien-mga approved these changes May 22, 2025

COPYRIGHT.txt

thirdparty/grisu2/patches/godot.patch

Use Grisu2 algorithm in String::num_scientific to fix serializing

15de1d6

aaronfranke force-pushed the grisu branch from c555c9a to 15de1d6 Compare May 22, 2025 16:13

Repiteo merged commit 6258a3e into godotengine:master May 22, 2025
20 checks passed

aaronfranke deleted the grisu branch May 22, 2025 17:29

ydeltastar mentioned this pull request May 31, 2025

Using Resource to save values greater than 1000000 may result in errors #106989

Closed

eigenviolet mentioned this pull request Jun 3, 2025

Unnecessary floating point digits being serialized in some (but not all) cases #107095

Closed

Mickeon mentioned this pull request Jun 6, 2025

Fix Color precision error in the documentation generated on M4 macOS. #104112

Merged

aaronfranke mentioned this pull request Jul 9, 2025

Specify Apache license version for Grisu2 #108400

Merged

Ivorforce mentioned this pull request Jul 15, 2025

Use double consistently in Range::get_as_ratio. #108638

Merged

aaronfranke mentioned this pull request Jul 21, 2025

Use num_scientific (Grisu2) when stringifying JSON with full precision #108836

Merged

beicause mentioned this pull request Aug 22, 2025

Serialized decimal differs when set by script and editor slider #109852

Closed

Calinou mentioned this pull request Aug 22, 2025

exported float is rounded down #109838

Open

aaronfranke mentioned this pull request Aug 23, 2025

Remove nearly-unused "default" range hint min/max #109884

Merged

cridenour reviewed Sep 17, 2025

aaronfranke mentioned this pull request Sep 17, 2025

Apply rtos_fix hack for handling 32-bit floats on all calls to rtos_fix #110616

Merged


14 participants
