Use re-spirv in the Vulkan driver to optimize shaders. by DarioSamo · Pull Request #111452 · godotengine/godot

DarioSamo · 2025-10-09T13:41:30Z

Disclaimer

re-spirv is a project I started with another contributor on our own with the express purpose of doing shader optimization in SPIR-V in real time environments. The project uses the MIT License and should be compatible with Godot's licensing. This PR adds a new dependency to the third party folder.

Background

Since the ubershaders PR was merged, Godot has been making use of specialization constants to heavily eliminate parts of code that are unused by materials depending on the environment's configuration or proximity to nodes such as lights or reflection probes. This is specially important in the mobile renderer, which will generate specializations based on the amount of lights close to it and compile them on the fly on the background while relying on the ubershader to display it while it's getting ready.

Specialization constants are used as the main flags for eliminating unused code, but not all drivers will prioritize eliminating code based on them first. re-spirv was developed with the explicit goal of prioritizing DCE based on eliminating code branches that can be determined to be dead once the values for the constants are known. As measured in that project and in Godot itself, it is possible to get significant reductions in pipeline creation time by eliminating code from the SPIR-V instead on the application side at the cost of some very small extra processing time.

Why not apply spirv-opt instead?

We should eventually add shader optimization to Godot using spirv-opt. There's even a PR here which adds it and gives substantial benefits to shader sizes and pipeline creation times too. spirv-opt even features options to freeze specialization constants to certain values and run optimization passes.

However, the performance leaves a lot to be desired and is basically not applicable for real-time cases. While re-spirv is unlikely to ever feature the great size reductions that spirv-opt can achieve, the size reduction that is possible is very much worth it for the very little extra processing time it adds to the pipeline creation.

At the time being, I think spirv-opt will be best suited for optimizing shaders when using the new shader baker option, which can afford to take more processing time during a step where the user does not expect real-time performance.

For purposes of comparison to spirv-opt, re-spirv currently only features the following optimizations:

Specialization Constant Patching
Constant Evaluation & Branch Pruning
Dead Code Elimination
Single Store & Load Elimination
Single Block Store & Load Elimination

Procedure

We have multiple possible places to apply SPIR-V optimization, and for the purposes of this PR, it has been moved to live solely inside the Vulkan driver. Some benefits have been found by running re-spirv for non-specialized shaders because they're currently compiled without optimizations, but I believe it would be best suited to rely on spirv-opt at some point in the future to optimize these shaders and leave re-spirv solely for specialized shaders.

The implementation of re-spirv in the driver is very simple: a parsing step is performed during shader creation to generate the analysis DAG that will be used during optimization. The results of this step are reusable and the cost only has to be paid once per shader and isn't paid again for each specialization that is created.

Before a pipeline is created, the optimizer is called with the known values of the specialization constants. The resulting SPIR-V is passed to the driver to create the pipeline. This means a new unique SPIR-V is generated on the fly and there's no need for the driver to parse the specialization constants.

Results

In theory, we shouldn't be seeing substantial decreases in pipeline creation time, as drivers already apply several optimization steps to achieve optimal shader performance when converting the SPIR-V to actual GPU code. However, not all drivers are built the same, and not all of them may prioritize the passes featured by re-spirv which are known to be the most effective first given the nature of the shaders. These drivers may reach the optimal point by applying different passes first or may pay a higher price from converting SPIR-V as-is to their own intermediate format first.

Some of these measurements might be within a margin of error due to the highly parallelized nature of the workload depending on the system and the driver in question. However, there's hardware that shows significant improvements and it should make it clear as to why we pursued this path.

Not all of these were measured using the same project or renderer so the numerical values can't be compared between platforms. The main purpose of the chart is to show the reductions on each platform. For example, the NVIDIA measurement was done on a much larger project using Forward+ to better represent the improvements on a project that fits the platform's demands better.

Average	Adreno 640	Quest 3 (Adreno 740)	Samsung S25 (Adreno 830)	Samsung S21 (Mali-G78)	Intel Xe (Mesa)	NVIDIA RTX 3090 Ti
Specializations (master)	285 ms	235 ms	21.4 ms	49.0 ms	146 ms	124 ms
Specializations (re-spirv)	138 ms	114 ms	18.6 ms	43.7 ms	136 ms	114 ms
Regular (master)	799 ms	444 ms	63.2 ms	161 ms	289 ms	267 ms
Regular (re-spirv)	868 ms	446 ms	60.4 ms	133 ms	298 ms	257 ms

The times for Adreno are one of the main reasons behind this PR. Older Adreno drivers see over a 2x reduction in pipeline compilation times, making the games run better much faster than before and using a lot less power in the process. Given driver updates are not an option usually available on these platforms, we see a very big incentive to have this optimizer built into Godot if we wish to target these devices.
- As a bonus, it was recently found out that Godot games on Android are not properly saving the pipeline cache under common use cases. Given pipeline caching is explicit on Android and no driver-side cache is guaranteed, this can lead to a very important improvement now on every run of a game.
We see newer devices with more up-to-date drivers don't get nearly as much of a benefit as expected. However, re-spirv having any kind of improvement in these drivers means there's potential room for optimizations by vendors in these areas to further reduce pipeline creation times.
Some benefits can be achieved consistently on some platforms when applying re-spirv to regular shaders (e.g. without specialization constants), but it doesn't seem this benefit is consistent across all platforms. However, this could be attributed to a measurement variance rather than a consistent error. We could potentially discard this item if we decide to rely on spirv-opt instead to optimize the shader during shader baking.
One detail that wasn't included in these results is that re-spirv will cause a small performance regression if the pipelines are already cached. However, the regression has been measured to be in the order of less than one millisecond per pipeline, and given these pipelines are compiled on the background, this can be considered an acceptable cost to pay given the reduction on the uncached cases.

Testing

Pipeline caches from the project need to be cleared if enabled before testing. Some crude macros in the form of PRINT_PIPELINE_COMPILATION_TIMES are provided on the source code itself, but my hope is to further polish the measuring and provide proper monitors and statistics as the PR progresses more.
re-spirv has not been really used in production yet in any big projects, so it is very possible visual regressions are incorrectly introduced. Any regressions should be reported as they'd invalidate some of the findings so far and should be promptly fixed before merging is considered.

TODO

re-spirv should be disabled when using shader debugging explicitly through the command line option. Otherwise, the code elimination would make it impossible to debug the shader correctly.
A Vulkan header makes for most of the code included in this PR. It's my understanding that we already have a Vulkan header elsewhere in the project that we could use, although I've not verified yet if it includes everything re-spirv needs. If it doesn't, we could probably just update that header instead and make re-spirv use it.
- I've adopted a middle of the road approach to this and updated reflect-spirv to use a newer header. However, I think we should just stick these in the vulkan directory and make both libraries use it instead.
Determine whether we should apply re-spirv to regular non-specialized shaders or not based on the findings.
Expand benchmarking with proper monitoring and correlation of shaders to their pipeline times, shader size reductions, and allow dumping this data to analyze it more effectively.

clayjohn · 2025-11-05T21:21:27Z

This makes a nice difference on AMD dGPUs too

Average	RX 6900 XT
Specializations (master)	99 ms
Specializations (re-spirv)	82 ms
Regular (master)	105 ms
Regular (re-spirv)	92 ms

Note: the values aren't comparable to the other devices since a different test project was used

DarioSamo · 2025-11-17T14:37:14Z

There's a few pending items but this should be mostly ready for review.

I replaced the old approach of printing the average time with a more detailed spreadsheet dumping, but perhaps we want to further analyze it and draw some more conclusions with it.

CC @akien-mga I'd like a suggestion on how to properly handle the newer Vulkan SPIR-V header required. The PR updates the one on spirv-reflect, but perhaps we prefer to just move it to the common vulkan folder altogether.

clayjohn · 2025-11-18T07:14:41Z

There's a few pending items but this should be mostly ready for review.

What are the pending items? Are the potential further optimizations in re-spirv, or are they things that will impact the integration code?

At this point I am totally fine with the integration code. So I am happy to move forward with this PR unless you have any significant known issues

DarioSamo · 2025-11-18T11:26:43Z

What are the pending items? Are the potential further optimizations in re-spirv, or are they things that will impact the integration code?

We just need to sort out the bit I pinged Rémi about, as I'm not sure how we want to handle the SPIR-V header going forward. It's just a code organization issue.

clayjohn · 2025-11-20T07:10:33Z

Looks like we use that header in SPIRV-Cross, SPIRV-reflect, and GLSLang. Usually it is only updated when we do an SDK update. IMO we should probably just update to a newer version of the SDK and leave the files in their current places instead of unifying them in one location.

That being said, see #107773

akien-mga · 2025-11-25T10:46:40Z

Looks like we use that header in SPIRV-Cross, SPIRV-reflect, and GLSLang. Usually it is only updated when we do an SDK update. IMO we should probably just update to a newer version of the SDK and leave the files in their current places instead of unifying them in one location.

Yeah I suggested doing that too in DMs with Darío.
We agreed to move them to thirdparty/spirv/include/spirv/unified1/spirv.{h,hpp}, i.e. following the paths from https://github.com/KhronosGroup/SPIRV-Headers/tree/main/include/spirv/unified1

akien-mga

Buildsystem changes look good overall.

drivers/vulkan/rendering_device_driver_vulkan.cpp

thirdparty/spirv-headers/LICENSE

servers/rendering/renderer_rd/spirv-reflect/SCsub

akien-mga

Approved for buildsystem changes.

thirdparty/spirv-reflect/patches/0003-spirv-headers.patch

Includes contributions by Rémi to unify usage of SPIR-V Headers across the dependencies. Co-authored-by: Rémi Verschelde <rverschelde@gmail.com>

Repiteo · 2025-12-02T18:00:46Z

Thanks!

DarioSamo · 2025-12-02T18:16:01Z

Since this is now merged, for anyone that happens to track a particular regression to this PR, please search for the RESPV_ENABLED macro and try using it with 0 and see if it goes away. While I've done as much as I could to track possible cases of incorrect optimization, something could pop up and I intend to debug it as soon as I'm able, and shader optimization errors can be really tough to resolve.

All in all, I expect to see some amazing loading time improvements on mobile platforms specially.

DarioSamo added topic:rendering performance labels Oct 9, 2025

AThousandShips added this to the 4.x milestone Oct 9, 2025

AThousandShips added the enhancement label Oct 9, 2025

clayjohn mentioned this pull request Nov 5, 2025

All Godot 4+ games startup time takes 30+ seconds after first install #112425

Open

KeyboardDanni mentioned this pull request Nov 11, 2025

Add shader baker to project exporter. #102552

Merged

12 tasks

dsnopek mentioned this pull request Nov 15, 2025

Weird visual artifacts in right eye with foveated rendering enabled in Vulkan Mobile renderer on Meta Quest 3 #112834

Closed

DarioSamo force-pushed the re-spirv branch 2 times, most recently from 4cbfb2e to c202ec7 Compare November 17, 2025 14:32

DarioSamo marked this pull request as ready for review November 17, 2025 14:35

DarioSamo requested review from a team as code owners November 17, 2025 14:35

clayjohn modified the milestones: 4.x, 4.6 Nov 27, 2025

akien-mga reviewed Dec 2, 2025

View reviewed changes

drivers/vulkan/rendering_device_driver_vulkan.cpp Outdated Show resolved Hide resolved

thirdparty/spirv-headers/LICENSE Show resolved Hide resolved

servers/rendering/renderer_rd/spirv-reflect/SCsub Outdated Show resolved Hide resolved

DarioSamo force-pushed the re-spirv branch from 496b729 to 4a90620 Compare December 2, 2025 14:32

akien-mga approved these changes Dec 2, 2025

View reviewed changes

thirdparty/spirv-reflect/patches/0003-spirv-headers.patch Outdated Show resolved Hide resolved

akien-mga requested a review from clayjohn December 2, 2025 14:35

DarioSamo force-pushed the re-spirv branch 2 times, most recently from f903f91 to 3b731ab Compare December 2, 2025 14:37

Use re-spirv in the Vulkan driver to optimize shaders.

cf00643

Includes contributions by Rémi to unify usage of SPIR-V Headers across the dependencies. Co-authored-by: Rémi Verschelde <rverschelde@gmail.com>

DarioSamo force-pushed the re-spirv branch from 3b731ab to cf00643 Compare December 2, 2025 14:39

clayjohn approved these changes Dec 2, 2025

View reviewed changes

Repiteo merged commit 666bcb2 into godotengine:master Dec 2, 2025
20 checks passed

Alexofp mentioned this pull request Dec 6, 2025

mediump crashes the engine in 4.6dev6 in vulkan #113665

Closed

xuhuisheng mentioned this pull request Dec 7, 2025

Shader causes crash in Godot 4.6 dev 6 #113694

Closed

mxtherfxcker mentioned this pull request Dec 7, 2025

Fix re-spirv null pointer crash on invalid SPIR-V parsing #113708

Merged

akien-mga mentioned this pull request Jan 16, 2026

Shader materials use way more RAM since 4.6dev6 #115032

Closed

Uh oh!

Conversation

DarioSamo commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Disclaimer

Background

Why not apply spirv-opt instead?

Procedure

Results

Testing

TODO

Uh oh!

clayjohn commented Nov 5, 2025

Uh oh!

DarioSamo commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

clayjohn commented Nov 18, 2025

Uh oh!

DarioSamo commented Nov 18, 2025

Uh oh!

clayjohn commented Nov 20, 2025

Uh oh!

akien-mga commented Nov 25, 2025

Uh oh!

akien-mga left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

akien-mga left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Repiteo commented Dec 2, 2025

Uh oh!

DarioSamo commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

DarioSamo commented Oct 9, 2025 •

edited

Loading

DarioSamo commented Nov 17, 2025 •

edited

Loading

DarioSamo commented Dec 2, 2025 •

edited

Loading