vulkan persistent pipeline cache and shader spirv code cache implementation #6248

CLV-Iclucia · 2025-08-11T12:47:01Z

Add Persistent Vulkan Pipeline Cache and Persistent Shader SPIR-V Cache

Changes

Implemented persistent save/load APIs for PipelineCache, enabling serialization of VkPipelineCache data to disk and reuse across application runs.
Introduced strict validation for cache consistency, including:
- data hash verification
- GPU info
- ABI (32 bit or 64 bit)
Developed a separate on-disk cache mechanism for compiled shader SPIR-V binaries to avoid redundant shader compilation and improve pipeline build efficiency across runs.

Design Considerations

Save VkPipeline cache to the disk to accelerate pipeline creation.
Most of the time, compiling source code to spirv code is the bottleneck so I also designed a persistent spirv code cache to accelerate compiling across runs.

Testing

Verified robustness for pipeline cache in various situations like file corruption and multi-threading.
Passed all the current tests on Windows.

Performance

In a single pipeline creation test, the time taken for creating a pipeline is reduced from 90ms to 0.4ms using the two caches across runs (mocked by creating and destroying GPU repeatedly) on my PC and this is mainly contributed by spirv code cache.

The CPU is AMD Ryzen 7 5800H and the GPU is Nvidia RTX 3060.

pipeline cache test creation time (without cache): 90.87 ms
pipeline cache test creation time (with cache): 0.37 ms

Impact

Now ncnn users might need to manually load/save the pipeline cache, or customize spirv code cache saving directories.
Updating ncnn might need to change the ncnn_version in spirv cache header.

Questions for Review

1. Cross-platform file operations

The project currently lacks a unified cross-platform file API due to using C++11 (like renaming file). I implemented platform-specific file handling within the relevant pipelinecache.cpp files.
Would it be better to extract these into a cross-platform file module? Any recommendations or existing patterns in the project?

2. SPIR-V cache invalidation strategy

Multiple factors affect the generated SPIR-V, such as shader source, glslang version, and how ncnn handles device extensions and options.
I combined these into a single ncnn_version field in the spirv code cache header to invalidate caches on changes.
Is this approach sufficient? Are there better versioning or change-tracking strategies you recommend?

3. Testing and API exposure for SPIR-V cache

Exposing many interfaces to manipulate the SPIR-V cache risks complexity, yet hiding them makes thorough testing difficult.
What is the best practice here? Controlled internal APIs for testing, or relying on integration tests?

4. Security of SPIR-V cache files

If a cached SPIR-V file is maliciously altered (content and hash), there's a risk of running compromised shaders when loading the cache.
Do you have suggestions on mitigating this risk?

…test in test_pipeline_cache

…_cache.cpp

…he.h/cpp, reformat code

tencent-adm · 2025-08-11T12:47:17Z

All committers have signed the CLA.

github-actions · 2025-08-11T14:52:20Z

The binary size change of libncnn.so (bytes)

architecture	base size	pr size	difference
x86_64	15155464	15148064	-7400 😘
armhf	6182208	6170080	-12128 😘
aarch64	9520608	9520968	+360 ⚠️

nihui · 2025-08-21T07:34:30Z

感谢你的工作，请将你在实现中的笔记和心得，遇到的困难和解决方法等，记录成文章，发表在discussion分区，这将作为知识总结 https://github.com/Tencent/ncnn/discussions

Thank you for your work. Please record your notes and experience in the implementation, difficulties encountered and solutions, etc. into an article and publish it in the discussion section. This will serve as a knowledge summary. https://github.com/Tencent/ncnn/discussions

codecov-commenter · 2025-09-10T06:56:10Z

Codecov Report

❌ Patch coverage is 69.25373% with 103 lines in your changes missing coverage. Please review.
✅ Project coverage is 95.85%. Comparing base (a514cf5) to head (34c9d85).
⚠️ Report is 4 commits behind head on master.

Files with missing lines	Patch %	Lines
src/pipelinecache.cpp	68.28%	98 Missing ⚠️
src/gpu.cpp	80.76%	5 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #6248      +/-   ##
==========================================
- Coverage   95.89%   95.85%   -0.04%     
==========================================
  Files         837      836       -1     
  Lines      264994   265053      +59     
==========================================
- Hits       254105   254076      -29     
- Misses      10889    10977      +88

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

CLV-Iclucia and others added 15 commits August 1, 2025 23:28

add persistent vulkan pipeline cache prototype

20201fd

Merge branch 'Tencent:master' into vulkan-persistent-pipeline-cache-dev

e7070f3

finish persistent pipeline cache and add test

f93d840

add persistent spirv code cache in pipelinecache.h/cpp

cb2c27e

Merge branch 'master' into vulkan-persistent-pipeline-cache-dev

bda1eaa

finish shader spirv cache in pipelinecache.cpp/h

3e6438e

Reverted tools/pnnx

664eb0c

add interface for user to clear cache in pipelinecache.cpp/h, refine …

2074b59

…test in test_pipeline_cache

reformat code pipelinecache.cpp and test_pipeline_cache.cpp

ee63525

remove unnecessary headers in pipelinecache.cpp

68e3a20

fix lock in pipelinecache.cpp, add multithread tests in test_pipeline…

213e290

…_cache.cpp

make unnecessary protected methods invisible in header in pipelinecac…

780af83

…he.h/cpp, reformat code

revert unnecessary code format

834741f

add accidentally deleted files back

d364052

add accidentally removed pybind11_mat.h back

aaaf1db

github-actions bot added core test labels Aug 11, 2025

CLV-Iclucia added 4 commits August 11, 2025 21:21

add pipeline cache flag bits in simplevk.h

f9ab7c2

fix compilation error on linux

82f93ba

fix compilation error on linux

a04c2e4

fixing compilation error on linux in pipelinecache.cpp

a954b48

CLV-Iclucia and others added 3 commits September 10, 2025 15:35

Merge branch 'master' into vulkan-persistent-pipeline-cache-dev

cafc417

fix bugs on linux files and android compilation error

b57c398

apply code-format changes

34c9d85

nihui closed this Sep 11, 2025

nihui reopened this Sep 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

vulkan persistent pipeline cache and shader spirv code cache implementation #6248

vulkan persistent pipeline cache and shader spirv code cache implementation #6248

Uh oh!

CLV-Iclucia commented Aug 11, 2025

Uh oh!

tencent-adm commented Aug 11, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Aug 11, 2025 •

edited

Loading

Uh oh!

nihui commented Aug 21, 2025

Uh oh!

codecov-commenter commented Sep 10, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

vulkan persistent pipeline cache and shader spirv code cache implementation #6248

Are you sure you want to change the base?

vulkan persistent pipeline cache and shader spirv code cache implementation #6248

Uh oh!

Conversation

CLV-Iclucia commented Aug 11, 2025

Add Persistent Vulkan Pipeline Cache and Persistent Shader SPIR-V Cache

Changes

Design Considerations

Testing

Performance

Impact

Questions for Review

1. Cross-platform file operations

2. SPIR-V cache invalidation strategy

3. Testing and API exposure for SPIR-V cache

4. Security of SPIR-V cache files

Uh oh!

tencent-adm commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nihui commented Aug 21, 2025

Uh oh!

codecov-commenter commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tencent-adm commented Aug 11, 2025 •

edited

Loading

github-actions bot commented Aug 11, 2025 •

edited

Loading

codecov-commenter commented Sep 10, 2025 •

edited

Loading