mirror of
https://github.com/status-im/Vulkan-Docs.git
synced 2025-02-25 12:35:11 +00:00
* Update release number to 120. Github Issues: * Add slink:VkAccelerationStructureTypeNV explicitly to extension XML for `<<VK_NV_ray_tracing>>` (public issue 848). * Add missing valid usage statements for feature flags in slink:VkCommandBufferInheritanceInfo (public pull request 1017). Internal Issues: * Clarify behavior of non-premultiplied destination colors for `<<VK_EXT_blend_operation_advanced>>` prior to the definition of slink:VkBlendOverlapEXT (internal issue 1766). * Fix the confusing phrasing "`no other queue must: be (doing something)`" for flink:vkQueuePresentKHR, flink:vkQueueSubmit, and flink:vkQueueBindSparse (internal issue 1774). * Add `<<VK_EXT_validation_features>>` flag to enable best practices checks, which will soon be available in the validation layer (internal issue 1779). * Specify allowed characters for VUID tag name components in the style guide (internal issue 1788). * Update links to SPIR-V extension specifications, and parameterize their markup in case the URLs change in the future (internal issue 1797). * Fix an off-by-one error in the valid usage statement for slink:VkPipelineExecutableInfoKHR (internal merge request 3303). * Clean up markup indentation not matching the style guide (internal merge request 3314). * Minor script updates to allow refpage aliases, generate a dynamic TOC for refpages, generate Apache rewrite rules for aliases, open external links from refpages in a new window, and synchronize with the OpenCL scripts. This will shortly enable a paned navigation setup for refpages, similar to the OpenCL 2.2 refpages (internal merge request 3322). * Script updates to add tests to the checker, refactor and reformat code, generate better text for some valid usage statements, use more Pythonic idioms, and synchronize with the OpenXR scripts (internal merge request 3239). * Script updates and minor fixes in spec language to not raise checker errors for refpage markup of pages not existing in the API, such as VKAPI_NO_STDINT_H. Remove corresponding suppression of some check_spec_links.py tests from .gitlab-ci.yml and 'allchecks' target (internal merge request 3315).
133 lines
4.0 KiB
Plaintext
133 lines
4.0 KiB
Plaintext
include::meta/VK_EXT_shader_subgroup_vote.txt[]
|
|
|
|
*Last Modified Date*::
|
|
2016-11-28
|
|
*IP Status*::
|
|
No known IP claims.
|
|
*Interactions and External Dependencies*::
|
|
- This extension requires the
|
|
{spirv}/KHR/SPV_KHR_subgroup_vote.html[`SPV_KHR_subgroup_vote`] SPIR-V
|
|
extension.
|
|
- This extension requires the
|
|
https://www.khronos.org/registry/OpenGL/extensions/ARB/ARB_shader_group_vote.txt[`GL_ARB_shader_group_vote`]
|
|
extension for GLSL source languages.
|
|
*Contributors*::
|
|
- Neil Henning, Codeplay
|
|
- Daniel Koch, NVIDIA Corporation
|
|
|
|
This extension adds support for the following SPIR-V extension in Vulkan:
|
|
|
|
* `SPV_KHR_subgroup_vote`
|
|
|
|
This extension provides new SPIR-V instructions:
|
|
|
|
* code:OpSubgroupAllKHR,
|
|
* code:OpSubgroupAnyKHR, and
|
|
* code:OpSubgroupAllEqualKHR.
|
|
|
|
to compute the composite of a set of boolean conditions across a group of
|
|
shader invocations that are running concurrently (a _subgroup_).
|
|
These composite results may be used to execute shaders more efficiently on a
|
|
slink:VkPhysicalDevice.
|
|
|
|
When using GLSL source-based shader languages, the following shader
|
|
functions from GL_ARB_shader_group_vote can map to these SPIR-V
|
|
instructions:
|
|
|
|
* code:anyInvocationARB() -> code:OpSubgroupAnyKHR,
|
|
* code:allInvocationsARB() -> code:OpSubgroupAllKHR, and
|
|
* code:allInvocationsEqualARB() -> code:OpSubgroupAllEqualKHR.
|
|
|
|
The subgroup across which the boolean conditions are evaluated is
|
|
implementation-dependent, and this extension provides no guarantee over how
|
|
individual shader invocations are assigned to subgroups.
|
|
In particular, a subgroup has no necessary relationship with the compute
|
|
shader _local workgroup_ -- any pair of shader invocations in a compute
|
|
local workgroup may execute in different subgroups as used by these
|
|
instructions.
|
|
|
|
Compute shaders operate on an explicitly specified group of threads (a local
|
|
workgroup), but many implementations will also group non-compute shader
|
|
invocations and execute them concurrently.
|
|
When executing code like
|
|
|
|
[source,c++]
|
|
----------------------------------------
|
|
if (condition) {
|
|
result = do_fast_path();
|
|
} else {
|
|
result = do_general_path();
|
|
}
|
|
----------------------------------------
|
|
|
|
where code:condition diverges between invocations, an implementation might
|
|
first execute code:do_fast_path() for the invocations where code:condition
|
|
is true and leave the other invocations dormant.
|
|
Once code:do_fast_path() returns, it might call code:do_general_path() for
|
|
invocations where code:condition is code:false and leave the other
|
|
invocations dormant.
|
|
In this case, the shader executes *both* the fast and the general path and
|
|
might be better off just using the general path for all invocations.
|
|
|
|
This extension provides the ability to avoid divergent execution by
|
|
evaluating a condition across an entire subgroup using code like:
|
|
|
|
[source,c++]
|
|
----------------------------------------
|
|
if (allInvocationsARB(condition)) {
|
|
result = do_fast_path();
|
|
} else {
|
|
result = do_general_path();
|
|
}
|
|
----------------------------------------
|
|
|
|
The built-in function code:allInvocationsARB() will return the same value
|
|
for all invocations in the group, so the group will either execute
|
|
code:do_fast_path() or code:do_general_path(), but never both.
|
|
For example, shader code might want to evaluate a complex function
|
|
iteratively by starting with an approximation of the result and then
|
|
refining the approximation.
|
|
Some input values may require a small number of iterations to generate an
|
|
accurate result (code:do_fast_path) while others require a larger number
|
|
(code:do_general_path).
|
|
In another example, shader code might want to evaluate a complex function
|
|
(code:do_general_path) that can be greatly simplified when assuming a
|
|
specific value for one of its inputs (code:do_fast_path).
|
|
|
|
=== New Object Types
|
|
|
|
None.
|
|
|
|
=== New Enum Constants
|
|
|
|
None.
|
|
|
|
=== New Enums
|
|
|
|
None.
|
|
|
|
=== New Structures
|
|
|
|
None.
|
|
|
|
=== New Functions
|
|
|
|
None.
|
|
|
|
=== New Built-In Variables
|
|
|
|
None.
|
|
|
|
=== New SPIR-V Capabilities
|
|
|
|
* <<spirvenv-capabilities-table-subgroupvote,SubgroupVoteKHR>>
|
|
|
|
=== Issues
|
|
|
|
None.
|
|
|
|
=== Version History
|
|
|
|
* Revision 1, 2016-11-28 (Daniel Koch)
|
|
- Initial draft
|