include::meta/VK_EXT_shader_subgroup_vote.txt[]

*Last Modified Date*::
    2016-11-28
*IP Status*::
    No known IP claims.
*Interactions and External Dependencies*::
  - This extension requires the
    https://www.khronos.org/registry/spir-v/extensions/KHR/SPV_KHR_subgroup_vote.html[+SPV_KHR_subgroup_vote+]
    SPIR-V extension.
  - This extension requires the
    https://www.khronos.org/registry/OpenGL/extensions/ARB/ARB_shader_group_vote.txt[+GL_ARB_shader_group_vote+]
    extension for GLSL source languages.
*Contributors*::
  - Neil Henning, Codeplay
  - Daniel Koch, NVIDIA Corporation

This extension adds support for the following SPIR-V extension in Vulkan:

  * +SPV_KHR_subgroup_vote+

This extension provides new SPIR-V instructions:

  * code:OpSubgroupAllKHR,
  * code:OpSubgroupAnyKHR, and
  * code:OpSubgroupAllEqualKHR.

These instructions compute the composite of a set of boolean conditions
across a group of shader invocations that are running concurrently (a
_subgroup_).
These composite results may be used to execute shaders more efficiently on a
slink:VkPhysicalDevice.

When using GLSL source-based shader languages, the following shader
functions from GL_ARB_shader_group_vote can map to these SPIR-V
instructions:

  * code:anyInvocationARB() -> code:OpSubgroupAnyKHR,
  * code:allInvocationsARB() -> code:OpSubgroupAllKHR, and
  * code:allInvocationsEqualARB() -> code:OpSubgroupAllEqualKHR.

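The results of the three vote operations can be modeled on the host.
The following is a minimal C++ sketch rather than shader code: it assumes a subgroup can be represented as a plain list holding the boolean argument of each active invocation, and the names `SubgroupVotes`, `subgroupAll`, `subgroupAny` and `subgroupAllEqual` are hypothetical helpers that are not part of any API.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical model: one boolean per active invocation of a single subgroup.
using SubgroupVotes = std::vector<bool>;

// Models OpSubgroupAllKHR: true only if every invocation passed true.
bool subgroupAll(const SubgroupVotes& votes) {
    assert(!votes.empty());  // assumption: at least one active invocation
    return std::all_of(votes.begin(), votes.end(), [](bool v) { return v; });
}

// Models OpSubgroupAnyKHR: true if at least one invocation passed true.
bool subgroupAny(const SubgroupVotes& votes) {
    assert(!votes.empty());
    return std::any_of(votes.begin(), votes.end(), [](bool v) { return v; });
}

// Models OpSubgroupAllEqualKHR: true if every invocation passed the same
// value, whether that value is true or false.
bool subgroupAllEqual(const SubgroupVotes& votes) {
    assert(!votes.empty());
    return subgroupAll(votes) || !subgroupAny(votes);
}
```

Every invocation in the modeled subgroup would observe the same return value, which is what makes the result safe to branch on.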
The subgroup across which the boolean conditions are evaluated is
implementation-dependent, and this extension provides no guarantee about how
individual shader invocations are assigned to subgroups.
In particular, a subgroup has no necessary relationship with the compute
shader _local workgroup_: any pair of shader invocations in a compute
local workgroup may execute in different subgroups as used by these
instructions.

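Because the assignment of invocations to subgroups is not specified, the same set of conditions can produce different vote results on different implementations.
The C++ sketch below illustrates this under a purely hypothetical assumption that invocations are packed into contiguous subgroups of a fixed size; the function name `allVotePerSubgroup` and the packing rule are illustrative only, and a real implementation may group invocations in any way.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model: split the per-invocation conditions of one workgroup
// into contiguous subgroups of `subgroupSize` and return the "all" vote of
// each subgroup. The contiguous packing is an assumption of this sketch.
std::vector<bool> allVotePerSubgroup(const std::vector<bool>& conditions,
                                     std::size_t subgroupSize) {
    assert(subgroupSize > 0);
    std::vector<bool> results;
    for (std::size_t begin = 0; begin < conditions.size(); begin += subgroupSize) {
        const std::size_t end = std::min(begin + subgroupSize, conditions.size());
        bool all = true;
        for (std::size_t i = begin; i < end; ++i) {
            all = all && conditions[i];
        }
        results.push_back(all);
    }
    return results;
}
```

With the conditions `{true, true, false, true}`, a subgroup size of 4 yields a single false vote, while a subgroup size of 2 yields a true vote for the first pair and a false vote for the second, so shader logic should not rely on any particular grouping.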
Compute shaders operate on an explicitly specified group of threads (a local
workgroup), but many implementations will also group non-compute shader
invocations and execute them concurrently.
When executing code like

[source,c++]
----------------------------------------
if (condition) {
    result = do_fast_path();
} else {
    result = do_general_path();
}
----------------------------------------

where code:condition diverges between invocations, an implementation might
first execute code:do_fast_path() for the invocations where code:condition
is true and leave the other invocations dormant.
Once code:do_fast_path() returns, it might call code:do_general_path() for
invocations where code:condition is false and leave the other invocations
dormant.
In this case, the shader executes *both* the fast and the general path and
might be better off just using the general path for all invocations.

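The cost of such divergence can be sketched with a simple host-side model.
The C++ below assumes, as in the scenario described above, that a subgroup executes a path in full whenever at least one of its invocations takes it; the function `divergentBranchCost` and the abstract cost units are hypothetical and do not describe any particular implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical cost model for one subgroup running
//     if (condition) fast path; else general path;
// A path costs its full amount if any invocation in the subgroup takes it.
int divergentBranchCost(const std::vector<bool>& conditions,
                        int fastCost, int generalCost) {
    assert(!conditions.empty());  // assumption: at least one active invocation
    const bool anyTrue = std::any_of(conditions.begin(), conditions.end(),
                                     [](bool c) { return c; });
    const bool anyFalse = std::any_of(conditions.begin(), conditions.end(),
                                      [](bool c) { return !c; });
    return (anyTrue ? fastCost : 0) + (anyFalse ? generalCost : 0);
}
```

Under this model, a divergent subgroup with a fast cost of 2 and a general cost of 10 pays 12, which is more than the 10 it would pay if every invocation simply took the general path.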
This extension provides the ability to avoid divergent execution by
evaluating a condition across an entire subgroup using code like:

[source,c++]
----------------------------------------
if (allInvocationsARB(condition)) {
    result = do_fast_path();
} else {
    result = do_general_path();
}
----------------------------------------

The built-in function code:allInvocationsARB() will return the same value
for all invocations in the group, so the group will either execute
code:do_fast_path() or code:do_general_path(), but never both.
For example, shader code might want to evaluate a complex function
iteratively by starting with an approximation of the result and then
refining the approximation.
Some input values may require a small number of iterations to generate an
accurate result (code:do_fast_path) while others require a larger number
(code:do_general_path).
In another example, shader code might want to evaluate a complex function
(code:do_general_path) that can be greatly simplified when assuming a
specific value for one of its inputs (code:do_fast_path).

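The effect of branching on a subgroup-wide vote can be sketched in the same host-side style.
The C++ below is a hypothetical model, not shader code: `Path`, `votedPath` and `votedBranchCost` are illustrative names, and the sketch assumes the general path is valid for every input so that it can be substituted whenever the vote fails.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

enum class Path { Fast, General };

// Hypothetical model of
//     if (allInvocationsARB(condition)) fast path; else general path;
// Every invocation of the subgroup sees the same vote, so all of them take
// the same path.
Path votedPath(const std::vector<bool>& conditions) {
    assert(!conditions.empty());  // assumption: at least one active invocation
    const bool all = std::all_of(conditions.begin(), conditions.end(),
                                 [](bool c) { return c; });
    return all ? Path::Fast : Path::General;
}

// Cost in abstract units: exactly one path is executed per subgroup.
int votedBranchCost(const std::vector<bool>& conditions,
                    int fastCost, int generalCost) {
    return votedPath(conditions) == Path::Fast ? fastCost : generalCost;
}
```

In this model a subgroup with mixed conditions pays only the general-path cost, and a subgroup whose invocations all satisfy the condition pays only the fast-path cost.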
=== New Object Types

None.

=== New Enum Constants

None.

=== New Enums

None.

=== New Structures

None.

=== New Functions

None.

=== New Built-In Variables

None.

=== New SPIR-V Capabilities

  * <<spirvenv-capabilities-table-subgroupvote,SubgroupVoteKHR>>

=== Issues

None.

=== Version History

  * Revision 1, 2016-11-28 (Daniel Koch)
    - Initial draft