
[[VK_EXT_shader_subgroup_vote]]
== VK_EXT_shader_subgroup_vote

*Name String*::
    VK_EXT_shader_subgroup_vote
*Extension Type*::
    Device extension
*Registered Extension Number*::
    66
*Status*::
    Draft
*Last Modified Date*::
    2016-11-28
*Revision*::
    1
*IP Status*::
    No known IP claims.
*Dependencies*::
  - This extension is written against version 1.0 of the Vulkan API.
  - This extension requires Vulkan 1.0.
  - This extension requires the
    https://www.khronos.org/registry/spir-v/extensions/KHR/SPV_KHR_subgroup_vote.html[+SPV_KHR_subgroup_vote+]
    SPIR-V extension.
  - This extension requires the
    https://www.opengl.org/registry/specs/ARB/shader_group_vote.txt[+GL_ARB_shader_group_vote+]
    extension for GLSL source languages.
*Contributors*::
  - Neil Henning, Codeplay
  - Daniel Koch, NVIDIA Corporation
*Contact*::
  - Daniel Koch (dkoch 'at' nvidia.com)
*Overview*::
+
--
This extension adds support for the following SPIR-V extension in Vulkan:

* SPV_KHR_subgroup_vote

This extension provides new SPIR-V instructions that compute the composite
of a set of boolean conditions across a group of shader invocations that are
running concurrently (a _subgroup_):

  * code:OpSubgroupAllKHR,
  * code:OpSubgroupAnyKHR, and
  * code:OpSubgroupAllEqualKHR.

These composite results may be used to execute shaders more efficiently on a
slink:VkPhysicalDevice.
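
As an informal illustration of what "composite" means here, the three
instructions can be modeled on the host as reductions over one boolean per
active invocation.
This is only a sketch: the names code:Predicates, code:subgroupAll,
code:subgroupAny and code:subgroupAllEqual are hypothetical and are not part
of this extension or of any Vulkan API.

[source,c++]
----------------------------------------
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical host-side model: one predicate value per active invocation.
using Predicates = std::vector<bool>;

// Models OpSubgroupAllKHR: true only if every active invocation passed true.
bool subgroupAll(const Predicates& p) {
    assert(!p.empty());  // the calling invocation is always active
    return std::all_of(p.begin(), p.end(), [](bool b) { return b; });
}

// Models OpSubgroupAnyKHR: true if at least one active invocation passed true.
bool subgroupAny(const Predicates& p) {
    assert(!p.empty());
    return std::any_of(p.begin(), p.end(), [](bool b) { return b; });
}

// Models OpSubgroupAllEqualKHR: true if every active invocation passed the
// same value, whether that value is true or false.
bool subgroupAllEqual(const Predicates& p) {
    assert(!p.empty());
    return subgroupAll(p) || !subgroupAny(p);
}
----------------------------------------

In this model every invocation in the subgroup observes the same return
value, which is the property the instructions rely on.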

When using GLSL source-based shader languages, the following shader
functions from GL_ARB_shader_group_vote can map to these SPIR-V
instructions:

  * code:anyInvocationARB() -> code:OpSubgroupAnyKHR,
  * code:allInvocationsARB() -> code:OpSubgroupAllKHR, and
  * code:allInvocationsEqualARB() -> code:OpSubgroupAllEqualKHR.

The subgroup across which the boolean conditions are evaluated is
implementation-dependent, and this extension provides no guarantee about how
individual shader invocations are assigned to subgroups.
In particular, a subgroup is not necessarily related to the compute shader
_local workgroup_: any two shader invocations in the same compute local
workgroup may execute in different subgroups as used by these instructions.

Compute shaders operate on an explicitly specified group of threads (a local
workgroup), but many implementations will also group non-compute shader
invocations and execute them concurrently.
When executing code like

[source,c++]
----------------------------------------
if (condition) {
  result = do_fast_path();
} else {
  result = do_general_path();
}
----------------------------------------

where code:condition diverges between invocations, an implementation might
first execute code:do_fast_path() for the invocations where code:condition
is true and leave the other invocations dormant.
Once code:do_fast_path() returns, it might call code:do_general_path() for
invocations where code:condition is false and leave the other invocations
dormant.
In this case, the shader executes *both* the fast and the general path, and
it might have been cheaper to run only the general path for all invocations.
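
The cost of such a divergent branch can be sketched with a small host-side
model that counts how many of the two paths a subgroup ends up running.
The function name code:pathsExecutedDivergent is hypothetical, and the model
assumes the serialization behavior described above, which an implementation
is not required to follow.

[source,c++]
----------------------------------------
#include <cassert>
#include <vector>

// Hypothetical cost model: given each invocation's value of `condition`,
// return how many of the two branch bodies the subgroup has to execute when
// divergent paths are run one after the other.
int pathsExecutedDivergent(const std::vector<bool>& condition) {
    assert(!condition.empty());
    bool anyTrue = false;
    bool anyFalse = false;
    for (bool c : condition) {
        if (c) {
            anyTrue = true;   // someone needs do_fast_path()
        } else {
            anyFalse = true;  // someone needs do_general_path()
        }
    }
    return (anyTrue ? 1 : 0) + (anyFalse ? 1 : 0);
}
----------------------------------------

Under this model a single invocation that disagrees with the rest is enough
to make the whole subgroup pay for both paths.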

This extension provides the ability to avoid divergent execution by
evaluating a condition across an entire subgroup using code like:

[source,c++]
----------------------------------------
if (allInvocationsARB(condition)) {
  result = do_fast_path();
} else {
  result = do_general_path();
}
----------------------------------------

The built-in function code:allInvocationsARB() returns the same value for
all invocations in the subgroup, so the subgroup executes either
code:do_fast_path() or code:do_general_path(), but never both.
For example, shader code might evaluate a complex function iteratively by
starting with an approximation of the result and then refining the
approximation.
Some input values may require only a small number of iterations to generate
an accurate result (code:do_fast_path()) while others require a larger
number (code:do_general_path()).
In another example, shader code might evaluate a complex function
(code:do_general_path()) that can be greatly simplified when assuming a
specific value for one of its inputs (code:do_fast_path()).
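
The iterative-refinement case can be sketched on the host as follows.
Everything in this listing is an assumption made for illustration: the names
code:newtonSqrt, code:subgroupSqrt and code:SubgroupResult, the choice of a
Newton iteration for a square root, the input range treated as "fast", and
the iteration counts 3 and 20.
It models a vote over the whole subgroup and then runs a single path for
every invocation.

[source,c++]
----------------------------------------
#include <algorithm>
#include <cassert>
#include <vector>

// Newton iteration for sqrt(x) with x > 0, starting from the guess 1.0.
float newtonSqrt(float x, int iterations) {
    float g = 1.0f;
    for (int i = 0; i < iterations; ++i) {
        g = 0.5f * (g + x / g);
    }
    return g;
}

struct SubgroupResult {
    std::vector<float> values;  // one result per invocation
    bool tookFastPath;          // same for the whole subgroup
};

// Models `if (allInvocationsARB(condition))` for one subgroup: the vote is
// taken once, and every invocation then runs the same path.
SubgroupResult subgroupSqrt(const std::vector<float>& inputs) {
    assert(!inputs.empty());
    // Per-invocation condition (assumed): inputs near 1 converge quickly.
    const bool allFast = std::all_of(
        inputs.begin(), inputs.end(),
        [](float x) { return x >= 0.5f && x <= 2.0f; });
    const int iterations = allFast ? 3 : 20;  // fast path vs. general path
    SubgroupResult result{{}, allFast};
    for (float x : inputs) {
        result.values.push_back(newtonSqrt(x, iterations));
    }
    return result;
}
----------------------------------------

With inputs such as 1.0, 2.0 and 0.5 the modeled subgroup takes the fast
path; adding an input such as 100.0 moves every invocation, including those
that would have qualified on their own, to the general path.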

--

=== New Object Types

None.

=== New Enum Constants

None.

=== New Enums

None.

=== New Structures

None.

=== New Functions

None.

=== New Built-In Variables

None.

=== New SPIR-V Capabilities

  * <<spirvenv-capabilities-table-subgroupvote,SubgroupVoteKHR>>

=== Issues

None.

=== Version History

  * Revision 1, 2016-11-28 (Daniel Koch)
    - Initial draft