[[VK_EXT_shader_subgroup_vote]]
== VK_EXT_shader_subgroup_vote

*Name String*::
VK_EXT_shader_subgroup_vote
*Extension Type*::
Device extension
*Registered Extension Number*::
66
*Status*::
Draft
*Last Modified Date*::
2016-11-28
*Revision*::
1
*IP Status*::
No known IP claims.
*Dependencies*::
- This extension is written against version 1.0 of the Vulkan API.
- This extension requires Vulkan 1.0.
- This extension requires the
https://www.khronos.org/registry/spir-v/extensions/KHR/SPV_KHR_subgroup_vote.html[+SPV_KHR_subgroup_vote+]
SPIR-V extension.
- This extension requires the
https://www.opengl.org/registry/specs/ARB/shader_group_vote.txt[+GL_ARB_shader_group_vote+]
extension for GLSL source languages.
*Contributors*::
- Neil Henning, Codeplay
- Daniel Koch, NVIDIA Corporation
*Contact*::
- Daniel Koch (dkoch 'at' nvidia.com)
*Overview*::
+
--
This extension adds support for the following SPIR-V extension in Vulkan:

* +SPV_KHR_subgroup_vote+

This extension provides three new SPIR-V instructions:

* code:OpSubgroupAllKHR,
* code:OpSubgroupAnyKHR, and
* code:OpSubgroupAllEqualKHR.

These instructions compute the composite of a set of boolean conditions
across a group of shader invocations that are running concurrently (a
_subgroup_).
These composite results may be used to execute shaders more efficiently on a
slink:VkPhysicalDevice.

When using GLSL source-based shader languages, the following shader
functions from +GL_ARB_shader_group_vote+ can map to these SPIR-V
instructions:

* code:anyInvocationARB() -> code:OpSubgroupAnyKHR,
* code:allInvocationsARB() -> code:OpSubgroupAllKHR, and
* code:allInvocationsEqualARB() -> code:OpSubgroupAllEqualKHR.

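The semantics of the three vote operations listed above can be sketched with a small CPU-side model.
This is an illustration only, not part of the extension: the type code:SubgroupPredicates and the functions code:subgroup_all, code:subgroup_any and code:subgroup_all_equal are hypothetical names, and the model assumes a subgroup can be represented as one boolean per active invocation with at least one invocation present.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical model: one predicate value per active invocation in a subgroup.
using SubgroupPredicates = std::vector<bool>;

// Models OpSubgroupAllKHR / allInvocationsARB():
// true only if the predicate is true for every invocation.
bool subgroup_all(const SubgroupPredicates& p) {
    assert(!p.empty());  // assumption: a subgroup has at least one invocation
    return std::all_of(p.begin(), p.end(), [](bool b) { return b; });
}

// Models OpSubgroupAnyKHR / anyInvocationARB():
// true if the predicate is true for at least one invocation.
bool subgroup_any(const SubgroupPredicates& p) {
    assert(!p.empty());
    return std::any_of(p.begin(), p.end(), [](bool b) { return b; });
}

// Models OpSubgroupAllEqualKHR / allInvocationsEqualARB():
// true if every invocation holds the same predicate value.
bool subgroup_all_equal(const SubgroupPredicates& p) {
    return subgroup_all(p) || !subgroup_any(p);
}
```

In this model, a subgroup with predicates `{true, false}` yields false from code:subgroup_all, true from code:subgroup_any, and false from code:subgroup_all_equal.
Every invocation in the subgroup observes the same result, which is the property the rest of this overview relies on.
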
The subgroup across which the boolean conditions are evaluated is
implementation-dependent, and this extension provides no guarantee over how
individual shader invocations are assigned to subgroups.
In particular, a subgroup has no necessary relationship with the compute
shader _local workgroup_ -- any pair of shader invocations in a compute
local workgroup may execute in different subgroups as used by these
instructions.

Compute shaders operate on an explicitly specified group of threads (a local
workgroup), but many implementations will also group non-compute shader
invocations and execute them concurrently.
When executing code like

[source,c++]
----------------------------------------
if (condition) {
    result = do_fast_path();
} else {
    result = do_general_path();
}
----------------------------------------

where code:condition diverges between invocations, an implementation might
first execute code:do_fast_path() for the invocations where code:condition
is true and leave the other invocations dormant.
Once code:do_fast_path() returns, it might call code:do_general_path() for
invocations where code:condition is false and leave the other invocations
dormant.
In this case, the shader executes *both* the fast and the general path and
might be better off just using the general path for all invocations.

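The cost of the divergent case described above can be sketched with a simple counting model.
The function code:divergent_passes is a hypothetical name, and the model assumes the serialized behavior described in the preceding paragraph (one pass per branch that any invocation takes); actual implementations may schedule divergent branches differently.

```cpp
#include <cassert>
#include <vector>

// Hypothetical cost model: how many branch bodies a subgroup executes when
// each invocation branches on its own value of `condition`.
// Assumes the serialized execution described in the text above.
int divergent_passes(const std::vector<bool>& condition) {
    assert(!condition.empty());  // assumption: at least one invocation
    bool any_true = false;
    bool any_false = false;
    for (bool c : condition) {
        if (c) {
            any_true = true;   // some invocation needs do_fast_path()
        } else {
            any_false = true;  // some invocation needs do_general_path()
        }
    }
    // One pass for each branch that at least one invocation takes.
    return (any_true ? 1 : 0) + (any_false ? 1 : 0);
}
```

For example, `divergent_passes({true, false, true})` evaluates to 2, because the subgroup runs both branches, while a subgroup whose invocations all agree runs only one.
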
This extension provides the ability to avoid divergent execution by
evaluating a condition across an entire subgroup using code like:

[source,c++]
----------------------------------------
if (allInvocationsARB(condition)) {
    result = do_fast_path();
} else {
    result = do_general_path();
}
----------------------------------------

The built-in function code:allInvocationsARB() will return the same value
for all invocations in the group, so the group will either execute
code:do_fast_path() or code:do_general_path(), but never both.
For example, shader code might want to evaluate a complex function
iteratively by starting with an approximation of the result and then
refining the approximation.
Some input values may require a small number of iterations to generate an
accurate result (code:do_fast_path) while others require a larger number
(code:do_general_path).
In another example, shader code might want to evaluate a complex function
(code:do_general_path) that can be greatly simplified when assuming a
specific value for one of its inputs (code:do_fast_path).

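The iterative-refinement example above can be sketched as a CPU-side model of a single subgroup.
Everything in this sketch is an assumption made for illustration: Newton's method for a square root stands in for the "complex function", the iteration counts 3 and 20 and the input range in code:condition are arbitrary choices, and code:refine_sqrt and code:run_subgroup are hypothetical names rather than anything defined by this extension.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical refinement routine: Newton's method for sqrt(x), x > 0,
// starting from the guess 1.0 and running a fixed number of iterations.
double refine_sqrt(double x, int iterations) {
    double y = 1.0;
    for (int i = 0; i < iterations; ++i) {
        y = 0.5 * (y + x / y);
    }
    return y;
}

// Assumed stand-ins for do_fast_path() and do_general_path().
double do_fast_path(double x)    { return refine_sqrt(x, 3); }
double do_general_path(double x) { return refine_sqrt(x, 20); }

// Assumed per-invocation condition: inputs near the initial guess
// need only a few refinement steps.
bool condition(double x) { return x >= 0.5 && x <= 2.0; }

// Models one subgroup running `if (allInvocationsARB(condition))`.
// The vote is evaluated once, so every invocation takes the same branch;
// used_fast_path reports which branch the subgroup took.
std::vector<double> run_subgroup(const std::vector<double>& inputs,
                                 bool& used_fast_path) {
    assert(!inputs.empty());  // assumption: at least one invocation
    used_fast_path = std::all_of(inputs.begin(), inputs.end(), condition);
    std::vector<double> results;
    results.reserve(inputs.size());
    for (double x : inputs) {
        results.push_back(used_fast_path ? do_fast_path(x)
                                         : do_general_path(x));
    }
    return results;
}
```

In this model, a subgroup whose inputs all satisfy code:condition takes the short refinement, and a single input outside the range moves the whole subgroup to the general path, so the subgroup never executes both.
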
--

=== New Object Types

None.

=== New Enum Constants

None.

=== New Enums

None.

=== New Structures

None.

=== New Functions

None.

=== New Built-In Variables

None.

=== New SPIR-V Capabilities

* <<spirvenv-capabilities-table-subgroupvote,SubgroupVoteKHR>>

=== Issues

None.

=== Version History

* Revision 1, 2016-11-28 (Daniel Koch)
- Initial draft