mirror of
https://github.com/status-im/Vulkan-Docs.git
synced 2025-01-27 14:45:55 +00:00
0cc6bba634
* Bump API patch number and header version number to 61 for this update. Github Issues: * Provide alternate length attributes (altlen=) in the XML schema, for those using length attributes to generate code instead of documentation (public issue 555). * Fix erroneous references to `latex:` being used for asciidoc math markup, rather than `latexmath:` (public pull request 556). * Add author ID to XML for Kazan software renderer (public pull request 557). Internal Issues: * Add the <<fundamentals-abi,Application Binary Interface>> section describing platform ABI requirements and recommendations, add examples of function and function pointer declarations to the <<boilerplate-platform-specific-calling-conventions, Platform-Specific Calling Conventions>> section, and remove related language that existed elsewhere in the specification (internal issue 64). * Describe where to document valid usage interactions of chained structures in the style guide, and fix one case now appearing in slink:VkBufferCreateInfo instead of the child slink:VkDedicatedAllocationBufferCreateInfoNV structure (internal issue 715). * Add example to the style guide of describing enumerated types which are empty when the spec is built without relevant extensions enabled, and apply it to existing examples for elink:VkDescriptorSetLayoutCreateFlagBits and elink:VkSubpassDescriptionFlagBits (internal issue 864). * Add a note to the <<fundamentals-validusage-enums, Valid Usage for Enumerated Types>> section that the special values suffixed with etext:_BEGIN_RANGE, etext:_END_RANGE, etext:_RANGE_SIZE and etext:_MAX_ENUM are not part of the API and should: not be used by applications (internal issue 872). * Added note to flink:vkCmdUpdateBuffers explaining the performance penalty for copies done in this way, and why the upper copy limit is what it is (internal issue 952). * Update `VK_KHX_device_group` to split some functionality into the new `VK_KHR_bind_memory2` extension, and rename that functionality (internal issue 969). * Remove *Status* fields from extension appendices, since they are by definition published and complete by the time they reach the public github repository (internal issue 973). Other Issues: * Update Data Format specification dependency to version 1.2 and change references to DF sections accordingly. * Update XML to make the pname:pAllocator parameter of flink:vkRegisterDeviceEventEXT and flink:vkRegisterDisplayEventEXT in the `VK_EXT_display_control` extension as optional. New Extensions: * `VK_KHR_bind_memory2` * `VK_KHR_image_format_list` * `VK_KHR_maintenance2` * `VK_KHR_sampler_ycbcr_conversion`
228 lines
7.8 KiB
Plaintext
228 lines
7.8 KiB
Plaintext
include::meta/VK_NV_viewport_swizzle.txt[]
|
|
|
|
*Last Modified Date*::
|
|
2016-12-22
|
|
*Interactions and External Dependencies*::
|
|
- This extension requires pname:multiViewport and pname:geometryShader
|
|
features to be useful.
|
|
*Contributors*::
|
|
- Daniel Koch, NVIDIA
|
|
- Jeff Bolz, NVIDIA
|
|
|
|
This extension provides a new per-viewport swizzle that can modify the
|
|
position of primitives sent to each viewport.
|
|
New viewport swizzle state is added for each viewport, and a new position
|
|
vector is computed for each vertex by selecting from and optionally negating
|
|
any of the four components of the original position vector.
|
|
|
|
This new viewport swizzle is useful for a number of algorithms, including
|
|
single-pass cubemap rendering (broadcasting a primitive to multiple faces
|
|
and reorienting the vertex position for each face) and voxel rasterization.
|
|
The per-viewport component remapping and negation provided by the swizzle
|
|
allows application code to re-orient three-dimensional geometry with a view
|
|
along any of the X, Y, or Z axes.
|
|
If a perspective projection and depth buffering is required, 1/W buffering
|
|
should be used, as described in the single-pass cubemap rendering example in
|
|
the "Issues" section below.
|
|
|
|
|
|
=== New Object Types
|
|
|
|
None.
|
|
|
|
=== New Enum Constants
|
|
|
|
* Extending elink:VkStructureType:
|
|
|
|
** ename:VK_STRUCTURE_TYPE_PIPELINE_VIEWPORT_SWIZZLE_STATE_CREATE_INFO_NV
|
|
|
|
=== New Enums
|
|
|
|
* elink:VkViewportCoordinateSwizzleNV
|
|
* elink:VkPipelineViewportSwizzleStateCreateFlagsNV
|
|
|
|
=== New Structures
|
|
|
|
* slink:VkViewportSwizzleNV
|
|
* slink:VkPipelineViewportSwizzleStateCreateInfoNV
|
|
|
|
=== New Functions
|
|
|
|
None.
|
|
|
|
=== Issues
|
|
|
|
1) Where does viewport swizzling occur in the pipeline?
|
|
|
|
**RESOLVED**: Despite being associated with the viewport, viewport swizzling
|
|
must happen prior to the viewport transform.
|
|
In particular, it needs to be performed before clipping and perspective
|
|
division.
|
|
|
|
The viewport mask expansion (NV_viewport_array2) and the viewport swizzle
|
|
could potentially be performed before or after transform feedback, but
|
|
feeding back several viewports worth of primitives with different swizzles
|
|
doesn't seem particularly useful.
|
|
This specification applies the viewport mask and swizzle after transform
|
|
feedback, and makes primitive queries only count each primitive once.
|
|
|
|
2) Any interesting examples of how this extension, NV_viewport_array2, and
|
|
NV_geometry_shader_passthrough can be used together in practice?
|
|
|
|
**RESOLVED**: One interesting use case for this extension is for single-pass
|
|
rendering to a cubemap.
|
|
In this example, the application would attach a cubemap texture to a layered
|
|
FBO where the six cube faces are treated as layers.
|
|
Vertices are sent through the vertex shader without applying a projection
|
|
matrix, where the gl_Position output is (x,y,z,1) and the center of the
|
|
cubemap is at (0,0,0).
|
|
With unextended Vulkan, one could have a conventional instanced geometry
|
|
shader that looks something like the following:
|
|
|
|
[source,c]
|
|
---------------------------------------------------
|
|
layout(invocations = 6) in; // separate invocation per face
|
|
layout(triangles) in;
|
|
layout(triangle_strip) out;
|
|
layout(max_vertices = 3) out;
|
|
|
|
in Inputs {
|
|
vec2 texcoord;
|
|
vec3 normal;
|
|
vec4 baseColor;
|
|
} v[];
|
|
|
|
out Outputs {
|
|
vec2 texcoord;
|
|
vec3 normal;
|
|
vec4 baseColor;
|
|
};
|
|
|
|
void main()
|
|
{
|
|
int face = gl_InvocationID; // which face am I?
|
|
|
|
// Project gl_Position for each vertex onto the cube map face.
|
|
vec4 positions[3];
|
|
for (int i = 0; i < 3; i++) {
|
|
positions[i] = rotate(gl_in[i].gl_Position, face);
|
|
}
|
|
|
|
// If the primitive doesn't project onto this face, we're done.
|
|
if (shouldCull(positions)) {
|
|
return;
|
|
}
|
|
|
|
// Otherwise, emit a copy of the input primitive to the
|
|
// appropriate face (using gl_Layer).
|
|
for (int i = 0; i < 3; i++) {
|
|
gl_Layer = face;
|
|
gl_Position = positions[i];
|
|
texcoord = v[i].texcoord;
|
|
normal = v[i].normal;
|
|
baseColor = v[i].baseColor;
|
|
EmitVertex();
|
|
}
|
|
}
|
|
---------------------------------------------------
|
|
|
|
With passthrough geometry shaders, this can be done using a much simpler
|
|
shader:
|
|
|
|
[source,c]
|
|
---------------------------------------------------
|
|
layout(triangles) in;
|
|
layout(passthrough) in Inputs {
|
|
vec2 texcoord;
|
|
vec3 normal;
|
|
vec4 baseColor;
|
|
}
|
|
layout(passthrough) in gl_PerVertex {
|
|
vec4 gl_Position;
|
|
} gl_in[];
|
|
layout(viewport_relative) out int gl_Layer;
|
|
|
|
void main()
|
|
{
|
|
// Figure out which faces the primitive projects onto and
|
|
// generate a corresponding viewport mask.
|
|
uint mask = 0;
|
|
for (int i = 0; i < 6; i++) {
|
|
if (!shouldCull(face)) {
|
|
mask |= 1U << i;
|
|
}
|
|
}
|
|
gl_ViewportMask = mask;
|
|
gl_Layer = 0;
|
|
}
|
|
---------------------------------------------------
|
|
|
|
The application code is set up so that each of the six cube faces has a
|
|
separate viewport (numbered 0..5).
|
|
Each face also has a separate swizzle, programmed via the
|
|
slink:VkPipelineViewportSwizzleStateCreateInfoNV pipeline state.
|
|
The viewport swizzle feature performs the coordinate transformation handled
|
|
by the rotate() function in the original shader.
|
|
The "viewport_relative" layout qualifier says that the viewport number
|
|
(0..5) is added to the base gl_Layer value of zero to determine which layer
|
|
(cube face) the primitive should be sent to.
|
|
|
|
Note that the use of the passed through input <normal> in this example
|
|
suggests that the fragment shader in this example would perform an operation
|
|
like per-fragment lighting.
|
|
The viewport swizzle would transform the position to be face-relative, but
|
|
<normal> would remain in the original coordinate system.
|
|
It seems likely that the fragment shader in either version of the example
|
|
would want to perform lighting in the original coordinate system.
|
|
It would likely do this by reconstructing the position of the fragment in
|
|
the original coordinate system using gl_FragCoord, a constant or uniform
|
|
holding the size of the cube face, and the input gl_ViewportIndex (or
|
|
gl_Layer), which identifies the cube face.
|
|
Since the value of <normal> is in the original coordinate system, it would
|
|
not need to be modified as part of this coordinate transformation.
|
|
|
|
Note that while the rotate() operation in the regular geometry shader above
|
|
could include an arbitrary post-rotation projection matrix, the viewport
|
|
swizzle does not support arbitrary math.
|
|
To get proper projection, 1/W buffering should be used.
|
|
To do this:
|
|
|
|
1.
|
|
Program the viewport swizzles to move the pre-projection W eye coordinate
|
|
(typically 1.0) into the Z coordinate of the swizzle output and the eye
|
|
coordinate component used for depth into the W coordinate.
|
|
For example, the viewport corresponding to the +Z face might use a swizzle
|
|
of (+X, -Y, +W, +Z).
|
|
The Z normalized device coordinate computed after swizzling would then be
|
|
z'/w' = 1/Z_eye.
|
|
|
|
2.
|
|
On NVIDIA implementations supporting floating-point depth buffers with
|
|
values outside [0,1], prevent unwanted near plane clipping by enabling
|
|
DEPTH_CLAMP.
|
|
Ensure that the depth clamp doesn't mess up depth testing by programming the
|
|
depth range to very large values, such as minDepthBounds=-z,
|
|
maxDepthBounds=+z), where z == 2^127.
|
|
It should be possible to use IEEE infinity encodings also (0xFF800000 for
|
|
-INF, 0x7F800000 for +INF).
|
|
Even when near/far clipping is disabled, primitives extending behind the eye
|
|
will still be clipped because one or more vertices will have a negative W
|
|
coordinate and fail X/Y clipping tests.
|
|
|
|
On other implementations, scale X, Y, and Z eye coordinates so that vertices
|
|
on the near plane have a post-swizzle W coordinate of 1.0.
|
|
For example, if the near plane is at Z_eye = 1/256, scale X, Y, and Z by
|
|
256.
|
|
|
|
3.
|
|
Adjust depth testing to reflect the fact that 1/W values are large near the
|
|
eye and small away from the eye.
|
|
Clear the depth buffer to zero (infinitely far away) and use a depth test of
|
|
GREATER instead of LESS.
|
|
|
|
|
|
=== Version History
|
|
|
|
* Revision 1, 2016-12-22 (Piers Daniell)
|
|
- Internal revisions
|