🔬 This is a nightly-only experimental API. (stdarch_nvptx #111199)
Available on target_arch="nvptx64" only.
Platform-specific intrinsics for the NVPTX platform.
See the module documentation for more details.
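Because these intrinsics exist only when `target_arch="nvptx64"`, code that also has to build for the host usually hides them behind a `cfg` gate. The following is a minimal sketch of that pattern, not a prescribed approach: `thread_index_x` is a hypothetical wrapper name, `_thread_idx_x` is assumed to take no arguments and return `i32`, and the device branch additionally assumes a nightly toolchain with `#![feature(stdarch_nvptx)]` enabled at the crate root.

```rust
/// Device build: forward to the intrinsic.
/// Assumes nightly with `#![feature(stdarch_nvptx)]` at the crate root and
/// that `_thread_idx_x` returns `i32` (an assumption, not stated above).
#[cfg(target_arch = "nvptx64")]
fn thread_index_x() -> i32 {
    // The intrinsic is unsafe, so the call needs an unsafe block.
    unsafe { core::arch::nvptx::_thread_idx_x() }
}

/// Host build: there is no GPU thread, so this hypothetical fallback
/// pretends to be thread 0.
#[cfg(not(target_arch = "nvptx64"))]
fn thread_index_x() -> i32 {
    0
}
```

Only one of the two definitions is compiled for any given target, so callers can use `thread_index_x()` without their own `cfg` attributes.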
Structs
- f16x2
Experimental - PTX-specific 32-bit wide floating point (f16 x 2) vector type
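The `f16x2` type holds two half-precision lanes, and the `f16x2_max` / `f16x2_max_nan` pair listed under Functions differs only in how a NaN lane is treated. The sketch below models that difference on the host. It is an illustration under assumptions: `Lanes2`, `max_lanes`, and `max_nan_lanes` are hypothetical names, `f32` lanes stand in for `f16` because stable Rust has no half-precision type, and the plain variant is assumed to return the non-NaN operand, which the listing itself does not state.

```rust
/// Hypothetical host-side stand-in for a two-lane vector.
/// `f32` lanes are used only because stable Rust has no `f16` type.
type Lanes2 = [f32; 2];

/// Lane-wise maximum where a NaN lane is ignored in favour of the other
/// operand (assumed behaviour of the plain `max` variant).
fn max_lanes(a: Lanes2, b: Lanes2) -> Lanes2 {
    // `f32::max` returns the other argument when one argument is NaN.
    [a[0].max(b[0]), a[1].max(b[1])]
}

/// Lane-wise maximum where a NaN in either operand passes through to the
/// result, mirroring the "NaNs pass through" wording.
fn max_nan_lanes(a: Lanes2, b: Lanes2) -> Lanes2 {
    let pick = |x: f32, y: f32| {
        if x.is_nan() || y.is_nan() {
            f32::NAN
        } else {
            x.max(y)
        }
    };
    [pick(a[0], b[0]), pick(a[1], b[1])]
}
```

With inputs `[1.0, NaN]` and `[2.0, 3.0]`, the first function yields a number in both lanes, while the second keeps NaN in the second lane. The `min` pair would follow the same shape with `f32::min`.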
Functions
- __assert_fail ⚠
Experimental - Syscall to be used whenever the assert expression produces a false value.
- _block_dim_x ⚠
Experimental - x-th thread-block dimension.
- _block_dim_y ⚠
Experimental - y-th thread-block dimension.
- _block_dim_z ⚠
Experimental - z-th thread-block dimension.
- _block_idx_x ⚠
Experimental - x-th thread-block index.
- _block_idx_y ⚠
Experimental - y-th thread-block index.
- _block_idx_z ⚠
Experimental - z-th thread-block index.
- _grid_dim_x ⚠
Experimental - x-th block-grid dimension.
- _grid_dim_y ⚠
Experimental - y-th block-grid dimension.
- _grid_dim_z ⚠
Experimental - z-th block-grid dimension.
- _syncthreads ⚠
Experimental - Synchronizes all threads in the block.
- _thread_idx_x ⚠
Experimental - x-th thread index.
- _thread_idx_y ⚠
Experimental - y-th thread index.
- _thread_idx_z ⚠
Experimental - z-th thread index.
- f16x2_add ⚠
Experimental - Add two values, round to nearest even
- f16x2_fma ⚠
Experimental - Fused multiply-add, round to nearest even
- f16x2_max ⚠
Experimental - Find the maximum of two values
- f16x2_max_nan ⚠
Experimental - Find the maximum of two values, NaNs pass through.
- f16x2_min ⚠
Experimental - Find the minimum of two values
- f16x2_min_nan ⚠
Experimental - Find the minimum of two values, NaNs pass through.
- f16x2_mul ⚠
Experimental - Multiply two values, round to nearest even
- f16x2_neg ⚠
Experimental - Arithmetic negate
- f16x2_sub ⚠
Experimental - Subtract two values, round to nearest even
- free ⚠
Experimental - Free previously dynamically allocated memory.
- malloc ⚠
Experimental - Allocate memory dynamically from a fixed-size heap in global memory.
- trap ⚠
Experimental - Generates the trap instruction TRAP
- vprintf ⚠
Experimental - Print formatted output from a kernel to a host-side output stream.
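The block index, block dimension, and thread index intrinsics listed above are commonly combined into a single global index per thread. The sketch below models that arithmetic on the host so it can run without a GPU. `ThreadCoords`, `global_index_x`, and `all_global_indices_x` are hypothetical names introduced for illustration; on the device, the three fields would presumably be read from `_block_idx_x`, `_block_dim_x`, and `_thread_idx_x`.

```rust
/// Hypothetical host-side stand-in for the values one thread would read
/// from `_block_idx_x`, `_block_dim_x`, and `_thread_idx_x` on the device.
#[derive(Clone, Copy, Debug)]
struct ThreadCoords {
    block_idx_x: i32,
    block_dim_x: i32,
    thread_idx_x: i32,
}

/// Flattens block and thread coordinates into one global x index.
fn global_index_x(c: ThreadCoords) -> i32 {
    c.block_idx_x * c.block_dim_x + c.thread_idx_x
}

/// Enumerates the global x index of every thread in a 1-D launch of
/// `grid_dim_x` blocks with `block_dim_x` threads each.
fn all_global_indices_x(grid_dim_x: i32, block_dim_x: i32) -> Vec<i32> {
    let mut out = Vec::new();
    for block_idx_x in 0..grid_dim_x {
        for thread_idx_x in 0..block_dim_x {
            out.push(global_index_x(ThreadCoords {
                block_idx_x,
                block_dim_x,
                thread_idx_x,
            }));
        }
    }
    out
}
```

For a launch of 3 blocks with 4 threads each, the enumerated indices cover 0 through 11 exactly once, which is why this formula is a convenient way to assign one array element to each thread.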