🔬 This is a nightly-only experimental API. (
Available on x86 and target feature
For each packed 16-bit integer, maps the value to the number of logical 1 bits.
Uses the writemask in k: elements are zeroed in the result if the corresponding mask bit is not set. Otherwise, the computation result is written into the result.
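
The masking behavior described above can be illustrated with a scalar model on stable Rust. This is only a sketch, not the intrinsic itself: `maskz_popcnt_u16_model` is a hypothetical helper, the lanes are modeled as a plain `[u16; N]` array rather than a SIMD vector type, and it assumes that bit `i` of `k` controls lane `i`. The lane count of 8 in `main` is likewise an assumption chosen for illustration.

```rust
/// Scalar model of a zero-masked, per-lane population count over 16-bit lanes.
/// Hypothetical helper for illustration; it does not call any intrinsic.
fn maskz_popcnt_u16_model<const N: usize>(k: u32, a: [u16; N]) -> [u16; N] {
    // The mask is modeled as a u32, so at most 32 lanes are supported here.
    assert!(N <= 32, "this model supports at most 32 lanes");

    // Lanes start out zeroed, which is the result for any unset mask bit.
    let mut out = [0u16; N];
    for (i, lane) in a.iter().enumerate() {
        // Assumption: bit i of the mask controls lane i.
        if (k >> i) & 1 == 1 {
            // count_ones() returns the number of 1 bits in the lane (0..=16).
            out[i] = lane.count_ones() as u16;
        }
    }
    out
}

fn main() {
    let a: [u16; 8] = [0x0000, 0x0001, 0x00FF, 0xFFFF, 0x8000, 0x0F0F, 0x1234, 0x7FFF];
    // Mask bits 1, 3, 4 and 6 are set; all other lanes are zeroed.
    let k: u32 = 0b0101_1010;
    let r = maskz_popcnt_u16_model(k, a);
    println!("{:?}", r); // prints [0, 1, 0, 16, 1, 0, 5, 0]
}
```

Lanes 2, 5 and 7 hold nonzero inputs but come out as 0 because their mask bits are clear, which is the zeroing behavior as opposed to copying from a source operand.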