pub fn _mm512_clmulepi64_epi128(
    a: __m512i,
    b: __m512i,
    const IMM8: i32,
) -> __m512iAvailable on x86 and target feature 
vpclmulqdq and target feature avx512f only.Expand description
Performs a carry-less multiplication of two 64-bit polynomials over the finite field GF(2) - in each of the 4 128-bit lanes.
The immediate byte is used for determining which halves of each lane a and b
should be used. Immediate bits other than 0 and 4 are ignored.
All lanes share immediate byte.