PMULLD - Packed MULtiply Low Dword
PMULLD xmm1, xmm2/m128 (S4.1
__m128i _mm_mullo_epi32(__m128i a, __m128i b)
For each DWORD, calculate (1) * (2) and set the lower DWORD of the result to (3).
VPMULLD xmm1, xmm2, xmm3/m128 (V1
__m128i _mm_mullo_epi32(__m128i a, __m128i b)
VPMULLD xmm1{k1}{z}, xmm2, xmm3/m128/m32bcst (V5+VL
__m128i _mm_mask_mullo_epi32(__m128i s, __mmask8 k, __m128i a, __m128i b)
__m128i _mm_maskz_mullo_epi32(__mmask8 k, __m128i a, __m128i b)
For each DWORD, calculate (1) * (2) and set the lower DWORD of the result to (3).
VPMULLD ymm1, ymm2, ymm3/m256 (V2
__m256i _mm256_mullo_epi32(__m256i a, __m256i b)
VPMULLD ymm1{k1}{z}, ymm2, ymm3/m256/m32bcst (V5+VL
__m256i _mm256_mask_mullo_epi32(__m256i s, __mmask8 k, __m256i a, __m256i b)
__m256i _mm256_maskz_mullo_epi32(__mmask8 k, __m256i a, __m256i b)
For each DWORD, calculate (1) * (2) and set the lower DWORD of the result to (3).
VPMULLD zmm1{k1}{z}, zmm2, zmm3/m512/m32bcst (V5
__m512i _mm512_mullo_epi32(__m512i a, __m512i b)
__m512i _mm512_mask_mullo_epi32(__m512i s, __mmask16 k, __m512i a, __m512i b)
__m512i _mm512_maskz_mullo_epi32(__mmask16 k, __m512i a, __m512i b)
For each DWORD, calculate (1) * (2) and set the lower DWORD of the result to (3).
x86/x64 SIMD Instruction List
Feedback