clang 20.0.0git
|
Go to the source code of this file.
Macros | |
#define | _mm_dpwsud_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm256_dpwsud_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm_dpwsuds_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm256_dpwsuds_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm_dpwusd_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm256_dpwusd_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm_dpwusds_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm256_dpwusds_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm_dpwuud_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm256_dpwuud_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm_dpwuuds_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define | _mm256_dpwuuds_epi32(__W, __A, __B) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results. | |
#define _mm256_dpwsud_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUD
instruction.
__W | A 256-bit vector of [8 x int]. |
__A | A 256-bit vector of [16 x short]. |
__B | A 256-bit vector of [16 x unsigned short]. |
Definition at line 82 of file avxvnniint16intrin.h.
#define _mm256_dpwsuds_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUDS
instruction.
__W | A 256-bit vector of [8 x int]. |
__A | A 256-bit vector of [16 x short]. |
__B | A 256-bit vector of [16 x unsigned short]. |
Definition at line 152 of file avxvnniint16intrin.h.
#define _mm256_dpwusd_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWUSD
instruction.
__W | A 256-bit vector of [8 x int]. |
__A | A 256-bit vector of [16 x unsigned short]. |
__B | A 256-bit vector of [16 x short]. |
Definition at line 220 of file avxvnniint16intrin.h.
#define _mm256_dpwusds_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUDS
instruction.
__W | A 256-bit vector of [8 x int]. |
__A | A 256-bit vector of [16 x unsigned short]. |
__B | A 256-bit vector of [16 x short]. |
Definition at line 290 of file avxvnniint16intrin.h.
#define _mm256_dpwuud_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWUUD
instruction.
__W | A 256-bit vector of [8 x unsigned int]. |
__A | A 256-bit vector of [16 x unsigned short]. |
__B | A 256-bit vector of [16 x unsigned short]. |
Definition at line 358 of file avxvnniint16intrin.h.
#define _mm256_dpwuuds_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUDS
instruction.
__W | A 256-bit vector of [8 x unsigned int]. |
__A | A 256-bit vector of [16 x unsigned short]. |
__B | A 256-bit vector of [16 x unsigned short]. |
Definition at line 428 of file avxvnniint16intrin.h.
#define _mm_dpwsud_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUD
instruction.
__W | A 128-bit vector of [4 x int]. |
__A | A 128-bit vector of [8 x short]. |
__B | A 128-bit vector of [8 x unsigned short]. |
Definition at line 48 of file avxvnniint16intrin.h.
#define _mm_dpwsuds_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of signed 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUDS
instruction.
__W | A 128-bit vector of [4 x int]. |
__A | A 128-bit vector of [8 x short]. |
__B | A 128-bit vector of [8 x unsigned short]. |
Definition at line 117 of file avxvnniint16intrin.h.
#define _mm_dpwusd_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWUSD
instruction.
__W | A 128-bit vector of [4 x int]. |
__A | A 128-bit vector of [8 x unsigned short]. |
__B | A 128-bit vector of [8 x short]. |
Definition at line 186 of file avxvnniint16intrin.h.
#define _mm_dpwusds_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding signed 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUDS
instruction.
__W | A 128-bit vector of [4 x int]. |
__A | A 128-bit vector of [8 x unsigned short]. |
__B | A 128-bit vector of [8 x short]. |
Definition at line 255 of file avxvnniint16intrin.h.
#define _mm_dpwuud_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWUUD
instruction.
__W | A 128-bit vector of [4 x unsigned int]. |
__A | A 128-bit vector of [8 x unsigned short]. |
__B | A 128-bit vector of [8 x unsigned short]. |
Definition at line 324 of file avxvnniint16intrin.h.
#define _mm_dpwuuds_epi32 | ( | __W, | |
__A, | |||
__B | |||
) |
Multiply groups of 2 adjacent pairs of unsigned 16-bit integers in __A with corresponding unsigned 16-bit integers in __B, producing 2 intermediate signed 16-bit results.
Sum these 2 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPWSUDS
instruction.
__W | A 128-bit vector of [4 x unsigned int]. |
__A | A 128-bit vector of [8 x unsigned short]. |
__B | A 128-bit vector of [8 x unsigned short]. |
Definition at line 393 of file avxvnniint16intrin.h.