clang 20.0.0git
Macros
#define _mm_dpbssd_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm256_dpbssd_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm_dpbssds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm256_dpbssds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm_dpbsud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm256_dpbsud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm_dpbsuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm256_dpbsuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
#define _mm_dpbuud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
#define _mm256_dpbuud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
#define _mm_dpbuuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
#define _mm256_dpbuuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
#define _mm256_dpbssd_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSSD instruction.
__A | A 256-bit vector of [32 x char]. |
__B | A 256-bit vector of [32 x char]. |
Definition at line 81 of file avxvnniint8intrin.h.
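The per-lane arithmetic described above can be sketched in scalar C. This is an illustrative model only, not code from avxvnniint8intrin.h: the helper name dpbssd_lane is hypothetical, and the sketch assumes that the non-saturating form wraps modulo 2^32 when the sum overflows.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of avxvnniint8intrin.h): scalar model of one
 * 32-bit lane of the non-saturating signed x signed dot product. */
static int32_t dpbssd_lane(int32_t w, const int8_t a[4], const int8_t b[4])
{
    assert(a != NULL && b != NULL);

    /* Accumulate in unsigned arithmetic so that overflow wraps instead of
     * being undefined behavior. */
    uint32_t acc = (uint32_t)w;
    for (int i = 0; i < 4; ++i) {
        /* Both bytes are sign-extended; the product lies in
         * [-16256, 16384] and therefore fits in 16 signed bits. */
        int32_t prod = (int32_t)a[i] * (int32_t)b[i];
        acc += (uint32_t)prod;
    }

    /* Converting an out-of-range value back to int32_t is
     * implementation-defined before C23; GCC and Clang wrap. */
    return (int32_t)acc;
}
```

A 256-bit call would apply this model to each of the 8 lanes, with lane j reading bytes 4*j through 4*j+3 of __A and __B.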
#define _mm256_dpbssds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSSDS instruction.
__A | A 256-bit vector of [32 x char]. |
__B | A 256-bit vector of [32 x char]. |
Definition at line 151 of file avxvnniint8intrin.h.
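The effect of signed saturation can be illustrated with a scalar model of one lane. The helper name dpbssds_lane is hypothetical, and the sketch assumes that the clamp is applied once, to the full sum of the accumulator and the four products, rather than after each addition.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of avxvnniint8intrin.h): scalar model of one
 * 32-bit lane of the saturating signed x signed dot product. */
static int32_t dpbssds_lane(int32_t w, const int8_t a[4], const int8_t b[4])
{
    assert(a != NULL && b != NULL);

    /* A 64-bit accumulator holds the exact sum, so the clamp below sees the
     * true mathematical result. */
    int64_t acc = w;
    for (int i = 0; i < 4; ++i)
        acc += (int64_t)a[i] * (int64_t)b[i];

    /* Clamp to the signed 32-bit range instead of wrapping. */
    if (acc > INT32_MAX)
        return INT32_MAX;
    if (acc < INT32_MIN)
        return INT32_MIN;
    return (int32_t)acc;
}
```

With an accumulator already at INT32_MAX and positive products, this model returns INT32_MAX, whereas a wrapping sum would turn negative.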
#define _mm256_dpbsud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSUD instruction.
__A | A 256-bit vector of [32 x char]. |
__B | A 256-bit vector of [32 x unsigned char]. |
Definition at line 219 of file avxvnniint8intrin.h.
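The mixed-sign form differs from the signed form only in how the bytes of __B are widened. The scalar sketch below makes that explicit through the parameter types; the helper name dpbsud_lane is hypothetical, and wraparound on overflow is an assumption of the model.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of avxvnniint8intrin.h): scalar model of one
 * 32-bit lane of the non-saturating signed x unsigned dot product. */
static int32_t dpbsud_lane(int32_t w, const int8_t a[4], const uint8_t b[4])
{
    assert(a != NULL && b != NULL);

    uint32_t acc = (uint32_t)w;  /* unsigned so that overflow wraps */
    for (int i = 0; i < 4; ++i) {
        /* a[i] is sign-extended and b[i] is zero-extended, so the byte 0xFF
         * means -1 in a but 255 in b. The product lies in
         * [-32640, 32385] and fits in 16 signed bits. */
        int32_t prod = (int32_t)a[i] * (int32_t)b[i];
        acc += (uint32_t)prod;
    }
    return (int32_t)acc;  /* implementation-defined before C23; wraps on GCC/Clang */
}
```

Operand order matters here: passing the unsigned data as __A and the signed data as __B would reinterpret both inputs.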
#define _mm256_dpbsuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSUDS instruction.
__A | A 256-bit vector of [32 x char]. |
__B | A 256-bit vector of [32 x unsigned char]. |
Definition at line 289 of file avxvnniint8intrin.h.
#define _mm256_dpbuud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBUUD instruction.
__A | A 256-bit vector of [32 x unsigned char]. |
__B | A 256-bit vector of [32 x unsigned char]. |
Definition at line 357 of file avxvnniint8intrin.h.
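For the unsigned form, a scalar sketch of one lane shows the range of the intermediate products. The helper name dpbuud_lane is hypothetical; the model treats the accumulator as a 32-bit pattern that wraps on overflow, which is an assumption of this sketch.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of avxvnniint8intrin.h): scalar model of one
 * 32-bit lane of the non-saturating unsigned x unsigned dot product. */
static uint32_t dpbuud_lane(uint32_t w, const uint8_t a[4], const uint8_t b[4])
{
    assert(a != NULL && b != NULL);

    uint32_t acc = w;
    for (int i = 0; i < 4; ++i) {
        /* Both bytes are zero-extended; each product lies in [0, 65025]. */
        uint32_t prod = (uint32_t)a[i] * (uint32_t)b[i];
        acc += prod;  /* unsigned addition wraps modulo 2^32 */
    }
    return acc;
}
```

The largest product, 255 * 255 = 65025, exceeds 32767, so the intermediates need the full unsigned 16-bit range.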
#define _mm256_dpbuuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W with unsigned saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBUUDS instruction.
__A | A 256-bit vector of [32 x unsigned char]. |
__B | A 256-bit vector of [32 x unsigned char]. |
Definition at line 426 of file avxvnniint8intrin.h.
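A scalar sketch of one lane of the saturating unsigned form follows. The helper name dpbuuds_lane is hypothetical. The sketch assumes that the accumulator is treated as an unsigned 32-bit value and that the sum is clamped to the unsigned 32-bit range, which is how Intel's instruction reference describes VPDPBUUDS.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of avxvnniint8intrin.h): scalar model of one
 * 32-bit lane of the saturating unsigned x unsigned dot product. */
static uint32_t dpbuuds_lane(uint32_t w, const uint8_t a[4], const uint8_t b[4])
{
    assert(a != NULL && b != NULL);

    /* The exact sum is at most UINT32_MAX + 4 * 65025, which fits easily in
     * 64 bits. */
    uint64_t acc = w;
    for (int i = 0; i < 4; ++i)
        acc += (uint64_t)a[i] * (uint64_t)b[i];

    /* Assumed behavior: clamp at the top of the unsigned 32-bit range.
     * The sum cannot go below zero, so no lower clamp is needed. */
    if (acc > UINT32_MAX)
        return UINT32_MAX;
    return (uint32_t)acc;
}
```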
#define _mm_dpbssd_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSSD instruction.
__A | A 128-bit vector of [16 x char]. |
__B | A 128-bit vector of [16 x char]. |
Definition at line 47 of file avxvnniint8intrin.h.
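The following sketch shows how a call to this macro might look next to a portable fallback that spells out the lane layout. The wrapper name dot_accumulate_i8x16 is hypothetical. The sketch assumes that the compiler defines __AVXVNNIINT8__ when the feature is enabled (for example with -mavxvnniint8) and that the intrinsic is reachable through immintrin.h; without that macro only the scalar path is compiled.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__AVXVNNIINT8__)
#include <immintrin.h>
#endif

/* Hypothetical wrapper: for each lane j in 0..3, add the dot product of
 * bytes 4*j..4*j+3 of a and b to w[j]. */
static void dot_accumulate_i8x16(int32_t w[4], const int8_t a[16],
                                 const int8_t b[16])
{
    assert(w != NULL && a != NULL && b != NULL);
#if defined(__AVXVNNIINT8__)
    /* Unaligned loads, so the caller's arrays need no special alignment. */
    __m128i vw = _mm_loadu_si128((const __m128i *)w);
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    vw = _mm_dpbssd_epi32(vw, va, vb);
    _mm_storeu_si128((__m128i *)w, vw);
#else
    /* Scalar fallback with the same lane layout; wraps on overflow. */
    for (int j = 0; j < 4; ++j) {
        uint32_t acc = (uint32_t)w[j];
        for (int k = 0; k < 4; ++k)
            acc += (uint32_t)((int32_t)a[4 * j + k] * (int32_t)b[4 * j + k]);
        w[j] = (int32_t)acc;
    }
#endif
}
```

Keeping the accumulator as the first argument and result lets repeated calls chain naturally when summing over longer byte arrays.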
#define _mm_dpbssds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding signed 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSSDS instruction.
__A | A 128-bit vector of [16 x char]. |
__B | A 128-bit vector of [16 x char]. |
Definition at line 116 of file avxvnniint8intrin.h.
#define _mm_dpbsud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSUD instruction.
__A | A 128-bit vector of [16 x char]. |
__B | A 128-bit vector of [16 x unsigned char]. |
Definition at line 185 of file avxvnniint8intrin.h.
#define _mm_dpbsuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of signed 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate signed 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W with signed saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBSUDS instruction.
__A | A 128-bit vector of [16 x char]. |
__B | A 128-bit vector of [16 x unsigned char]. |
Definition at line 254 of file avxvnniint8intrin.h.
#define _mm_dpbuud_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBUUD instruction.
__A | A 128-bit vector of [16 x unsigned char]. |
__B | A 128-bit vector of [16 x unsigned char]. |
Definition at line 323 of file avxvnniint8intrin.h.
#define _mm_dpbuuds_epi32(__W, __A, __B)
Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in __A with corresponding unsigned 8-bit integers in __B, producing 4 intermediate unsigned 16-bit results.
Sum these 4 results with the corresponding 32-bit integer in __W with unsigned saturation, and store the packed 32-bit results in dst.
This intrinsic corresponds to the VPDPBUUDS instruction.
__A | A 128-bit vector of [16 x unsigned char]. |
__B | A 128-bit vector of [16 x unsigned char]. |
Definition at line 392 of file avxvnniint8intrin.h.