FP16 optimization on CPU #18701

jeyblu · 2023-12-05T00:09:07Z

Is FP16 code optimized with AVX or other vector instructions on CPU? Thanks.

chenfucn · 2023-12-05T23:29:42Z

Your understanding is correct: it is not optimized for AVX. We have an optimized kernel for ARM NEON chips and are in the process of refining it:
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/mlas/lib/halfgemm_kernel_neon.cpp
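
For readers who want to check which case applies to their own machine, here is a minimal sketch using only the Python standard library. The helper name `fp16_cpu_kernel_hint` and the lists of architecture strings are assumptions made for illustration; they are not part of ONNX Runtime. The sketch only restates the reply above, and an arm64 result is a necessary rather than sufficient condition, since the CPU must also support half-precision arithmetic.

```python
import platform

# Assumption: these strings are common values of platform.machine();
# the lists are illustrative, not exhaustive.
ARM64_NAMES = {"aarch64", "arm64"}
X86_64_NAMES = {"x86_64", "amd64"}


def fp16_cpu_kernel_hint(machine: str) -> str:
    """Hypothetical helper: map a machine string to the status described in this thread."""
    name = machine.lower()
    if name in ARM64_NAMES:
        return "arm64: NEON half-precision GEMM kernel may apply"
    if name in X86_64_NAMES:
        return "x86-64: no AVX-optimized FP16 kernel per this thread"
    return "unknown architecture"


if __name__ == "__main__":
    # Report the hint for the machine running this script.
    print(fp16_cpu_kernel_hint(platform.machine()))
```

This reflects the state of things as of this reply (December 2023); it does not query ONNX Runtime itself, so treat the output as a rough hint only.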

1 reply

Are customers asking for FP16 support on ARM NEON chips but not on Intel/AMD chips? Thanks.

deischi · 2024-01-31T12:50:57Z

But honestly I do not know details about that.