Hi, Intel technician,
I built my application with the Intel 2017 Fortran compiler twice, once with AVX2 instructions and once with the default SSE2. On a Broadwell computer, the AVX2 build runs slower than the SSE2 build. My application is OpenMP-based parallel Fortran code that is both vectorized and threaded. Could you tell me when I should use AVX2 instead of SSE2 when building my application with the Intel 2017 Fortran compiler? My goal is always to run my application as fast as possible. Which instruction set do you recommend for the best performance of my application on the Broadwell computer?
I look forward to hearing from you. Thanks in advance.
Best regards,
Dingjun