It turns out, that by using -march=znver1 (or -march=native ) gcc skips some loops even though they can be vectorized. Why does this happen?
確定! 回上一頁