Data-Level Parallelism in Vector, SIMD, and GPU Architecture
$10-30 USD
Zaprt
Objavljeno pred več kot 4 leti
$10-30 USD
Plačilo ob dostavi
Consider the possibility of unrolling the loop and mapping multiple iterations to vector operations. Assume that you can use scatter-gather loads and stores (vldi and vsti). How does this affect the way you can write the RV64Vcode for this kernel?
I am looking for the correct answer for this question