subject

Assume a gpu architecture that contains 10 simd processors. each simd instruction is 32 bits, and each simd processor contains 8 lanes for single precision arithmetic, and load/store instructions, meaning that each non-diverged simd instruction can produce 32 results every 4 cycles. assume a kernel that has divergent branches that causes, on average 80% of threads to be active. also assume that 70% of all instructions are sp arithmetic and 20% load/store. because not all memory latencies are covered, assume an average simd instruction issue rate of 0.85. assume the gpu has a clock speed of 1.5 ghz. compute the throughput, in gflop/sec, for this code on this gpu.

ansver
Answers: 2

Another question on Computers and Technology

question
Computers and Technology, 23.06.2019 07:00
Why is investing in a mutual fund less risky than investing in a particular company's stock? a. mutual funds only invest in blue-chip stocks. b. investments in mutual funds are more liquid. c. mutual funds hold a diversified portfolio of stocks. d. investments in mutual funds offer a higher rate of return.
Answers: 2
question
Computers and Technology, 23.06.2019 17:00
What does the faves button do? a. users mark a web page as a favorite b. leads other readers to favor a specific page c. readers sort and align their favicons, or favorite icons d. leads users to a message board where they can post questions
Answers: 1
question
Computers and Technology, 24.06.2019 10:00
In which view can you see speaker notes?
Answers: 1
question
Computers and Technology, 24.06.2019 18:00
Hacer un algoritmo que me permita ingresar el nombre de una parcela de terreno y muestre junto al mensaje “tipo de suelos: suelos fumíferos, ¡excelente!
Answers: 1
You know the right answer?
Assume a gpu architecture that contains 10 simd processors. each simd instruction is 32 bits, and ea...
Questions
question
Mathematics, 14.04.2021 21:40
question
English, 14.04.2021 21:40
question
Mathematics, 14.04.2021 21:40