- Architecture
(= internals) of
a graphics card "computer":
Explanation:
- The GPU fetches
an instruction from
its "device" memory
(= memory in the
graphics card)
- GPU determines
(= decodes)
what instruction it has
fetched
and
passes this
(one single) instruction
to
all the ALUs
- There are 1024 ALU units
or more in
one graphics card !!!
|
- Each ALU performs the
(same) requested computation using
the values in
its own
registers/cache memory
- The ALU may also
update one of
its own register when
it executes the
instruction
|
-
Example execution: executing
the instruction
add R0, R1, R2
(R0 = R1 + R2)
- The GPU fetches the
instruction "add R0, R1, R2"
from the "device" memory:
- And sends the
instruction "add R0, R1, R2" to
all its ALU units for
execution:
Notice that:
- The GPU will
execute
the same instruction using
many different values
(because each ALU has its
own set of
registers !!!)
- So a GPU is a
Single Instruction Multiple Data
(SIMD) computer !!!
|
|
|