Parallel grouping γ_L operation

Recall: the uni-processor γ_L operation

One-pass algorithm:

   initialize a search structure H on the grouping attributes of γ;

   /* Process the statistics for each group */
   while ( R has more data blocks )
   {
      read 1 data block into buffer b;

      for ( each tuple t ∈ b )
      {
         /* We need a search structure H to implement the test
            t ∈ H efficiently: we can use a hash table or a
            binary search tree */
         if ( t ∈ H )
         {
            Update the statistics for group(t);
         }
         else
         {
            insert t in H;
            Initialize the statistics for group(t);
         }
      }
   }

   /* Now we can output the aggregate function for each group */
   for ( each group ∈ H )
   {
      Output group search key + statistics;
   }
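The one-pass algorithm above can be sketched in Python as follows. This is only an illustration under some assumptions that are not in the notes: the relation is given as a list of blocks (lists of tuples), the search structure H is a Python dict, the statistics kept per group are COUNT and SUM, and the names one_pass_group, key_of and value_of are hypothetical.

```python
def one_pass_group(blocks, key_of, value_of):
    """One-pass grouping that keeps COUNT and SUM for every group.

    blocks   : iterable of data blocks, each block a list of tuples
    key_of   : function returning the grouping attribute value(s) of a tuple
    value_of : function returning the value to aggregate
    """
    H = {}  # search structure: group key -> [count, sum]

    for block in blocks:              # "read 1 data block into buffer b"
        for t in block:
            k = key_of(t)
            if k in H:                # group already seen: update statistics
                H[k][0] += 1
                H[k][1] += value_of(t)
            else:                     # new group: initialize statistics
                H[k] = [1, value_of(t)]

    # Output the group search key + statistics
    return {k: (count, total) for k, (count, total) in H.items()}


blocks = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)]]
result = one_pass_group(blocks, key_of=lambda t: t[0], value_of=lambda t: t[1])
# result == {"a": (2, 4), "b": (1, 2), "c": (1, 4)}
```

A dict gives the t ∈ H test in expected constant time, which is the role the hash table plays in the pseudocode.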

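Buffer utilization when there are M buffers available: 1 buffer holds the current input block b, and the remaining M-1 buffers hold the search structure H (one entry per group).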

Naive parallel execution of γ_L will fail
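- Naive parallel grouping γ_L: each processor runs the uni-processor γ_L algorithm on its own fragment of relation R and outputs its local groups.
- Why the naive parallel algorithm does not work: tuples of the same group can be stored on different processors, so each of those processors outputs its own partial statistics for that group, and the combined output contains several rows for one group.
- Cause of the problem: tuples with equal grouping attribute values are spread over several processors.
  Consequently: all tuples of a group must be brought to the same processor before the statistics of that group are computed.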
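A small Python sketch of the failure, assuming a COUNT aggregate and two hypothetical fragments (the data and the name local_count are made up for illustration):

```python
from collections import Counter


def local_count(fragment):
    """Uni-processor COUNT grouping on one fragment (group key = first field)."""
    return Counter(t[0] for t in fragment)


# Two processors, each holding one fragment of R.
# Group "a" has tuples on both processors.
fragments = [
    [("a", 1), ("b", 2)],   # fragment on processor 0
    [("a", 3), ("c", 4)],   # fragment on processor 1
]

# Naive parallel grouping: concatenate the local results of all processors.
naive_output = [
    (key, count)
    for frag in fragments
    for key, count in sorted(local_count(frag).items())
]
# Group "a" is reported twice, each time with a partial count of 1,
# instead of once with the correct count of 2.
```

Here naive_output contains two rows for group "a", which is exactly the kind of incorrect result described in the list above.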

Fixed parallel γ_L algorithm

Fixing the parallel grouping algorithm:

First: re-distribute the tuples (using hashing) according to their grouping attribute values, so that all tuples of a group end up on the same processor.
Then: each processor executes the (naive) uni-processor γ_L operation locally on its fragment of relation R.
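The two steps can be sketched in Python as below. This is a single-process simulation under stated assumptions, not a real distributed implementation: the processors are simulated by list indices, the aggregate is COUNT, the grouping key is a string in the first field, and the names node_of, redistribute and parallel_group_count are hypothetical. zlib.crc32 is used as the hash function only because Python's built-in hash() of strings differs between runs.

```python
import zlib
from collections import Counter


def node_of(key, P):
    """Map a grouping key to one of P processors with a deterministic hash."""
    return zlib.crc32(key.encode("utf-8")) % P


def redistribute(fragments, P):
    """Step 1: send every tuple to the processor chosen by hashing its key."""
    new_fragments = [[] for _ in range(P)]
    for frag in fragments:
        for t in frag:
            new_fragments[node_of(t[0], P)].append(t)
    return new_fragments


def parallel_group_count(fragments):
    """Step 2: every processor groups its own (re-distributed) fragment."""
    P = len(fragments)
    result = {}
    for frag in redistribute(fragments, P):   # one iteration = one processor
        local = Counter(t[0] for t in frag)
        # After hashing, no group can show up on two processors.
        assert not (local.keys() & result.keys())
        result.update(local)
    return result


fragments = [
    [("a", 1), ("b", 2)],
    [("a", 3), ("c", 4)],
]
counts = parallel_group_count(fragments)
# counts == {"a": 2, "b": 1, "c": 1}
```

Because equal keys always hash to the same processor, the local results are disjoint and can simply be concatenated.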

Performance of the parallel γ_L operation

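The parallel γ_L runs in 2 phases: (1) re-distributing the tuples and (2) performing the uni-processor grouping locally. The cost of each phase is: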

Re-distributing the tuples:

Read all disk blocks (to perform the hashing re-distribution): B(R) blocks
Transfer ((P-1)/P) × B(R) blocks to other nodes (assuming a uniform distribution of the grouping attribute values, a tuple stays on its own node with probability 1/P)
Write the re-distributed tuples back to disk: B(R) blocks
(We assume the relation fragments are very large and cannot be stored in memory)

Performing the uni-processor grouping γ_L on the tuples:

Read the tuples from disk: B(R) blocks
Write the (grouping) result tuples to disk: 0 blocks
(Because the output of the last operation is not counted)

Total amount of work:

Disk read/write:  3 × B(R) blocks
Transferred:      ((P-1)/P) × B(R) blocks

Amount of work per processor:

Disk read/write:  (3/P) × B(R) blocks
Transferred:      ((P-1)/P²) × B(R) blocks

(Transfer cost can be ignored in shared nothing processors)
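As a worked example of the cost figures, the following Python sketch evaluates them for concrete numbers. It assumes the transferred volume is the fraction (P-1)/P of B(R), as implied by the uniform-distribution assumption, and that the work is spread evenly over the P processors; the function name parallel_grouping_cost and the sample values are hypothetical.

```python
from fractions import Fraction


def parallel_grouping_cost(B_R, P):
    """Cost of the parallel grouping, in blocks.

    B_R : number of blocks of relation R
    P   : number of processors
    """
    # Read R + write re-distributed R + read R again for the local grouping.
    total_disk = 3 * B_R
    # With uniform hashing, a tuple leaves its node with probability (P-1)/P.
    total_transfer = Fraction(P - 1, P) * B_R
    return {
        "total_disk": total_disk,
        "total_transfer": total_transfer,
        "disk_per_processor": Fraction(total_disk, P),
        "transfer_per_processor": total_transfer / P,
    }


cost = parallel_grouping_cost(B_R=1000, P=10)
# total_disk = 3000, total_transfer = 900,
# disk_per_processor = 300, transfer_per_processor = 90
```

Fractions are used so that the results stay exact when P does not divide B(R).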