Execution Models in Databases Query Processing

Posted Apr 10, 2024 Updated Apr 14, 2024

In (CS) Learning Note

1 min read

A physical query plan is a tree of physical plan operators
The execution model defines:
- the interface that connects operators to each other
  - Relation(s) in, relation out
  - Producer-consumer relationship
- how data is propagated between operators
- how operators are scheduled

Contents

Synchronous Execution Models
- Pipelining
- Iterators
Asynchronous Execution Models

Synchronous Execution Models

The operator interface is synchronous:
- Operators don’t generate tuples until getNext() is called
- In reality, different operators will have different evaluation times
- Some operators may block – causing the whole plan to block

Pipelining

Pipelining: read input, process, propagate output to next operator
Benefits of pipelining:
- No buffering (because no materialisation)
- Faster execution (no materialisation, so no disk I/Os)
- More in-memory operations
Not all operators can be pipelined
- Some require intermediate relations to be materialised
- Some operators will always block

Iterators

Standard interface on each operator:
- open()
- getNext()
- close()
Query engine calls the interface on the root operator
Calls to interface are propagated down the tree

Asynchronous Execution Models

Asynchronous implementation by introducing buffering:
- Within the operator calling the interface (the push model)
- Within the operator being called (the pull model)
- In the connections between operators (the stream model)
Asynchronous implementations minimise time during which blocking occurs

The Push Model

Propagate from the leaves upwards
- Producer propagates tuples as soon as they’re available
- Producer propagates tuples regardless of whether consumer has yet called getNext()
- Consumer buffers incoming tuples until it calls getNext()
Minimises idle time, good for pipelining

The Pull Model

Propagation driven from the root
- Producer buffers tuples until getNext() is called
- On-demand, close to pure implementation

The Stream Model

Connections as first-class objects:
- FIFO queues of tuples
- Producer propagates tuples to the queue as soon as they’re available
- Consumer call to getNext() does not block if there’s something in the queue
Asynchronous operators (but synchronous streams), good for parallelisation

(CS) Learning Note, Database

computer science Database

This post is licensed under CC BY 4.0 by the author.

Recently Updated

Trending Tags

software engineering computer science OOP code Data Structure Java Clean Code Algorithm Database software verification

Contents

Trending Tags

software engineering computer science OOP code Data Structure Java Clean Code Algorithm Database software verification

A new version of content is available.