Data Processing - (Pipeline | Compose | Chain)

About

A pipeline is a finite or infinite automaton (also known as a stream) where:

A pipeline creates a composition relationship.

A pipeline is also known as:

A pipeline follows a compositional structure known as a cascade of operations.

Flow vs Pipeline

A dataflow (data workflow) is:

Model

A pipeline is a combination of:

Type

Imperative

The pipeline is executed step by step: each operation runs as soon as it is called.
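A minimal Python sketch of the imperative style, assuming a hypothetical list of numbers as input (the names `numbers`, `evens`, `squares` and `total` are illustrative only). Each statement runs immediately and produces its full intermediate result before the next step starts.

```python
# Imperative (eager) pipeline: every step runs as soon as it is written.
numbers = [1, 2, 3, 4, 5, 6]  # hypothetical input data

evens = [n for n in numbers if n % 2 == 0]  # step 1: filter, executed now
squares = [n * n for n in evens]            # step 2: map, executed now
total = sum(squares)                        # step 3: reduce, executed now

print(total)
```

The intermediate lists `evens` and `squares` exist in memory between steps, which is the main practical difference from the declarative style.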

Declarative

The pipeline is executed only when the terminal operation is called.

Until then, each step only adds to a composite type known as an algebraic data type.
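The deferred execution described above can be sketched in Python as follows. This is only an illustration under stated assumptions: `LazyPipeline`, its `map`, `filter` and `collect` methods, and the `traced_square` helper are hypothetical names, not the API of any particular library. The intermediate calls only record steps; the work happens when the terminal operation `collect` is called.

```python
from typing import Any, Callable, Iterable, Iterator, List


class LazyPipeline:
    """Hypothetical declarative pipeline: steps are recorded, not executed."""

    def __init__(self, source: Iterable[Any]) -> None:
        self._source = source
        self._steps: List[Callable[[Iterator[Any]], Iterator[Any]]] = []

    def map(self, fn: Callable[[Any], Any]) -> "LazyPipeline":
        # Intermediate operation: only remembers what to do later.
        self._steps.append(lambda items: (fn(x) for x in items))
        return self

    def filter(self, predicate: Callable[[Any], bool]) -> "LazyPipeline":
        # Intermediate operation: only remembers what to do later.
        self._steps.append(lambda items: (x for x in items if predicate(x)))
        return self

    def collect(self) -> list:
        # Terminal operation: applies the recorded steps in order.
        items = iter(self._source)
        for step in self._steps:
            items = step(items)
        return list(items)


calls = []  # records every element actually processed by the map step


def traced_square(n: int) -> int:
    calls.append(n)
    return n * n


pipeline = LazyPipeline(range(1, 7)).filter(lambda n: n % 2 == 0).map(traced_square)
calls_before = list(calls)   # snapshot taken before the terminal operation
result = pipeline.collect()  # execution happens here

print(calls_before, result)
```

Comparing `calls_before` with `calls` after `collect()` shows that building the pipeline did no work on the data.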

Example

Shell

In an OS shell (DOS, bash), a series of commands connected by pipe operators forms a pipeline. See Shell Data Processing - Pipeline.

Code

By returning the calling object from a function, you can compose (or chain) functions. See Design Pattern - (Object) Builder. When operations are composed (chained), the output of one operation becomes the input of the next, and operations are applied from left to right.
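Both ideas in the paragraph above can be sketched in Python. The `Calculator` class and the `pipe` helper are hypothetical names introduced only for illustration: the first chains methods by returning `self`, the second composes plain functions with `functools.reduce`. In both cases the order of the calls determines the result.

```python
from functools import reduce
from typing import Any, Callable


class Calculator:
    """Hypothetical class whose methods return self so calls can be chained."""

    def __init__(self, value: float) -> None:
        self.value = value

    def add(self, n: float) -> "Calculator":
        self.value += n
        return self  # returning the calling object enables chaining

    def multiply(self, n: float) -> "Calculator":
        self.value *= n
        return self


def pipe(value: Any, *functions: Callable[[Any], Any]) -> Any:
    """Feed value through functions from left to right."""
    return reduce(lambda acc, fn: fn(acc), functions, value)


# Method chaining: (2 + 3) * 4, applied left to right.
chained = Calculator(2).add(3).multiply(4).value

# Same operations in the other order: (2 * 4) + 3.
reordered = Calculator(2).multiply(4).add(3).value

# Function composition: the output of each function feeds the next one.
piped = pipe(2, lambda x: x + 3, lambda x: x * 4)

print(chained, reordered, piped)
```

Mutating and returning `self` keeps the sketch short; a variant that returns a new object from each method would chain the same way without shared state.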

MapReduce

MapReduce - Pipeline

Library

Documentation / Reference