Data Processing Pipelines

This course is an introduction to software systems, for working with large datasets and managing data processing jobs. It provides hands-on experience with scripting, data sources, data parallelism, data streams, software development and deployment infrastructure, and distributed computing.

This section is WIP is slowly being updated. 🚧