Data Processing Pipelines
This course is an introduction to software systems for working with large datasets and managing data processing jobs. It provides hands-on experience with scripting, data sources, data parallelism, data streams, software development and deployment infrastructure, and distributed computing.
This section is a work in progress and is slowly being updated. 🚧