← All projects

Distributed Systems Engineer · University of Michigan Course Project

Distributed MapReduce

A fault-tolerant MapReduce framework that coordinates parallel map and reduce jobs across networked worker processes.

Overview

What I built

This Python framework implements the core mechanics behind distributed batch processing. A central manager schedules jobs and tracks worker state, while workers execute mapper and reducer programs, partition intermediate output, externally sort data, and merge partitions for reduction.