Distributed Systems Engineer · University of Michigan Course Project
Distributed MapReduce
A fault-tolerant MapReduce framework that coordinates parallel map and reduce jobs across networked worker processes.
Overview
What I built
This Python framework implements the core mechanics behind distributed batch processing. A central manager schedules jobs and tracks worker state, while workers execute mapper and reducer programs, partition intermediate output, externally sort data, and merge partitions for reduction.