Introduction to MapReduce

U. Vijaya Saradhi, Director of Engineering, MapR

DATE & TIME: 27 September 2013, 4 PM
VENUE: SEMINAR ROOM, SCIS


MapReduce is a programming model for processing large data sets with a parallel, distributed algorithm on a cluster. A MapReduce program comprises a Map() procedure that performs filtering and sorting (such as sorting students by first name into queues, one queue for each name) and a Reduce() procedure that performs a summary operation (such as counting the number of students in each queue, yielding name frequencies). The "MapReduce System" (also called the "infrastructure" or "framework") orchestrates the processing: it marshals the distributed servers, runs the various tasks in parallel, manages all communication and data transfer between the parts of the system, and provides redundancy and fault tolerance. The model is inspired by the map and reduce functions commonly used in functional programming.
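
The student example above can be sketched on a single machine. This is only an illustration of the model, not the API of MapR or any other framework: the function names (map_phase, shuffle, reduce_phase, name_frequencies) and the sample student records are hypothetical, and the in-memory shuffle step stands in for the grouping that a real MapReduce system performs across the cluster.

```python
from collections import defaultdict


def map_phase(students):
    """Map step: emit a (first_name, 1) pair for every student record."""
    for first_name, _last_name in students:
        yield first_name, 1


def shuffle(pairs):
    """Group values by key into queues, one queue per first name.

    In a real system the framework does this across machines; here it is
    a plain in-memory dictionary (an assumption made for illustration).
    """
    queues = defaultdict(list)
    for key, value in pairs:
        queues[key].append(value)
    return queues


def reduce_phase(queues):
    """Reduce step: summarize each queue by counting its entries."""
    return {name: sum(values) for name, values in queues.items()}


def name_frequencies(students):
    """Run map, shuffle, and reduce in sequence on one machine."""
    return reduce_phase(shuffle(map_phase(students)))


# Hypothetical sample data: (first name, last name) records.
students = [("Asha", "Rao"), ("Ravi", "Kumar"), ("Asha", "Menon")]
print(name_frequencies(students))  # {'Asha': 2, 'Ravi': 1}
```

Because each Map() call looks at one record and each Reduce() call looks at one queue, the framework is free to run many of them at the same time on different servers, which is where the parallelism comes from.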


U. Vijaya Saradhi holds an M.Tech in Computer Science and Engineering from I.I.Sc and a B.Tech in Mechanical Engineering. He worked for 4 years at Veritas on Volume Manager, then for 4 years on the Agami filesystem as Technical and Engineering Manager, and has been with MapR Technologies for the past four and a half years as Director of Engineering for the India team.