Hadoop® is used for distributed computing and can query large datasets based on its reliable and scalable architecture.

Two major components of Hadoop® are the Hadoop® Distributed File System (HFDS) and MapReduce.

Discuss atleast FOUR (4) overall roles of these two components, including their role during system failures. Also include in your discussion advantages of parallel processing.   Clearly provide sub-titles to each part of your discussion

DQ requirement: Note that the requirement is to post your initial response no later than Thursday and you must post two additional post during the week (Sunday). I recommend your initial posting to be between 200-to-300 words. The replies to fellow students and to the professor should range between 100-to-150 words. All initial posts must contain a properly formatted in-text citation and scholarly reference.

Label your discussion properly.