Hadoop Interview questions


Total available count: 27
Subject - Apache
Subsubject - Hadoop

What is Narrow Transformations?

Narrow transformations are the result of map, filter, and such that is from the data from a single partition only, i.e. it is self-sustained. An output RDD(Resilient Distributed Datasets) has partitions with records that originate from a single partition in the parent RDD(Resilient Distributed Datasets). Only a limited subset of partitions used to calculate the result. Spark groups narrow transformations as a stage.




Next 5 interview question(s)

1
How many type of transformations exist?
2
What is Preferred Locations?
3
How do you define actions?
4
What is the transformation?
5
What are the possible operations on RDD?