WHAT IS RDD ?
RDD is the spark's core abstraction which is resilient distributed dataset.
It is the immutable distributed collection of objects.

RDD Creation


RDD vs Dataframe vs Dataset













