Skip to main content

Posts

Featured

Spark Transformations, Actions and Lazy Evaluation.

Apache Spark RDD supports two types of Operations: Transformations Actions A Transformation is a function that produces new RDD from the existing RDDs but when we want to work with the actual dataset, at that point Action is performed.

Latest posts

ETL vs ELT?

Pyspark functions - to_date() vs date_format()