53. approx_count_distinct(), avg(), collect_list(), collect_set(), countDistinct(), count() #pyspark
