53. approx_count_distinct(), avg(), collect_list(), collect_set(), countDistinct(), count() #pyspark

Similar Tracks
54. row_number(), rank(), dense_rank() functions in PySpark | #pyspark #spark #azuresynapse #azure
WafaStudies
44. partitionBy function in PySpark | Azure Databricks #spark #pyspark #azuresynaspe #databricks
WafaStudies
10. withColumn() in PySpark | Add new column or Change existing column data or type in DataFrame
WafaStudies
22 Optimize Joins in Spark & Understand Bucketing for Faster joins |Sort Merge Join |Broad Cast Join
Ease With Data
50. Date functions in PySpark | current_date(), to_date(), date_format() functions #pspark #spark
WafaStudies
Trump wipes the floor with Ramaphosa - The biggest turning point in South African History
Willem Petzer
40. UDF(user defined function) in PySpark | Azure Databricks #spark #pyspark #azuresynapse #azure
WafaStudies
18. Column class in PySpark | pyspark.sql.Column | #PySpark #AzureDatabricks #spark #azuresynapse
WafaStudies
Data Caching in Apache Spark | Optimizing performance using Caching | When and when not to cache
Learning Journal