cancel
Showing results for 
Search instead for 
Did you mean: 
KosmaS
New Contributor III
since ‎07-16-2024
‎08-22-2024

User Stats

  • 5 Posts
  • 0 Solutions
  • 4 Kudos given
  • 1 Kudos received

User Activity

Hey Everyone,I experience data skewness for: df = (source_df .unionByName(source_df.withColumn("region", lit("Country"))) .groupBy("zip_code", "region", "device_type") .agg(countDistinct("device_id").alias("total_active_unique"), count("device_id").a...
To cache/persist an action needs to be triggered. I'm just wondering, will it make any difference if, after persisting some df, I use, for instance, take(5) instead of count()?Will it be a bit more effective, because of sending results from 5 partiti...
Hey,I had a stable notebook within the whole job. It contains one action defined as dumping data to s3. Currently, it started generating some issues. Maybe someone can suggest either how to investigate it further or what to try to do with such kinds ...
Kudos from