While loading data from one layer to another using a PySpark window function, I noticed that some records are missing. This only happens with large data volumes, not with small ones. Has anyone come across this issue before?
I tried repartitioning and assigning each transformation's result to a new DataFrame variable, but records are still missing. Please let me know if you have any other suggestions.
I have a DataFrame with a key, effective date, and end date. I want to use a window function with lag to populate the previous end date, partitioning by the key and ordering by the effective date. But the output row count differs from the input.