Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Writing to Delta Table and retrieving back the IDs doesn't work

dc-rnc
Contributor

Hi.

I have a workflow in which I write a few rows into a Delta table with auto-generated IDs. Right after they're written, I need to read them back to collect those generated IDs, so I read the table and join it to the "input" DataFrame we inserted, using two of its columns (a client_id and a timestamp) as the join key.

The problem is that the join sometimes retrieves the rows, but not always. I ran some smoke tests in an interactive notebook and was never able to reproduce what happens in the workflow.

Here is a screenshot of the Spark UI showing the operations (saveAsTable and the two subsequent collects) and their submit times.

[Screenshot of the Spark UI: dcrnc_4-1743006179065.png]

I ran many different tests in both the workflow and the interactive notebook, but with no relevant result. I thought it was something related to the timestamp and/or the timezone, but everything looked fine to me. I've also tried converting the timestamps to unix timestamps, and the issue still occurs.

Please, could you help me?

Thank you.
Best Regards.

2 REPLIES

jeremy98
Honored Contributor

I'm interested in this problem too. Could someone help?

mark_ott
Databricks Employee

Your workflow issue—writing to a Delta Table, immediately reading back using a join on client_id and timestamp, but sometimes missing rows—suggests a subtle problem, likely related to timing, consistency, or column precision between your input DataFrame and the data as written to Delta. Here’s a thorough breakdown of what could be happening and key steps to resolve it:
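
For reference, the pattern under discussion looks roughly like the sketch below. The table and column names ("events", "client_id", "event_ts", "generated_id") and the input DataFrame are placeholders, not taken from your actual job.

python
# `spark` is the session Databricks provides; `input_df` stands in for the batch being inserted.
input_df.write.format("delta").mode("append").saveAsTable("events")

# Immediately read the table back and join on (client_id, timestamp) to recover the IDs.
written = spark.table("events")
ids = (
    input_df.select("client_id", "event_ts")
    .join(written, on=["client_id", "event_ts"], how="inner")
    .select("client_id", "event_ts", "generated_id")
)
ids.collect()  # intermittently returns fewer rows than were just written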

Possible Root Causes

  • Write and Read Consistency
    Delta Lake operations are ACID compliant, but a subsequent read can still miss newly committed rows if it resolves against a cached or previously loaded snapshot of the table instead of the latest committed version.

  • Timestamp Precision Issues
    Timestamps may be rounded or truncated somewhere between the input DataFrame and what is actually persisted (for example, sub-second digits dropped by an upstream conversion), so exact-equality join conditions can fail. Spark's TimestampType carries microsecond precision, and even a tiny difference in the sub-second digits is enough to break an exact match.

  • Time Zone Mismatches
    If your environment has any timezone mismatches, either during writing or reading, this will break equality joins. Even with explicit conversion to Unix timestamp, subtle TZ issues can persist if conversion functions use different base timezones.

  • Spark Caching
    If you're reading from a Delta Table that was cached earlier in the session, you might fetch stale data. Unpersist the cache before reading after a write.

  • Write-to-Read Race Condition
    Double-check that the "collect" that reads the rows back is not running before the Delta transaction is fully committed. In parallel workflows, Spark job scheduling can submit the read before the write has finished; a minimal sequencing sketch follows this list.
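
To rule out the race-condition and caching causes above, one option is to keep the write and the read-back strictly sequential in the same task, clear cached state in between, and confirm the latest commit before reading. A minimal sketch, reusing the placeholder names from the earlier snippet:

python
# 1. saveAsTable returns only after the Delta commit has succeeded.
input_df.write.format("delta").mode("append").saveAsTable("events")

# 2. Drop any cached snapshot of the table so the next read sees the new version.
spark.catalog.clearCache()

# 3. Optional sanity check: the most recent entry in the history should be this append.
spark.sql("DESCRIBE HISTORY events LIMIT 1").show(truncate=False)

# 4. Read back only after the write call has returned, in the same task.
written = spark.table("events")
retrieved = input_df.select("client_id", "event_ts").join(
    written, on=["client_id", "event_ts"], how="inner"
)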

Troubleshooting Steps

  • Validate Transaction Commit
    Make sure to only trigger the read-back after the write operation returns and is committed. In workflows, use proper job dependencies/sequencing.

  • Check Timestamp Rounding
    Inspect the timestamps in the Spark DataFrame versus those in the Delta table with .show() on both, paying attention to sub-second digits. If there are differences, join with a tolerance range on the timestamp instead of exact equality, for example:

    python
    from pyspark.sql import functions as F
    # one-second tolerance instead of exact equality (df = input, delta_df = the table read back)
    condition = F.abs(df["timestamp"].cast("double") - delta_df["timestamp"].cast("double")) < 1
  • Unpersist Caches
    After each write, run spark.catalog.clearCache() before reading the table back.

  • Audit Timezones
    Set timezone for Spark explicitly:

    python
    spark.conf.set("spark.sql.session.timeZone", "UTC")

    Do this for both writing and reading, then compare timestamps in UTC.

  • Compare Raw Data
    After writing, query the Delta Table directly (e.g., with SQL) alongside your insert DataFrame, to see what got persisted.

  • Row Hashing for Join
    Instead of joining on the timestamp, create a stable hash of (client_id, timestamp) before the write and use it for the post-write lookup; a sketch follows this list.
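
A minimal sketch of the row-hash lookup from the last bullet, again with placeholder names; it assumes the table schema includes (or can evolve to include) the extra row_key column:

python
from pyspark.sql import functions as F

# Deterministic key computed before the write, from the same values that get persisted.
keyed_df = input_df.withColumn(
    "row_key",
    F.sha2(F.concat_ws("||", F.col("client_id"), F.col("event_ts").cast("string")), 256),
)

keyed_df.write.format("delta").mode("append").saveAsTable("events")

# Post-write lookup by the stable key instead of raw timestamp equality.
written = spark.table("events")
retrieved_ids = (
    keyed_df.select("row_key")
    .join(written, on="row_key", how="inner")
    .select("row_key", "generated_id")
)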

Recommendations

  • Use a surrogate, auto-generated ID (for example a Delta identity column) for reliable joining, rather than just (client_id, timestamp); see the sketch after this list.

  • Avoid using floating-point or timestamp columns for exact joins; switch to integers (unix timestamps) or use a range.

  • Always synchronize workflow steps to ensure all writes are durable before reads.

  • Check Spark job/submission times in Spark UI to ensure that read operations are scheduled after writes, not immediately in parallel.
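
For the surrogate-key recommendation above, Delta tables on Databricks support identity columns, so every inserted row gets a unique ID without any timestamp matching. A minimal sketch with the same placeholder names (the inserting DataFrame must not supply the id column itself):

python
# Placeholder DDL: Delta assigns id automatically on every insert.
spark.sql("""
    CREATE TABLE IF NOT EXISTS events (
        id        BIGINT GENERATED ALWAYS AS IDENTITY,
        client_id STRING,
        event_ts  TIMESTAMP,
        payload   STRING
    ) USING DELTA
""")

# Appends get ids assigned by Delta; rows can then be looked up by a key the
# caller controls (for example the row hash above) rather than by timestamps.
input_df.write.format("delta").mode("append").saveAsTable("events")  # input_df holds only the non-identity columns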

Next Actions

  • Double-check the exact SQL/dataframe join key logic for equality and type matching.

  • Review the Spark logs for cache usage and for the timing of the transaction commits.

  • Try a join with a ±1 second window on the timestamps and compare results (see the sketch below).
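
One quick way to run that comparison (placeholder names again): count how many input rows find no exact-equality match in the table versus how many find no match even within a one-second window.

python
from pyspark.sql import functions as F

written = spark.table("events")

# Rows with no exact (client_id, event_ts) match in the table.
missing_exact = input_df.join(
    written, on=["client_id", "event_ts"], how="left_anti"
).count()

# Rows with no match even when the timestamps may differ by up to one second.
missing_windowed = input_df.alias("i").join(
    written.alias("w"),
    (F.col("i.client_id") == F.col("w.client_id"))
    & (F.abs(F.col("i.event_ts").cast("double") - F.col("w.event_ts").cast("double")) <= 1),
    how="left_anti",
).count()

print(f"unmatched (exact): {missing_exact}, unmatched (±1s window): {missing_windowed}")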
