DeltaFileNotFoundException: No file found in the directory (sudden task failure)

Juju
New Contributor II

Hi all,

I am currently running a job that upserts a table by reading from the Delta change data feed of my silver table. Here is the relevant snippet of code:

from datetime import datetime, timedelta

# Read the change data feed for the table and keep only changes
# committed within the last hour.
rds_changes = spark.read.format("delta") \
  .option("readChangeFeed", "true") \
  .option("startingVersion", 0) \
  .table("main.default.gold_table") \
  .where(f"_commit_timestamp >= '{(datetime.now() - timedelta(hours=1)).strftime('%Y-%m-%d %H:%M:%S')}'")

Here is the error returned:

com.databricks.sql.transaction.tahoe.DeltaFileNotFoundException: No file found in the directory: s3://databricks-workspace-stack-70da1-metastore-bucket/60ed403c-0a54-4f42-8b8a-73b8cea1bdc3/tables/6d4a9b3d-f88b-436e-be1b-09852f605f4c/_delta_log.

I have done the following:

  • Verified that the _delta_log folder has not been deleted, by checking S3 directly
  • Confirmed that I can query the table directly and run `DESCRIBE HISTORY gold_table` on it without any issue

Does anyone have any idea why this happens? The job was working fine previously without any changes.

3 REPLIES

Kaniz_Fatma
Community Manager (Accepted Solution)

Hi @Juju, the error message you're encountering, com.databricks.sql.transaction.tahoe.DeltaFileNotFoundException, indicates that the Delta transaction log is missing from the directory Spark is trying to read.

Let's explore some potential solutions to address this issue:

Check Delta Log Truncation or Deletion:

  • Delta Lake periodically cleans up transaction log entries older than the table's log retention interval (delta.logRetentionDuration, 30 days by default). If the log has been truncated, a change data feed read starting at version 0 can no longer find the commit files it needs, which produces exactly this error. You can inspect the table's retention settings and remaining history to confirm; see the sketch below.
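A quick way to check whether log truncation is in play, assuming the table name from the snippet above (these are standard Spark SQL commands; the retention properties may simply not be set, in which case the Delta defaults apply):

# Show table properties; delta.logRetentionDuration controls how long
# transaction log entries are kept (default: interval 30 days).
spark.sql("SHOW TBLPROPERTIES main.default.gold_table").show(truncate=False)

# DESCRIBE HISTORY lists the versions that are still retained; a change
# feed read from version 0 needs those early versions to still be available.
spark.sql("DESCRIBE HISTORY main.default.gold_table") \
    .select("version", "timestamp", "operation") \
    .show(truncate=False)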

Spark Configuration Options:

  • Use a new checkpoint directory: create a new checkpoint directory for your job. However, this might not be feasible if you still need to process the existing data.
  • Set spark.sql.files.ignoreMissingFiles to true: this property lets Spark ignore missing files during processing. It won't reprocess data from the beginning; instead, it will resume from where the last checkpoint left off (see the sketch after this list).
  • Adjusting these settings may help you avoid data loss while resolving the issue.
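If you want to try the ignoreMissingFiles route, here is a minimal sketch of how the setting could be applied for the current session. Note that the snippet in the question is a batch read, so the checkpoint-directory suggestion mainly applies if the job also has a streaming leg, and this setting only suppresses the missing-file failure; it does not restore truncated history:

# Tell Spark to skip files that appear in table metadata but no longer
# exist in storage, instead of failing the whole job.
spark.conf.set("spark.sql.files.ignoreMissingFiles", "true")

# The change feed read from the original post can then be retried unchanged.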

Remember to carefully evaluate the impact of any changes on your existing data and processing flow. If possible, test these solutions in a controlled environment to minimize disruptions. Good luck, and I hope this helps you resolve the issue! 🚀

Juju
New Contributor II

Hey @Kaniz_Fatma, found the issue: it was caused by a truncated delta log. Thanks for the help!
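For anyone else who hits this: since the read above only needs the last hour of changes anyway, one way to keep it away from truncated history is to bound the change feed read itself instead of starting at version 0. A rough sketch (startingTimestamp is a standard change data feed option; whether it fits depends on your pipeline):

from datetime import datetime, timedelta

# Start the change feed at a recent timestamp rather than version 0,
# so the read never needs log entries that may already have been cleaned up.
start_ts = (datetime.now() - timedelta(hours=1)).strftime('%Y-%m-%d %H:%M:%S')

rds_changes = spark.read.format("delta") \
    .option("readChangeFeed", "true") \
    .option("startingTimestamp", start_ts) \
    .table("main.default.gold_table")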

Kaniz_Fatma
Community Manager

Hi @Juju, we value your perspective! It's great to hear that your query has been successfully resolved. Thank you for your contribution.




 
