01-11-2022 03:17 AM
I have a pipeline with more than 20 streams running on Auto Loader. The pipeline crashed, and since the crash I have been unable to start the streams. They fail with one of the following messages:
1):
The metadata file in the streaming source checkpoint directory is missing. This metadata file contains important default options for the stream, so the stream cannot be restarted right now. Please contact Databricks support for assistance.
2):
Caused by: java.io.FileNotFoundException: No such file or directory: s3a://elisa-automate-ml/pipeline-kpis/kpi_polystar_map/checkpoint/sources/0/rocksdb/SSTs/000061-fc068971-99e3-403e-8767-2335ec7a0330.sst
Accepted Solutions
01-16-2022 10:49 PM
It seems the root problem was that the checkpoint was located inside the table's data folder and VACUUM was run on the Delta table, which presumably deleted checkpoint files along with the stale data files. I had to restart the streams with fresh checkpoint locations to get this sorted.
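
For anyone hitting the same thing, a minimal sketch of a guard that could run before starting each stream, to catch a checkpoint path nested under the table path. The helper name `is_inside` and the `example-bucket` paths are hypothetical, not taken from the pipeline above. The check is deliberately conservative: it flags any nested location, even though the Delta Lake documentation notes that VACUUM ignores directories whose names begin with an underscore or a dot.

```python
from posixpath import normpath
from urllib.parse import urlparse


def is_inside(child_uri: str, parent_uri: str) -> bool:
    """Return True if child_uri equals parent_uri or is nested under it.

    Hypothetical helper: compares scheme, bucket and normalized path only.
    """
    child, parent = urlparse(child_uri), urlparse(parent_uri)
    # Different scheme or bucket means the locations cannot overlap.
    if (child.scheme, child.netloc) != (parent.scheme, parent.netloc):
        return False
    child_path = normpath(child.path or "/")
    parent_path = normpath(parent.path or "/")
    # Append "/" so that "kpi_other" is not treated as nested under "kpi".
    return child_path == parent_path or child_path.startswith(
        parent_path.rstrip("/") + "/"
    )


# Hypothetical locations, for illustration only.
table_path = "s3a://example-bucket/tables/kpi"
risky_checkpoint = "s3a://example-bucket/tables/kpi/checkpoint"
safe_checkpoint = "s3a://example-bucket/checkpoints/kpi"

print(is_inside(risky_checkpoint, table_path))  # True
print(is_inside(safe_checkpoint, table_path))   # False
```

The safe path would then be the value passed to the stream writer's `checkpointLocation` option.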

01-11-2022 04:56 PM
Hello @Olli Tiihonen! Welcome and thanks for your question. We'll make sure the community has a chance to answer your question before we come back around. Thanks!
01-17-2022 09:29 AM
@Olli Tiihonen - Thanks for letting us know. I'm glad you were able to get to the bottom of things. 🙂

