โ01-11-2022 03:17 AM
I have a pipeline with + 20 streams running based on autoloader. The pipeline crashed and after the crash I'm unable to start the streams and they fail with one of the following messages:
1):
The metadata file in the streaming source checkpoint directory is missing. This metadata file contains important default options for the stream, so the stream cannot be restarted right now. Please contact Databricks support for assistance.
2):
Caused by: java.io.FileNotFoundException: No such file or directory: s3a://elisa-automate-ml/pipeline-kpis/kpi_polystar_map/checkpoint/sources/0/rocksdb/SSTs/000061-fc068971-99e3-403e-8767-2335ec7a0330.sst
โ01-16-2022 10:49 PM
seems like the root problem was having checkpoint located in the data folder and running vacuum on the delta table. Had to restart streams with fresh checkpoint locations to get this sorted.
โ01-11-2022 04:56 PM
Hello @Olli Tiihonenโ! Welcome and thanks for your question. We'll make sure the community has a chance to answer your question before we come back around. Thanks!
โ01-16-2022 10:49 PM
seems like the root problem was having checkpoint located in the data folder and running vacuum on the delta table. Had to restart streams with fresh checkpoint locations to get this sorted.
โ01-17-2022 09:29 AM
@Olli Tiihonenโ - Thanks for letting us know. I'm glad you were able to get to the bottom of things. ๐
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโt want to miss the chance to attend and share knowledge.
If there isnโt a group near you, start one and help create a community that brings people together.
Request a New Group