01-11-2022 03:17 AM
I have a pipeline with more than 20 streams running on Auto Loader. The pipeline crashed, and since the crash I have been unable to restart the streams. They fail with one of the following messages:
1):
The metadata file in the streaming source checkpoint directory is missing. This metadata file contains important default options for the stream, so the stream cannot be restarted right now. Please contact Databricks support for assistance.
2):
Caused by: java.io.FileNotFoundException: No such file or directory: s3a://elisa-automate-ml/pipeline-kpis/kpi_polystar_map/checkpoint/sources/0/rocksdb/SSTs/000061-fc068971-99e3-403e-8767-2335ec7a0330.sst
01-16-2022 10:49 PM
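It seems the root problem was that the checkpoint was located inside the Delta table's data folder and VACUUM was run on the table, which presumably removed the checkpoint files because they are not referenced by the table's transaction log. I had to restart the streams with fresh checkpoint locations to get this sorted.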
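For anyone wanting to guard against the same layout, here is a minimal sketch of a pre-flight check that flags a checkpoint location sitting at or below a table path. The function name `checkpoint_inside_table` and the bucket and paths are hypothetical and not from my pipeline. The check is deliberately conservative: it only compares paths and knows nothing about which directories VACUUM actually skips.

```python
import posixpath
from urllib.parse import urlparse


def checkpoint_inside_table(checkpoint: str, table_path: str) -> bool:
    """Return True if `checkpoint` is at or below `table_path`.

    Hypothetical helper: a purely lexical path comparison, no storage access.
    """
    c, t = urlparse(checkpoint), urlparse(table_path)

    # Different scheme or bucket means the locations cannot overlap.
    if (c.scheme, c.netloc) != (t.scheme, t.netloc):
        return False

    # Normalize to drop trailing slashes and "." / ".." segments.
    c_path = posixpath.normpath(c.path or "/")
    t_path = posixpath.normpath(t.path or "/")

    # Compare on a directory boundary so "events_v2" is not treated
    # as being inside "events".
    prefix = t_path.rstrip("/") + "/"
    return c_path == t_path or c_path.startswith(prefix)


# Hypothetical paths for illustration only.
table = "s3a://my-bucket/tables/events"
risky = "s3a://my-bucket/tables/events/checkpoint"
safe = "s3a://my-bucket/checkpoints/events"

for location in (risky, safe):
    if checkpoint_inside_table(location, table):
        print(f"WARNING: {location} is inside the table directory")
    else:
        print(f"OK: {location} is outside the table directory")
```

A check like this could run before each stream starts, so that a misplaced checkpoint is caught at deploy time rather than after a failed restart.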
01-11-2022 04:56 PM
Hello @Olli Tiihonen! Welcome and thanks for your question. We'll make sure the community has a chance to answer your question before we come back around. Thanks!
01-17-2022 09:29 AM
@Olli Tiihonen - Thanks for letting us know. I'm glad you were able to get to the bottom of things. 🙂