PDF Generation via databricks Job
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2025 11:31 AM
WE have a databricks Job that will aggregate some data and create some data tables. This needs to be exported out in a PDF format.
I have seen a few python libraries that can generate PDF, but was wondering if the PDF can be generated and dropped in a cloud storage location frm databricks job.
If anyone has some ideas on this, greatly appreciated.
Mulder
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2025 12:10 PM
Hey @Mulder81 ,
After generating file you can copy it to dediacted blob storage using below command.
dbutils.fs.cp(source_path, destination_path)
You can also try saving file in desired path (blob storage) but there can be some obstacles
BR,
Piotrek
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2025 12:23 PM
Are there any specific ways to generate the PDF file from a dataframe? and libraries that work
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2025 02:23 PM
Hey @Mulder81
You can try this article
It uses pandas and fpdf. Also you need to read about temporary place/path where you save pdf for later copying for blob