
PDF Generation via Databricks Job

Mulder81
New Contributor II

We have a Databricks job that aggregates some data and creates some data tables. The results need to be exported in PDF format.

I have seen a few Python libraries that can generate PDFs, but I was wondering whether the PDF can be generated and dropped in a cloud storage location from a Databricks job.

If anyone has some ideas on this, they would be greatly appreciated.

Mulder

3 REPLIES

PiotrMi
Contributor

Hey @Mulder81,

After generating the file, you can copy it to dedicated blob storage using the command below.

dbutils.fs.cp(source_path, destination_path)

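You can also try saving the file directly to the desired path (blob storage), but there can be some obstacles. For example, a library that uses ordinary local file I/O may not be able to write to a cloud storage URI.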
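To make the copy step concrete, here is a minimal sketch. It assumes the PDF was first written to the driver's local disk, and that `dbutils` is the object Databricks provides in notebooks and jobs. The helper name `publish_file` and the destination path in the usage note are hypothetical.

```python
import os


def publish_file(dbutils, local_path, destination_path):
    """Copy a file from the driver's local disk to a storage location.

    `dbutils` is passed in explicitly (an assumption made for this sketch)
    so the function does not depend on a notebook global.
    """
    if not os.path.isfile(local_path):
        raise FileNotFoundError(local_path)
    # The "file:" scheme marks the source as a path on the driver's local
    # filesystem rather than a DBFS or cloud storage path.
    source_uri = "file:" + os.path.abspath(local_path)
    dbutils.fs.cp(source_uri, destination_path)
    return source_uri
```

In a job this could be called as `publish_file(dbutils, "/tmp/report.pdf", destination_path)`, where `destination_path` is whatever storage location your workspace can write to (a placeholder here, not a real path).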

BR,

Piotrek

Mulder81
New Contributor II

Are there any specific ways to generate the PDF file from a DataFrame, and any libraries that work well for this?

Hey @Mulder81,

You can try this article:

https://medium.com/@mahijain9211/creating-a-python-class-for-generating-pdf-tables-from-a-pandas-dat...

It uses pandas and fpdf. You should also read up on the temporary local path where you save the PDF before copying it to blob storage.
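As a rough illustration of the pandas plus fpdf approach (not the code from the linked article), the sketch below renders a small DataFrame as a bordered table. All function names are hypothetical. It assumes the `fpdf` package is installed on the cluster and that the data fits on one page with equal-width columns.

```python
def table_to_text_rows(columns, records):
    """Turn a header and an iterable of records into rows of strings."""
    header = [str(c) for c in columns]
    body = [[str(v) for v in rec] for rec in records]
    return [header] + body


def write_table_pdf(rows, pdf_path, row_height=8):
    """Draw rows of strings as a simple grid and save the PDF to pdf_path."""
    # Imported lazily so the helper above works without fpdf installed.
    from fpdf import FPDF

    pdf = FPDF()
    pdf.add_page()
    # Built-in core font; it only covers Latin-1 characters.
    pdf.set_font("Helvetica", size=10)
    usable_width = pdf.w - pdf.l_margin - pdf.r_margin
    col_width = usable_width / len(rows[0])
    for row in rows:
        for text in row:
            # Positional arguments: width, height, cell text, border flag.
            pdf.cell(col_width, row_height, text, 1)
        pdf.ln(row_height)
    pdf.output(pdf_path)
    return pdf_path


def dataframe_to_pdf(df, pdf_path):
    """Render a pandas DataFrame (assumed small) to a PDF file."""
    records = df.itertuples(index=False, name=None)
    return write_table_pdf(table_to_text_rows(df.columns, records), pdf_path)
```

For a Spark DataFrame, one option is to call `toPandas()` on the aggregated result first, provided it is small enough to fit in driver memory. Then write to a local path such as `/tmp/report.pdf` and copy that file to storage afterwards.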
