Data Engineering

Forum Posts

Sorted by:

by CrisCampos • New Contributor II

11-22-2022 1:07:43 PM

3871 Views
1 replies
1 kudos

How to load a "pickle/joblib" file on Databricks

Hi Community, I am trying to load a joblib on Databricks, but doesn't seems to be working.Getting an error message: "Incompatible format detected" Any idea of how to load this type of file on db?Thanks!

Data Engineering

3871 Views
1 replies
1 kudos

11-22-2022 1:07:43 PM

View Replies

Latest Reply

tapash-db
Databricks Employee

08-07-2023 11:36:02 AM

1 kudos

You can import joblib/joblibspark package to load joblib files

1 kudos

08-07-2023 11:36:02 AM

by UmaMahesh1 • Honored Contributor III

04-11-2023 7:01:42 AM

2391 Views
1 replies
2 kudos

Checkpoint issue when loading data from confluent kafka

I have a streaming notebook which fetches messages from confluent Kafka topic and loads them into adls. It is a streaming notebook with the trigger as continuous processing. Before loading the message (which is in Avro format), I'm flattening out the...

Data Engineering

2391 Views
1 replies
2 kudos

04-11-2023 7:01:42 AM

View Replies

Latest Reply

Avinash_94
New Contributor III

04-14-2023 12:21:44 AM

2 kudos

Best approach is to not to depend on Kafka’s commit mechanism! We can store processing result and message offset to external data store in the same database transaction. So, if the database transaction fails, both commit and processing will fail and ...

2 kudos

04-14-2023 12:21:44 AM

by Arunsundar • New Contributor III

03-12-2023 9:16:49 PM

3529 Views
4 replies
4 kudos

The possibility of finding the workload dynamically and spin up the cluster based on the workload

Hi Team,Good morning. I would like to understand if there is a possibility to determine the workload automatically through code (data load from a file to a table, determine the file size, kind of a benchmark that we can check), based on which we can ...

Data Engineering

3529 Views
4 replies
4 kudos

03-12-2023 9:16:49 PM

View Replies

Latest Reply

pvignesh92
Honored Contributor

03-13-2023 10:40:13 AM

4 kudos

Hi @Arunsundar Muthumanickam , When you say workload, I believe you might be handling various volumes of data between Dev and Prod environment. If you are using Databricks cluster and do not have much idea on how the volumes might turn out in differ...

4 kudos

03-13-2023 10:40:13 AM

3 More Replies

by RamaSantosh • New Contributor II

09-19-2022 9:51:49 PM

4873 Views
2 replies
3 kudos

Data load from Azure databricks dataframe to cosmos db container

I am trying to load data from Azure databricks dataframe to cosmos db container using below commandcfg = { "spark.cosmos.accountEndpoint" : cosmosEndpoint, "spark.cosmos.accountKey" : cosmosMasterKey, "spark.cosmos.database" : cosmosDatabaseName, "sp...

Data Engineering

4873 Views
2 replies
3 kudos

09-19-2022 9:51:49 PM

View Replies

Latest Reply

Anonymous
Not applicable

10-02-2022 4:02:54 AM

3 kudos

Hey @Rama Santosh Ravada Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from y...

3 kudos

10-02-2022 4:02:54 AM

1 More Replies

Databricks Community

How to load a "pickle/joblib" file on Databricks

Checkpoint issue when loading data from confluent kafka

The possibility of finding the workload dynamically and spin up the cluster based on the workload

Data load from Azure databricks dataframe to cosmos db container