Community Platform Discussions

by JonasStrenk • New Contributor

08-14-2023 5:42:34 AM

1044 Views
0 replies
0 kudos

Struct type limitation: possible hidden limit for parquet tables

Recently I discovered an issue when creating a PARQUET table that contains a column of type STRUCT with more than 350 string subfields. Such a table can be created successfully via a standard DDL script; nevertheless, each subsequent attempt to work wi...
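
For anyone trying to reproduce this, a DDL statement of that shape can be generated rather than typed by hand. This is only a sketch: the table name, column name, and subfield names are hypothetical, and the original poster's exact script is not shown in the post.

```python
def struct_table_ddl(table: str, column: str, n_fields: int) -> str:
    """Return a CREATE TABLE statement for a Parquet table whose single
    column is a STRUCT with `n_fields` STRING subfields."""
    fields = ", ".join(f"f{i}: STRING" for i in range(n_fields))
    return f"CREATE TABLE {table} ({column} STRUCT<{fields}>) USING PARQUET"


# 351 subfields, i.e. just over the 350 mentioned in the post.
# All names here are made up for illustration.
ddl = struct_table_ddl("struct_limit_test", "payload", 351)
print(ddl[:80])
```

Varying `n_fields` around 350 would be one way to check whether the behaviour really depends on the subfield count.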

by jomt • New Contributor III

08-09-2023 6:00:03 AM

5487 Views
1 reply
0 kudos

Resolved! How do you properly read database-files (.db) with Spark in Python after the JDBC update?

I have a set of database files (.db) which I need to read into my Python notebook in Databricks. I managed to do this fairly simply up until July, when an update to the SQLite JDBC library was introduced. Until now I have read the files in question with...

Latest Reply

jomt
New Contributor III

08-10-2023 6:36:48 AM

0 kudos

When the numbers in the table are very large (millions and billions) or very small (e.g. 1e-15), SQLite JDBC may struggle to import the correct values. To combat this, it can help to use customSchema in the options to define the schema using Dec...
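
A minimal sketch of that suggestion, assuming the truncated word refers to a decimal type. The column names, precision and scale values, and helper names are hypothetical. `url`, `driver`, `dbtable`, and `customSchema` are standard read options of Spark's JDBC data source, and `org.sqlite.JDBC` is the driver class of the SQLite JDBC library.

```python
def build_custom_schema(columns: dict) -> str:
    """Build the value for Spark's JDBC `customSchema` read option.

    `columns` maps column names to Spark SQL type strings.
    """
    return ", ".join(f"{name} {sql_type}" for name, sql_type in columns.items())


def read_sqlite_table(spark, db_path: str, table: str, columns: dict):
    """Read one table from a SQLite file with explicit column types.

    `spark` is an existing SparkSession, as available in a Databricks notebook.
    """
    return (
        spark.read.format("jdbc")
        .option("url", f"jdbc:sqlite:{db_path}")
        .option("driver", "org.sqlite.JDBC")
        .option("dbtable", table)
        .option("customSchema", build_custom_schema(columns))
        .load()
    )


# Hypothetical columns: wide decimals for very large and very small values.
schema = build_custom_schema({"amount": "DECIMAL(38, 0)", "rate": "DECIMAL(38, 18)"})
print(schema)
```

Columns not listed in `customSchema` keep the types the driver reports, so only the problematic numeric columns need to be named.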

by saberw • New Contributor

08-10-2023 12:40:36 AM

4279 Views
0 replies
0 kudos

Cron Schedule like 0 58/30 6,7,8,9,10,11,12,13,14,15,16,17 ? * MON,TUE,WED,THU,FRI * does not work

When we use this cron schedule: 0 58/30 6,7,8,9,10,11,12,13,14,15,16,17 ? * MON,TUE,WED,THU,FRI * so far only the 58th minute runs, but not the 28th minute (30 minutes after the 58th minute). Is there some kind of bug in the cron scheduler? Reference: h...
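
One possible explanation, sketched under the assumption that the scheduler follows Quartz cron semantics: a minutes field of the form start/step begins at start and keeps adding step only while the result stays within 0-59, so it does not carry over into the next hour. The helper name `expand_minutes` is hypothetical and not part of any scheduler API.

```python
from typing import List


def expand_minutes(field: str) -> List[int]:
    """Expand a Quartz-style minutes field into the minutes it fires on.

    Handles comma lists, single values, `*`, and start/step increments.
    Ranges (a-b) are left out to keep the sketch short.
    """
    minutes = set()
    for part in field.split(","):
        if "/" in part:
            start_text, step_text = part.split("/")
            start = 0 if start_text == "*" else int(start_text)
            # Increments stop at minute 59; they never wrap into the next hour.
            minutes.update(range(start, 60, int(step_text)))
        elif part == "*":
            minutes.update(range(60))
        else:
            minutes.add(int(part))
    return sorted(minutes)


print(expand_minutes("58/30"))  # [58]
print(expand_minutes("28,58"))  # [28, 58]
```

Under that assumption, listing both minutes explicitly (a minutes field of 28,58) would fire twice in each listed hour. Note that this also fires at 6:28, which may or may not be wanted.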

by hukel • Contributor

08-06-2023 6:07:00 AM

2849 Views
5 replies
1 kudos

Resolved! Databricks Add-on for Splunk v1.2 - Error in 'databricksquery' command

Is anyone else using the new v1.2 of the Databricks Add-on for Splunk? We upgraded to 1.2 and now get this error for all queries. Running process: /opt/splunk/bin/nsjail-wrapper /opt/splunk/bin/python3.7 /opt/splunk/etc/apps/TA-Databricks/bin/datab...

Latest Reply

hukel
Contributor

08-09-2023 10:15:01 AM

1 kudos

There is a new mandatory parameter for databricksquery called account_name. This breaking change is not documented in the Splunkbase release notes, but it does appear in the docs within the Splunk app. databricksquery cluster="<cluster_name>" query="<S...

by GeKo • New Contributor III

08-09-2023 8:31:12 AM

973 Views
0 replies
0 kudos

global init script from workspace file ?

Hi Community, based on the announced change on Sep 1st, disabling cluster-scoped init scripts in DBFS, I have questions re *global* init scripts. I am creating global init scripts via terraform "databricks_global_init_script" resources. Where do those ...

Tags: databricks_global_init_script, init script, workspace file

by shanmukh_b • New Contributor

08-07-2023 11:29:49 AM

18427 Views
1 reply
0 kudos

Convert string date to date after changing format

Hi, I am using Databricks SQL and came across a scenario. I have a date field whose dates are in the format 'YYYY-MM-DD'. I changed their format to 'MM/DD/YYYY' using the DATE_FORMAT() function. EFF_DT = 2000-01-14 EFF_DT _2 = DATE_FORMAT(EFF_DT, 'MM/d...

Tags: Databricks SQL, date, sql, string

Latest Reply

-werners-
Esteemed Contributor III

08-08-2023 6:47:54 AM

0 kudos

If you use to_date, you will get a date column as mentioned above. If you want to use the format MM/dd/yyyy you can use date_format, but this will return a string column. In order to use Spark date functions, the date string should comply with Spark DateTyp...
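
The distinction can be illustrated outside Spark. The sketch below uses Python's standard datetime module as a stand-in, which is an assumption made for illustration and not the Spark API. Formatting a date yields a string, and getting a date back requires parsing that string with the matching pattern. In Spark SQL the analogous calls would be date_format(col, 'MM/dd/yyyy') and to_date(str, 'MM/dd/yyyy').

```python
from datetime import date, datetime

# A real date value, playing the role of EFF_DT from the question.
eff_dt = date(2000, 1, 14)

# Formatting returns a string, not a date (comparable to date_format).
eff_dt_text = eff_dt.strftime("%m/%d/%Y")

# Parsing with the matching pattern gives a date again (comparable to to_date).
eff_dt_back = datetime.strptime(eff_dt_text, "%m/%d/%Y").date()

print(type(eff_dt_text).__name__, eff_dt_text)   # str 01/14/2000
print(type(eff_dt_back).__name__, eff_dt_back)   # date 2000-01-14
```

A date value has no display format of its own; the MM/dd/yyyy form exists only in the string.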

by DineshKumar • New Contributor III

07-28-2023 5:51:47 AM

1365 Views
1 reply
0 kudos

How to install AWS .pem file in databricks cluster to make a db connection to MySql RDS

I am trying to make a connection between AWS MySQL RDS and Databricks. I am using the code below to establish the connection, but it fails because the certificate is not installed. I have the .pem file with me. Could anyone help with how to install this in D...

Latest Reply

Debayan
Databricks Employee

08-07-2023 11:44:27 PM

0 kudos

Hi, Could you please provide the error code or the full error stack? Please tag @Debayan with your next comment which will notify me. Thank you!

by FutureLegend • New Contributor III

08-02-2023 10:33:34 PM

4237 Views
2 replies
1 kudos

Resolved! Download Dolly model on local machine

Hi~ I am new to LLM engineering and am trying to download the Dolly-v2-7b model to my local machine, so that I don't need to connect to the internet each time I run Dolly-v2-7b. Is it possible to do that? Thanks a lot!

Latest Reply

FutureLegend
New Contributor III

08-07-2023 1:08:40 AM

1 kudos

Hi Kaniz and Sean, thanks for your responses and time. I was trying Kaniz's method, but got a reply from Sean, so I tried that too. I downloaded the file from the link Sean provided and saved it on my local machine, then used the code for Dollyv2 (htt...

by TalY • New Contributor II

08-02-2023 12:12:19 AM

7308 Views
5 replies
0 kudos

Python notebook crashes with "The Python kernel is unresponsive"

A Python notebook that works on my machine crashes at the same point with the errors "The Python kernel is unresponsive" and "The Python process exited with exit code 134 (SIGABRT: Aborted).", but with no stacktrace for debugging the ...

Latest Reply

TalY
New Contributor II

08-07-2023 12:50:19 AM

0 kudos

I am using DBR 12.2 LTS (includes Apache Spark 3.3.2, Scala 2.12). Fatal error: The Python kernel is unresponsive. The Python process exited with exit code 134 (S...

by Hani4hanuman • New Contributor II

08-05-2023 7:40:16 AM

2098 Views
2 replies
1 kudos

Databricks notebook issue

Hi, I'm trying to run an ADF pipeline. However, it fails at the Notebook activity with the error below. Error: NoSuchMethodError: com.microsoft.sqlserver.jdbc.SQLServerBulkCopy.writeToServer(Lcom/microsoft/sqlserver/jdbc/ISQLServerBulkRecord;)V I think i...

Latest Reply

Hani4hanuman
New Contributor II

08-06-2023 9:07:44 PM

1 kudos

@shan_chandra Thanks for your reply. As per your suggestion, I changed the Databricks version from 9.1 LTS to 12.2 LTS. But after this change, when I check the library you provided (i.e. com.microsoft.azure:spark-mssql-connector_2.12:1.3.0) under Maven, it is not...

by lightningStrike • New Contributor III

07-10-2023 6:09:00 AM

2844 Views
3 replies
0 kudos

unable to install pymqi in azure databricks

Hi, I am trying to install pymqi via the command below: pip install pymqi However, I am getting the error message below: Python interpreter will be restarted. Collecting pymqi Using cached pymqi-1.12.10.tar.gz (91 kB) Installing build dependencies: started Inst...

Latest Reply

sean_owen
Databricks Employee

08-04-2023 6:27:45 PM

0 kudos

I don't think so, because it won't be specific to Databricks - this is all a property of the third party packages. And, there are billions of possible library conflicts. But this is not an example of a package conflict. It's an example of not complet...

by alejandrofm • Valued Contributor

07-31-2023 7:26:55 AM

4133 Views
1 reply
1 kudos

Resolved! Configure job to use one cluster instance to multiple jobs

Hi! I have several tiny jobs that run in parallel and I want them to run on the same cluster: - Task type Python Script: I send the parameters this way to run the PySpark scripts. - Job compute cluster created as (copied JSON from Databricks Job UI) Ho...

Tags: cluster, job, job cluster

Latest Reply

KoenZandvliet
New Contributor III

08-04-2023 8:16:15 AM

1 kudos

Unfortunately, running multiple jobs in parallel using a single job cluster is not supported (yet). New in Databricks is the possibility to create a job that orchestrates multiple jobs. These jobs will, however, still use their own cluster (configurati...

by div19882021 • New Contributor

07-27-2023 6:47:35 AM

905 Views
1 reply
1 kudos

Is there a solution that we can display the worker types based on spark version selection using api?

Is there a solution that allows us to display the worker types or driver types based on the selection of Spark version using an api?

Latest Reply

sean_owen
Databricks Employee

08-04-2023 8:08:35 AM

1 kudos

Can you clarify what you mean? Worker and driver types are not related to Spark version.

by pabloanzorenac • New Contributor II

07-31-2023 6:07:29 PM

1893 Views
2 replies
2 kudos

Resolved! Reduce EBS Default Volumes

By default, Databricks creates 2 volumes: one with 30 GB and the other with 150 GB. We have a lot of nodes in our pools and therefore many terabytes of volumes, but we are not making any use of them in the jobs. Is there any way to reduce the volumes? ...

Latest Reply

sean_owen
Databricks Employee

08-04-2023 8:05:01 AM

2 kudos

Yes, EBS vols are essential for shuffle spill for example. You are probably using them!

by KrishZ • Contributor

08-02-2023 8:55:31 AM

5512 Views
1 reply
0 kudos

Uninstalling a preinstalled python package from Databricks

The [Datasets](https://pypi.org/project/datasets/) Python package comes preinstalled on Databricks clusters. I want to uninstall it or completely prevent its installation when I create/start a cluster. I couldn't find any solution on Stack Overflow. And I ...

Latest Reply

sean_owen
Databricks Employee

08-04-2023 8:00:58 AM

0 kudos

@Retired_mod note that you can't actually uninstall packages in the runtime with pip.
