Community Discussions

Forum Posts

Ikanip
by New Contributor
  • 182 Views
  • 4 replies
  • 0 kudos

How to choose a compute type, and how to find alternatives to the one currently in use?

We are using a compute type for an interactive cluster in production which incurs X amount of cost. We want to know what options are available with roughly the same processing power as the current compute, but at a cost of Y, which is less...

Latest Reply
raphaelblg
New Contributor III
  • 0 kudos

Hello @Ikanip, you can use the Databricks Pricing Calculator to estimate costs. For detailed information on compute capacity, please refer to your cloud provider's documentation on virtual machine instance types.

3 More Replies
Databricks_S
by Visitor
  • 118 Views
  • 0 replies
  • 0 kudos

issue related to Cluster Policy

Hello Databricks Community, I am currently working on creating a Terraform script to provision clusters in Databricks. However, I've noticed that by default, the clusters created using Terraform have the policy set to "Unrestricted." I would like to co...
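A minimal sketch of one way to address this, assuming the Databricks Terraform provider's databricks_cluster_policy and databricks_cluster resources; every name and setting below is illustrative, not taken from the post:

    # Sketch: pin clusters to a named policy instead of the default "Unrestricted".
    resource "databricks_cluster_policy" "restricted" {
      name = "restricted-clusters"
      definition = jsonencode({
        "autotermination_minutes" : { "type" : "fixed", "value" : 30 }
      })
    }

    resource "databricks_cluster" "this" {
      cluster_name            = "example-cluster"
      spark_version           = "15.0.x-scala2.12"
      node_type_id            = "i3.xlarge"
      num_workers             = 1
      autotermination_minutes = 30
      # Attach the policy explicitly so the cluster is not created as "Unrestricted".
      policy_id               = databricks_cluster_policy.restricted.id
    }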

scottbisaillon
by Visitor
  • 172 Views
  • 0 replies
  • 0 kudos

Databricks Running Jobs and Terraform

What happens to a currently running job when a workspace is deployed again using Terraform? Are the jobs paused/resumed, or are they left unaffected without any downtime? Searching for this specific scenario doesn't seem to come up with anything and...

ChristopherS5
by New Contributor
  • 129 Views
  • 1 reply
  • 0 kudos

Step-by-step guide to creating a Unity Catalog in Azure Databricks.

Hello everyone, there isn't an official document outlining the step-by-step procedure for enabling Unity Catalog in Azure Databricks. If anyone has created documentation or knows the process, please share it here. Thank you in advance.

Latest Reply
PL_db
New Contributor III
  • 0 kudos

See the docs "Setup Unity Catalog on Azure" and "Unity Catalog best practices". Which guidance/procedure are you missing?

Hubcap7700
by New Contributor
  • 69 Views
  • 1 reply
  • 0 kudos

Native Slack Integration

Hi, are there any plans to build native Slack integration? I'm envisioning a one-time connector to Slack that would automatically populate all channels and users to select from, for example when configuring an alert notification. It does not seem ...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Hubcap7700, If you have any further details or specific requirements, feel free to share, and I’ll be happy to assist! 

sujan1
by New Contributor
  • 47 Views
  • 1 reply
  • 0 kudos

requirements.txt with cluster libraries

Cluster libraries are supported from Databricks Runtime 15.0 (Databricks Runtime 15.0 | Databricks on AWS). How can I specify a requirements.txt file path in the libraries of a job cluster in my workflow? Can I use a relative path? Is it relative from the root of th...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

To specify the requirements.txt file path for libraries in a job cluster workflow in Databricks, you have a few options. Let’s break it down: Upload the requirements.txt File: First, upload your requirements.txt file to your Databricks workspace....
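A minimal sketch of what that can look like when creating the job over the REST API, assuming the requirements library type that arrived alongside Databricks Runtime 15.0; the host, token, and all paths below are placeholders (note the workspace path is absolute, not relative):

    # Sketch: a job cluster that installs libraries from an uploaded requirements.txt.
    import requests

    payload = {
        "name": "example-job",
        "tasks": [{
            "task_key": "main",
            "notebook_task": {"notebook_path": "/Workspace/Users/me/notebook"},
            "new_cluster": {
                "spark_version": "15.0.x-scala2.12",
                "node_type_id": "i3.xlarge",
                "num_workers": 1,
            },
            # Absolute workspace path to the uploaded requirements file.
            "libraries": [{"requirements": "/Workspace/Users/me/requirements.txt"}],
        }],
    }
    resp = requests.post(
        "https://<workspace-host>/api/2.1/jobs/create",
        headers={"Authorization": "Bearer <token>"},
        json=payload,
    )
    resp.raise_for_status()
    print(resp.json())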

Abhay_1002
by New Contributor
  • 48 Views
  • 1 reply
  • 0 kudos

Archive file support in Jar Type application

In my Spark application, I am using a set of Python libraries. I am submitting the Spark application as a JAR task, but I am not able to find any option to provide archive files. So, in order to handle Python dependencies, I am using this approach: create an archive file...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Abhay_1002, Using the --py-files argument: when submitting a Spark application, you can use the --py-files argument to add Python files (including .zip or .egg archives) to be distributed with your application. However, this approach is typical...
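As a concrete illustration of that route from an already-running session, a sketch using SparkContext.addPyFile; the archive path and package name below are hypothetical:

    # Sketch: ship a zip of pure-Python dependencies to every executor; this is
    # the programmatic equivalent of `spark-submit --py-files deps.zip`.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    spark.sparkContext.addPyFile("dbfs:/FileStore/deps/deps.zip")  # hypothetical path

    def uses_dependency(x):
        import mypkg  # hypothetical package shipped inside deps.zip
        return mypkg.transform(x)

    print(spark.sparkContext.parallelize([1, 2, 3]).map(uses_dependency).collect())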

EirikMa
by New Contributor II
  • 307 Views
  • 2 replies
  • 0 kudos

UTF-8 troubles in DLT

I am having issues with UTF-8 in DLT. I have tried to set the Spark config on the cluster running the DLT pipeline. I have fixed this with normal compute under advanced settings like this: spark.conf.set("spark.driver.extraJava...

[two screenshots attached]
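For reference, a sketch of how that standard-compute fix is usually written, assuming the intent is to force UTF-8 as the JVM default charset; on a regular cluster these lines go into the Spark config under advanced options rather than into a runtime spark.conf.set call:

    spark.driver.extraJavaOptions -Dfile.encoding=UTF-8
    spark.executor.extraJavaOptions -Dfile.encoding=UTF-8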
Latest Reply
EirikMa
New Contributor II
  • 0 kudos

Hi @Kaniz! Sorry for the long wait... The problem is not the columns or the data itself; the UTF-8 option for CSV is working fine. The issue seems to be with table names not being compatible. If I run the query through Auto Loader outside DLT and use ba...

1 More Replies
mderela
by New Contributor II
  • 79 Views
  • 1 reply
  • 0 kudos

Databricks bundles - good practice for multiprocessing envs

I'm seeking advice regarding Databricks bundles. In my scenario, I have multiple production environments where I aim to execute the same DLT. To simplify, let's assume the DLT reads data from 'eventhub-region-name', with this being the only differing...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @mderela, When dealing with Databricks bundles in a multi-environment setup, there are some best practices you can follow to ensure smooth execution and maintainable code. Let’s explore a couple of recommendations: Parameterization and Configu...
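A minimal sketch of the parameterization idea in databricks.yml, assuming Databricks Asset Bundles targets and variables; the target and Event Hub names are illustrative:

    # databricks.yml (sketch): one bundle, per-environment Event Hub names.
    # The pipeline definition can then reference ${var.eventhub_name}.
    variables:
      eventhub_name:
        description: Event Hub the DLT pipeline reads from

    targets:
      prod_us:
        variables:
          eventhub_name: eventhub-us-east
      prod_eu:
        variables:
          eventhub_name: eventhub-eu-west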

liormayn
by New Contributor II
  • 520 Views
  • 4 replies
  • 3 kudos

OSError: [Errno 78] Remote address changed

Hello :) As part of deploying an app that previously ran directly on EMR to Databricks, we are running experiments using LTS 9.1 and getting the following error: PythonException: An exception was thrown from a UDF: 'pyspark.serializers.SerializationEr...

Latest Reply
liormayn
New Contributor II
  • 3 kudos

Hey @NandiniN, the error has stopped happening, but we are not feeling "safe" yet. Could you tell me when the fix was published, just so we can try to pinpoint whether the fix is what solved it?

3 More Replies
tim-mcwilliams
by New Contributor
  • 73 Views
  • 0 replies
  • 0 kudos

Notebook cell gets hung up but code completes

I have been running into an issue when running a pymc-marketing model in a Databricks notebook. The cell that fits the model gets hung up and the progress bar stops moving; however, the code completes and dumps all needed output into a folder. After the...

GeKo
by New Contributor II
  • 541 Views
  • 3 replies
  • 0 kudos

column "storage_sub_directory" is now always NULL in system.information_schema.tables

Hello, I am running a job that depends on the information provided in the column storage_sub_directory in system.information_schema.tables, and it worked until 1-2 weeks ago. Now I have discovered in the docs that this column is deprecated and always NULL, ...

Latest Reply
NandiniN
Valued Contributor II
  • 0 kudos

Hello, linking the documentation: https://docs.databricks.com/en/sql/language-manual/information-schema/tables.html#definition. The relevant row reads: STORAGE_SUB_DIRECTORY (STRING, nullable, non-standard): Deprecated. Always NULL.

2 More Replies
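If the job needs the storage path that the deprecated column used to expose, one possible workaround is DESCRIBE DETAIL, which returns a location column; a sketch, assuming Delta tables and an illustrative table name:

    # Sketch: recover a table's storage location now that
    # information_schema.tables.storage_sub_directory is always NULL.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    detail = spark.sql("DESCRIBE DETAIL main.default.my_table")  # illustrative name
    print(detail.select("location").first()["location"])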
Abhay_1002
by New Contributor
  • 82 Views
  • 1 reply
  • 0 kudos

Issue with Python Package Management in Spark application

In a PySpark application, I am using a set of Python libraries. In order to handle Python dependencies while running the PySpark application, I am using the approach provided by Spark: create an archive file of the Python virtual environment using the required set o...

Latest Reply
NandiniN
Valued Contributor II
  • 0 kudos

Hi, I have not tried it, but based on the doc you have to go by this approach; ./environment/bin/python must be replaced with the correct path:

    import os
    from pyspark.sql import SparkSession

    os.environ['PYSPARK_PYTHON'] = "./environment/bin/python"
    sp...
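The truncated snippet appears to follow the virtual-environment pattern from the Spark documentation; a fuller sketch of that pattern, with the archive name illustrative:

    # Sketch: pack the virtualenv (e.g. with venv-pack), ship it via spark.archives,
    # and point the Python workers at the interpreter inside the unpacked directory.
    import os
    from pyspark.sql import SparkSession

    os.environ['PYSPARK_PYTHON'] = "./environment/bin/python"
    spark = SparkSession.builder.config(
        "spark.archives",  # the "#environment" suffix names the unpack directory
        "pyspark_venv.tar.gz#environment",
    ).getOrCreate()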

Nagarathna
by New Contributor II
  • 154 Views
  • 3 replies
  • 1 kudos

File not found error when trying to read a JSON file from AWS S3 using "with open".

I am trying to read JSON from AWS S3 using "with open" in a Databricks notebook on a shared cluster. Error message: No such file or directory: '/dbfs/mnt/datalake/input_json_schema.json'. On a single-instance cluster the above error does not occur.

Latest Reply
NandiniN
Valued Contributor II
  • 1 kudos

Hi @Nagarathna, I just tried it on a shared cluster and did not face any issue. What is the exact error that you are facing? The complete stack trace might help. Just to confirm, are you accessing "/dbfs/mnt/datalake/input.json" from the same workspac...

2 More Replies
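For comparison, a sketch of the two common access paths, reusing the mount path from the original question:

    # Sketch: read the JSON schema file two ways on Databricks.
    import json
    from pyspark.sql import SparkSession

    # 1) Local-file view through the /dbfs FUSE mount (availability can depend
    #    on the cluster's access mode).
    with open("/dbfs/mnt/datalake/input_json_schema.json") as f:
        schema = json.load(f)

    # 2) Spark-native path, which avoids the FUSE mount entirely.
    spark = SparkSession.builder.getOrCreate()
    raw = spark.read.text("dbfs:/mnt/datalake/input_json_schema.json")
    print(schema, raw.count())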