Community Discussions
Connect with fellow community members to discuss general topics related to the Databricks platform, industry trends, and best practices. Share experiences, ask questions, and foster collaboration within the community.

Forum Posts

Abhay_1002
by New Contributor
  • 237 Views
  • 1 replies
  • 0 kudos

Archive file support in Jar Type application

In my Spark application, I am using a set of Python libraries. I am submitting the Spark application as a Jar task, but I am not able to find any option to provide Archive Files. So, in order to handle Python dependencies, I am using this approach: Create archive file...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Abhay_1002, Using the --py-files argument: when submitting a Spark application, you can use the --py-files argument to add Python files (including .zip or .egg archives) to be distributed with your application. However, this approach is typical...
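For reference, the --py-files flag belongs to spark-submit; from inside an already-running job, the equivalent call is SparkContext.addPyFile. A minimal sketch, where the archive path and module name are hypothetical:

    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()

    # Distribute a zipped package of Python dependencies to the driver and
    # all executors. "dbfs:/libs/my_deps.zip" is a hypothetical path.
    spark.sparkContext.addPyFile("dbfs:/libs/my_deps.zip")

    import my_module  # hypothetical module packaged inside my_deps.zip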

EirikMa
by New Contributor II
  • 1342 Views
  • 2 replies
  • 0 kudos

UTF-8 troubles in DLT

I am having issues with UTF-8 in DLT. I have tried to set the Spark config on the cluster running the DLT pipeline. On normal compute I have fixed this under advanced settings like this: spark.conf.set("spark.driver.extraJava...
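The snippet above is cut off; assuming it is the usual charset fix, the JVM default encoding is forced to UTF-8 through the driver and executor JVM options, set as cluster Spark config (or in the DLT pipeline's configuration) rather than at runtime via spark.conf.set:

    spark.driver.extraJavaOptions -Dfile.encoding=UTF-8
    spark.executor.extraJavaOptions -Dfile.encoding=UTF-8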

Latest Reply
EirikMa
New Contributor II
  • 0 kudos

Hi @Kaniz_Fatma! Sorry for the long wait... The problem is not the columns or the data itself; the UTF-8 option for CSV is working fine. The issue seems to be with table names not being compatible. If I run the query through Auto Loader outside DLT and ...

1 More Replies
mderela
by New Contributor II
  • 292 Views
  • 1 replies
  • 0 kudos

Databricks bundles - good practices for multi-environment setups

I'm seeking advice regarding Databricks bundles. In my scenario, I have multiple production environments where I aim to execute the same DLT pipeline. To simplify, let's assume the pipeline reads data from 'eventhub-region-name', with this being the only differing...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @mderela, When dealing with Databricks bundles in a multi-environment setup, there are some best practices you can follow to ensure smooth execution and maintainable code. Let’s explore a couple of recommendations: Parameterization and Configu...
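As an illustration of the parameterization recommendation, a minimal databricks.yml sketch that moves the differing Event Hub name into a per-target variable (all names below are hypothetical):

    bundle:
      name: my-dlt-bundle            # hypothetical bundle name

    variables:
      eventhub_name:
        description: Event Hub the DLT pipeline reads from
        default: eventhub-dev

    targets:
      prod-eu:
        variables:
          eventhub_name: eventhub-eu-west
      prod-us:
        variables:
          eventhub_name: eventhub-us-east

The pipeline or job definition then references ${var.eventhub_name}, so the same bundle deploys unchanged to every environment.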

Abhay_1002
by New Contributor
  • 254 Views
  • 1 replies
  • 0 kudos

Issue with Python Package Management in Spark application

In a PySpark application, I am using a set of Python libraries. In order to handle Python dependencies while running the PySpark application, I am using the approach provided by Spark: Create archive file of Python virtual environment using required set o...

Latest Reply
NandiniN
Honored Contributor
  • 0 kudos

Hi, I have not tried it, but based on the doc you have to go by this approach; ./environment/bin/python must be replaced with the correct path. import os from pyspark.sql import SparkSession os.environ['PYSPARK_PYTHON'] = "./environment/bin/python" sp...
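Untangled, the snippet quoted above follows the virtualenv-archive pattern from the Spark Python package management docs; roughly (the archive name is an assumption):

    import os
    from pyspark.sql import SparkSession

    # Point the workers at the interpreter inside the unpacked archive.
    os.environ['PYSPARK_PYTHON'] = "./environment/bin/python"

    # Ship the packed virtualenv to the cluster; "#environment" names the
    # directory the archive unpacks to on each node.
    spark = SparkSession.builder.config(
        "spark.archives",
        "pyspark_venv.tar.gz#environment",
    ).getOrCreate()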

Nagarathna
by New Contributor II
  • 1033 Views
  • 3 replies
  • 1 kudos

File not found error when trying to read a JSON file from AWS S3 using with open

I am trying to read JSON from AWS S3 using with open in a Databricks notebook on a shared cluster. Error message: No such file or directory: '/dbfs/mnt/datalake/input_json_schema.json'. On a single-instance cluster the error does not occur.

Latest Reply
NandiniN
Honored Contributor
  • 1 kudos

Hi @Nagarathna, I just tried it on a shared cluster and did not face any issue. What is the exact error that you are facing? The complete stack trace might help. Just to confirm: are you accessing the "/dbfs/mnt/datalake/input.json" from the same workspac...
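If the FUSE path turns out to be the problem, one workaround sketch that avoids the local /dbfs/... path entirely (assuming the same mount as above):

    import json

    # dbutils.fs reads through the Databricks filesystem API instead of the
    # /dbfs FUSE mount, which shared-access clusters may restrict.
    raw = dbutils.fs.head("dbfs:/mnt/datalake/input_json_schema.json", 1024 * 1024)
    schema = json.loads(raw)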

2 More Replies
databricksdev
by New Contributor II
  • 475 Views
  • 2 replies
  • 0 kudos

Can we customize the job run name when running Azure Databricks notebook jobs from Azure Data Factory?

Hi All, we are executing a Databricks notebook activity inside the child pipeline through ADF. We are getting the child pipeline name as the job name while executing the Databricks job. Is it possible to get the master pipeline name as the job name, or to customize the job name thr...

Latest Reply
NandiniN
Honored Contributor
  • 0 kudos

I think we should raise this as product feedback. I am not sure whether Databricks or Microsoft would own it, but you may submit feedback for Databricks here: https://docs.databricks.com/en/resources/ideas.html

1 More Replies
MOUNIKASIMHADRI
by New Contributor
  • 2308 Views
  • 2 replies
  • 1 kudos

Insufficient Permissions Issue on Databricks

I have encountered a technical issue on Databricks. While executing commands both in Spark and SQL within the Databricks environment, I've run into permission-related errors when selecting files from DBFS: "org.apache.spark.SparkSecurityException: [IN...

Latest Reply
NandiniN
Honored Contributor
  • 1 kudos

Hi @MOUNIKASIMHADRI, Workspace admins get ANY FILE granted by default, and they can explicitly grant it to non-admin users. Hence, as suggested in the KB: GRANT SELECT ON ANY FILE TO `<user@domain-name>`

1 More Replies
dbx_687_3__1b3Q
by New Contributor III
  • 630 Views
  • 2 replies
  • 0 kudos

Impersonating a user

How do I impersonate a user? I can't find any documentation that explains how to do this, or even hints that it's possible. Use case: I perform administrative tasks like assigning grants and roles to catalogs, schemas, and tables for the benefit of busines...

Latest Reply
NandiniN
Honored Contributor
  • 0 kudos

Hi @dbx_687_3__1b3Q, actually, I have seen impersonation; is this something that you are looking for? https://docs.gcp.databricks.com/en/dev-tools/google-id-auth.html#step-5-impersonate-the-google-cloud-service-account

1 More Replies
AlexG
by New Contributor II
  • 1019 Views
  • 3 replies
  • 1 kudos

Query results in csv file include 'null' string for blank cell

After running a SQL script, when downloading the results to a CSV file, the file includes a 'null' string for blank cells (see screenshot). Is there a setting I can change to simply get empty cells instead?

Latest Reply
NandiniN
Honored Contributor
  • 1 kudos

Hi @AlexG, I tested with the table content containing null and with empty data, and it works as expected in the download option too. Here is an example:

    CREATE TABLE my_table_null_test1 (
      id INT,
      name STRING
    );
    INSERT INTO my_table_null_test1 (id, name)...
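A completed version of that repro, plus a COALESCE workaround if the goal is blank cells in the exported CSV (the sample values and the workaround are illustrative, not from the truncated reply):

    CREATE TABLE my_table_null_test1 (id INT, name STRING);
    INSERT INTO my_table_null_test1 (id, name) VALUES (1, NULL), (2, 'abc');

    -- Render NULLs as empty strings so the download shows blank cells.
    SELECT id, COALESCE(name, '') AS name FROM my_table_null_test1;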

2 More Replies
DataBricks_Use1
by New Contributor
  • 464 Views
  • 2 replies
  • 0 kudos

FileReadException Error

Hi, I am getting a FileReadException error while reading a JSON file using the REST API connector. It occurs when the data in the JSON file is huge; it cannot handle more than 1 lakh (100,000) records. Error details: org.apache.spark.SparkException: Job aborted due to sta...

Latest Reply
NandiniN
Honored Contributor
  • 0 kudos

Hello @DataBricks_Use1, It would be great if you could add the entire stack trace, as Jose mentioned. There should be a "Caused by:" section below which would give you an idea of the reason for this failure, and then you can work on that. fo...

1 More Replies
Phani1
by Valued Contributor
  • 352 Views
  • 1 replies
  • 0 kudos

Temporary tables or dataframes?

We have to generate over 70 intermediate tables. Should we use temporary tables or dataframes, or should we create Delta tables and truncate and reload? Having too many temporary tables could lead to memory problems. In this situation, what is the mo...

Latest Reply
NandiniN
Honored Contributor
  • 0 kudos

Hi @Phani1, This would be a use-case-specific answer, so if possible I would suggest working with a Solution Architect on this, or sharing some more details for better guidance. When I say that, I just want to understand whether we really ne...

Koa
by New Contributor
  • 349 Views
  • 1 replies
  • 0 kudos

Databricks dashboard state not cleared when logging in as another user

Hi all, I am using Databricks and created a notebook that I would like to run in a dashboard. It works correctly. I shared the dashboard with another user, UserA, with "Can Run" permission. When I log in as UserA and access the dashboard, does a...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Koa, You've encountered a security concern related to Databricks and handling JWT tokens within notebooks. Dashboard state persistence: when you share a dashboard with another user (in this case, UserA), any updates made by that user will re...

Ruby8376
by Valued Contributor
  • 464 Views
  • 2 replies
  • 1 kudos

Databricks SQL API <- Delta tables

We have an integration flow where we want to expose Databricks data for querying through OData (web app). For this piece: Databricks SQL API <- Delta tables. Two questions here: 1. Can you share a link/documentation on how we can integrate databricks <- delta ...

Latest Reply
shan_chandra
Esteemed Contributor
  • 1 kudos

Hi @Ruby8376 - can you please review these similar posts where the resolution is provided: https://community.databricks.com/t5/warehousing-analytics/databricks-sql-restful-api-to-query-delta-table/td-p/8617 https://www.databricks.com/blog/2023/03/07/da...
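For the querying piece, the usual route is the Databricks SQL Statement Execution API; a rough Python sketch (host, token, warehouse ID, and table name are placeholders):

    import requests

    host = "https://<workspace-host>"      # placeholder
    token = "<personal-access-token>"      # placeholder

    # Submit a statement to a SQL warehouse and fetch the JSON response.
    resp = requests.post(
        f"{host}/api/2.0/sql/statements",
        headers={"Authorization": f"Bearer {token}"},
        json={
            "warehouse_id": "<warehouse-id>",  # placeholder
            "statement": "SELECT * FROM my_catalog.my_schema.my_table LIMIT 10",
        },
    )
    print(resp.json())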

1 More Replies
tomos_phillips1
by New Contributor II
  • 3055 Views
  • 6 replies
  • 0 kudos

GeoPandas Install

Hi, I cannot install geopandas in my notebook. I've tried all different forms of generic fixes, pip installs etc., but always get this error: CalledProcessError: Command 'pip --disable-pip-version-check install geopandas' returned non-zero exit status 1.---...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @vbvasa, The error message indicates that a GDAL API version must be specified. You can address this by providing a path to gdal-config using a GDAL_CONFIG environment variable, or by using a GDAL_VERSION environment variable. To set the GDAL_CONF...
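A sketch of that workaround from a notebook cell; the GDAL version below is a placeholder and must match the GDAL build installed on the cluster:

    import os
    import subprocess

    # Tell the build which GDAL API version to target before installing.
    os.environ["GDAL_VERSION"] = "3.4.1"  # placeholder version
    subprocess.check_call(["pip", "install", "geopandas"])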

5 More Replies
ChristopherQ1
by New Contributor
  • 370 Views
  • 1 replies
  • 0 kudos

Can we share Delta table data with Salesforce using OData?

Hello! I'm seeking recommendations for streaming on-demand data from Databricks Delta tables to Salesforce. Is OData a viable choice? Thanks.

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @ChristopherQ1, OData (Open Data Protocol) is a standard for building and consuming RESTful APIs. It provides a consistent way to expose and consume data over the web. While OData can be used for data integration, it's essential to evaluate whethe...
