Data Engineering

Forum Posts

by jwilliam • Contributor

07-05-2022 7:22:57 AM

1893 Views
3 replies
2 kudos

Resolved! How to mount Azure Blob Storage with OAuth2?

We already know that we can mount Azure Data Lake Gen2 with OAuth2 using this: configs = {"fs.azure.account.auth.type": "OAuth", "fs.azure.account.oauth.provider.type": "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider", ...
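
For readers unfamiliar with the ADLS Gen2 setup the post refers to, here is a minimal sketch of how such a config dictionary is usually assembled. The post's own config is cut off, so only the first two keys come from it; the three `oauth2.client.*` keys are the standard Hadoop ABFS client-credentials settings, and the function name and placeholder values are hypothetical. It illustrates the ADLS Gen2 case only and does not address the Blob Storage question.

```python
def adls_oauth_configs(client_id: str, client_secret: str, tenant_id: str) -> dict:
    """Build Hadoop ABFS settings for OAuth2 client-credentials authentication."""
    return {
        # These two entries are quoted in the original post.
        "fs.azure.account.auth.type": "OAuth",
        "fs.azure.account.oauth.provider.type":
            "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        # Assumption: standard ABFS keys, not taken from the truncated post.
        "fs.azure.account.oauth2.client.id": client_id,
        "fs.azure.account.oauth2.client.secret": client_secret,
        "fs.azure.account.oauth2.client.endpoint":
            f"https://login.microsoftonline.com/{tenant_id}/oauth2/token",
    }


# Placeholder values; in practice the secret would come from a secret scope.
configs = adls_oauth_configs("<application-id>", "<client-secret>", "<tenant-id>")

# In a Databricks notebook (where dbutils is predefined) the mount would be:
# dbutils.fs.mount(
#     source="abfss://<container>@<storage-account>.dfs.core.windows.net/",
#     mount_point="/mnt/<mount-name>",
#     extra_configs=configs,
# )
```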

Latest Reply

mathijs-fish
New Contributor III

02-14-2024 7:16:45 AM

2 kudos

Is there any update on this feature request? OAuth still does not seem to work with Azure Blob Storage... The configuration works fine for ADLS Gen2, but for Azure Blob Storage only SAS and account key seem to work.

2 More Replies

by thethirtyfour • New Contributor III

02-13-2024 2:53:01 PM

719 Views
2 replies
1 kudos

Resolved! Install R Package "sf"

Hi, I am trying to install the following four dependency packages in order to install "slu-openGIS/postmastr" directly from GitHub: units, sf, tigris, tidycensus. When attempting to install "units", I received the following configuration error: %r install.pack...

Latest Reply

thethirtyfour
New Contributor III

02-14-2024 5:09:23 AM

1 kudos

Thank you!

1 More Replies

by jcozar • Contributor

01-02-2024 3:41:02 AM

4257 Views
5 replies
2 kudos

Resolved! CDC and raw data

Hi, I am using Debezium Server to send data from Postgres to a Kafka topic (in fact, Azure Event Hubs). My question is, what are the best practices and recommendations to save raw data and then implement a medallion architecture? For clarification, I wa...

Latest Reply

jcozar
Contributor

02-14-2024 4:06:10 AM

2 kudos

Thank you very much @Palash01 ! It has been really helpful!

4 More Replies

by rhevarr • New Contributor II

02-06-2024 2:19:39 AM

304 Views
1 replies
0 kudos

Course: Apache Spark Programming with Databricks ID: E-P0W7ZV // Issue Classroom-Setup

Hello, I am trying to run the Classroom-Setup from the course files notebook (ASP 1.1 - Databricks Platform) (Course: Apache Spark™ Programming with Databricks, ID: E-P0W7ZV). Instructions: "Setup: Run classroom setup to mount Databricks training datasets an...

academy

Course

Databricks

spark

Latest Reply

Kaniz
Community Manager

02-14-2024 2:12:51 AM

0 kudos

Hi @rhevarr, Thank you for posting your concern on Community! To expedite your request, please list your concerns on our ticketing portal. Our support staff would be able to act faster on the resolution (our standard resolution time is 24-48 hours).

by karthik-kobai • New Contributor II

02-06-2024 11:18:49 AM

338 Views
1 replies
0 kudos

Databricks-jdbc and vulnerabilities CVE-2021-36090 CVE-2023-6378 CVE-2023-6481

The latest version of Databricks-jdbc available through Maven (2.6.36) now has these three vulnerabilities: https://www.cve.org/CVERecord?id=CVE-2021-36090 https://www.cve.org/CVERecord?id=CVE-2023-6378 https://www.cve.org/CVERecord?id=CVE-2023-6481 All ...

Latest Reply

Kaniz
Community Manager

02-14-2024 2:09:39 AM

0 kudos

Hi @karthik-kobai, Thank you for bringing this to my attention! Let’s address the vulnerabilities in the Databricks JDBC driver. The current version of the Databricks JDBC driver you mentioned is 2.6.36. It appears that this version has dependen...

by KrzysztofPrzyso • New Contributor II

02-06-2024 9:46:58 AM

531 Views
1 replies
0 kudos

databricks-connect, dbutils, abfss path, URISyntaxException

When trying to use `dbutils.fs.cp` in the #databricks-connect context to upload files to Azure Data Lake Gen2, I get a malformed URI error. I have used the code provided here: https://learn.microsoft.com/en-gb/azure/databricks/dev-tool...

abfss

databricks-connect

Latest Reply

Kaniz
Community Manager

02-14-2024 2:02:25 AM

0 kudos

Hi @KrzysztofPrzyso, It appears that you’re encountering an issue with relative paths in absolute URIs when using dbutils.fs.cp in the context of Databricks Connect to upload files to Azure Data Lake Gen2. Let’s break down the problem and explore po...
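
The failing path is not visible in the truncated post, so the following is only a general illustration of the "relative path in absolute URI" idea mentioned in the reply. It sketches how an abfss URI of the form abfss://<container>@<account>.dfs.core.windows.net/<path> can be assembled with an absolute path and checked before it is handed to `dbutils.fs.cp`. The helper names and the container, account, and file names are hypothetical.

```python
from urllib.parse import urlsplit


def build_abfss_uri(container: str, account: str, path: str) -> str:
    """Assemble an abfss URI whose path component always starts with '/'."""
    absolute_path = "/" + path.lstrip("/")
    return f"abfss://{container}@{account}.dfs.core.windows.net{absolute_path}"


def is_absolute_abfss_uri(uri: str) -> bool:
    """Check for the abfss scheme, a container@host authority and an absolute path."""
    parts = urlsplit(uri)
    return (
        parts.scheme == "abfss"
        and bool(parts.username)   # container name before the '@'
        and bool(parts.hostname)   # storage account host after the '@'
        and parts.path.startswith("/")
    )


uri = build_abfss_uri("raw", "examplestorage", "landing/data.csv")
print(uri)
print(is_absolute_abfss_uri(uri))                       # well-formed URI
print(is_absolute_abfss_uri("abfss:landing/data.csv"))  # scheme with a relative path
```

A check like this only catches structurally malformed URIs on the client side; it cannot confirm that the container or path exists.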

by subrat • New Contributor

02-13-2024 3:09:41 AM

726 Views
1 replies
0 kudos

Missing 'DBAcademy DLT' as a Cluster Policy when creating Delta Live Tables pipeline

Hi there, I'm currently going through Module 4 of the Data Engineering Associate pathway, specifically lesson 4.1 - DLT UI Walkthrough. We are instructed to specify the Cluster Policy as 'DBAcademy DLT' when configuring the pipeline. However, this opt...

Latest Reply

Kaniz
Community Manager

02-14-2024 1:45:22 AM

0 kudos

Hi @subrat, Thank you for posting your concern on Community! To expedite your request, please list your concerns on our ticketing portal. Our support staff would be able to act faster on the resolution (our standard resolution time is 24-48 hours).

by yatharth • New Contributor III

02-13-2024 4:37:34 AM

1782 Views
1 replies
2 kudos

Resolved! AWS CLI Commands

I wish to run an AWS CLI command in Databricks. Is there a way I can achieve this? To be more specific, I would like to run: aws cloudwatch get-metric-statistics --metric-name BucketSizeBytes --namespace AWS/S3 --start-time 2017-03-06T00:00:00Z --end-...

AWS-CLI

Latest Reply

Kaniz
Community Manager

02-14-2024 1:36:50 AM

2 kudos

Hi @yatharth, Certainly! You can use the Databricks CLI to run commands and automate tasks within your Databricks environment. The Databricks CLI provides a set of commands that allow you to interact with Databricks workspaces, clusters, librari...

by rt-slowth • Contributor

02-06-2024 10:04:55 PM

917 Views
5 replies
0 kudos

Error : . If you expect to delete or update rows to the source table in the future.......

Flow 'user_silver' has FAILED fatally. An error occurred because we detected an update or delete to one or more rows in the source table. Streaming tables may only use append-only streaming sources. If you expect to delete or update rows to the sourc...

Latest Reply

Palash01
Contributor III

02-13-2024 10:41:53 PM

0 kudos

Hey @rt-slowth, just checking in to see whether the provided solution was helpful to you. If yes, please accept it as a Best Solution so that this thread can be considered closed.

4 More Replies

by rt-slowth • Contributor

02-01-2024 5:48:43 PM

1464 Views
6 replies
0 kudos

Questions about the design of bronze, silver, and gold for live streaming pipelines

I'm envisioning a live streaming pipeline. The bronze, or data ingestion, is being fetched using the directory listing mode of Auto Loader. I'm not using File Notification Mode because I detect about 2-300 data changes per hour. I'm thinking about im...

Delta Live Table

spark

Latest Reply

Kaniz
Community Manager

02-11-2024 11:00:42 PM

0 kudos

Hey there! Thanks a bunch for being part of our awesome community! We love having you around and appreciate all your questions. Take a moment to check out the responses – you'll find some great info. Your input is valuable, so pick the best solution...

5 More Replies

by Dilorom • New Contributor

08-17-2022 12:52:05 PM

1619 Views
1 replies
0 kudos

How to connect to Dynamics CRM server in Databricks.

Currently I have access to Dynamics CRM backend server via AAD, and I can query tables via XRM tool. I am trying to connect to Dynamics CRM backend server in Databricks, and I am not sure how the connection needs to be set up or if any other access n...

Latest Reply

sheridan06
New Contributor III

02-13-2024 1:39:36 PM

0 kudos

Hi Dilorom - did you ever solve your issue? I'm trying to connect to Microsoft Dynamics Business Central and get an error when I run %pip install dynamics365bc. ERROR: Could not find a version that satisfies the requirement dynamics365bc (from version...

by User16869510359 • Esteemed Contributor

06-23-2021 6:34:13 AM

4191 Views
2 replies
0 kudos

Resolved! How does Delta solve the large number of small file problems?

Delta creates more small files during merge and update operations.

Latest Reply

User16869510359
Esteemed Contributor

06-23-2021 6:45:02 AM

0 kudos

Delta addresses the small-file problem using the following operations available for a Delta table. Optimized writes improve the write operation by adding an additional shuffle step, which reduces the number of output files. By defau...
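
To make the idea of reducing the number of output files concrete, here is a small, self-contained sketch of greedy bin-packing: many small files are grouped into fewer files of roughly a target size. This is an illustration of the general compaction idea only, not Delta's actual implementation; the function name, the target size, and the file sizes are all hypothetical.

```python
def plan_compaction(file_sizes_mb, target_mb=256):
    """Group file sizes into bins of at most target_mb (first-fit, largest first)."""
    bins = []  # each entry is [running_total_mb, [sizes placed in this bin]]
    for size in sorted(file_sizes_mb, reverse=True):
        for current in bins:
            if current[0] + size <= target_mb:
                current[0] += size
                current[1].append(size)
                break
        else:
            # No existing bin has room, so this file starts a new output file.
            bins.append([size, [size]])
    return [files for _, files in bins]


small_files = [10] * 100  # one hundred 10 MB files, e.g. left behind by many merges
groups = plan_compaction(small_files, target_mb=256)
print(len(small_files), "input files ->", len(groups), "output files")
```

Each group stands for one rewritten file, so the hundred small inputs collapse into a handful of larger outputs. A file larger than the target simply stays in a group of its own.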

1 More Replies

by ranged_coop • Valued Contributor II

06-20-2022 1:51:42 AM

8187 Views
24 replies
29 kudos

How to install Chromium Browser and Chrome Driver on DBX runtime 10.4 and above ?

Hi Team, We are wondering if there is a recommended way to install the Chromium browser and Chrome driver on Databricks Runtime 10.4 and above? I have been through the site and have come across several links to this effect, but they all seem to be ins...

Latest Reply

Kaizen
Contributor III

02-13-2024 9:44:55 AM

29 kudos

Look into Playwright instead of Selenium. I went through the same process y'all went through here (ended up writing an init script to install the drivers, etc.). This is all done for you in Playwright. Refer to this post - I hope it helps!! https://communit...

23 More Replies

by noimeta • Contributor II

09-29-2022 1:11:43 AM

6784 Views
17 replies
12 kudos

Resolved! Error when create an external location using code

I'm trying to create an external location from a notebook, and I got this kind of error: [PARSE_SYNTAX_ERROR] Syntax error at or near 'LOCATION'(line 1, pos 16) == SQL == CREATE EXTERNAL LOCATION IF NOT EXISTS test_location URL 's3://test-bronze/db/tes...

Latest Reply

Lokeshv
New Contributor II

02-13-2024 8:48:41 AM

12 kudos

Hey everyone, I'm facing an issue with retrieving data from a volume or table that contains a string with a symbol, for example, 'databricks+'. Whenever I try to retrieve this data, I encounter a syntax error. Can anyone help me resolve this issue?

16 More Replies

by seefoods • New Contributor III

01-10-2024 6:17:40 AM

383 Views
2 replies
0 kudos

cluster metrics collection

Hello @Debayan, how can I collect the metrics provided by cluster metrics for Databricks Runtime 13.1 or later using a Bash shell script? Cordially, Aubert EMAKO

Latest Reply

Debayan
Esteemed Contributor III

01-17-2024 8:42:44 PM

0 kudos

Hi, Cluster metrics is a UI tool and is available in the UI only. For reference: https://docs.databricks.com/en/compute/cluster-metrics.html

1 More Replies
