Data Engineering

Forum Posts

Sorted by:

by horatiug • New Contributor III

10-05-2022 9:31:44 AM

6672 Views
8 replies
3 kudos

Create workspace in Databricks deployed in Google Cloud using terraform

In the documentation https://registry.terraform.io/providers/databricks/databricks/latest/docs https://docs.gcp.databricks.com/dev-tools/terraform/index.html I could not find documentation on how to provision Databricks workspaces in GCP. Only cre...

Data Engineering

6672 Views
8 replies
3 kudos

10-05-2022 9:31:44 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-13-2022 8:01:04 PM

3 kudos

Hi @horatiu guja Does @Debayan Mukherjee response answer your question?If yes, would you be happy to mark it as best so that other members can find the solution more quickly? Else, we can help you with more details.

3 kudos

11-13-2022 8:01:04 PM

7 More Replies

by Arumugam • Databricks Partner

11-17-2022 3:12:07 AM

6462 Views
5 replies
1 kudos

DLT Pipeline failed to Start due to "The Execution Contained atleast one disallowed language

Hi , im trying to setup DLT pipeline ,its a basic pipeline for testing purpose im facing the issue while starting the pipeline , any help is appreciated Code :@dlt.table(name="dlt_bronze_cisco_hardware")def dlt_cisco_networking_bronze_hardware(): ret...

Data Engineering

6462 Views
5 replies
1 kudos

11-17-2022 3:12:07 AM

View Replies

Latest Reply

Vivian_Wilfred
Databricks Employee

11-17-2022 5:30:45 AM

1 kudos

Hi @Arumugam Ramachandran seems like you have a spark config set on your DLT job cluster that allows only python and SQL code. Check the spark config (cluster policy).In any case, the python code should work. Verify the notebook's default language, ...

1 kudos

11-17-2022 5:30:45 AM

4 More Replies

by sreedata • New Contributor III

03-31-2022 6:47:42 AM

6822 Views
4 replies
10 kudos

Resolved! Date field getting changed when reading from excel file to dataframe

The date field is getting changed while reading data from source .xls file to the dataframe. In the source xl file all columns are strings but i am not sure why date column alone behaves differentlyIn Source file date is 1/24/2022.In dataframe it is ...

Data Engineering

6822 Views
4 replies
10 kudos

03-31-2022 6:47:42 AM

View Replies

Latest Reply

Pradeep_Namani
New Contributor III

11-17-2022 6:56:19 AM

10 kudos

Hi Team, @Merca Ovnerud I am also facing same issue , below is the code snippet which I am using df=spark.read.format("com.crealytics.spark.excel").option("header","true").load("/mnt/dataplatform/Tenant_PK/Results.xlsx")I have a couple of date colum...

10 kudos

11-17-2022 6:56:19 AM

3 More Replies

by Anonymous • Not applicable

06-11-2021 8:02:20 AM

5944 Views
2 replies
0 kudos

Cluster Modes

Given that there are three different kinda of cluster modes, when is it appropriate to use each one?

Data Engineering

5944 Views
2 replies
0 kudos

06-11-2021 8:02:20 AM

View Replies

Latest Reply

User16826994223
Databricks Employee

06-14-2021 5:45:52 AM

0 kudos

Standard clustersA Standard cluster is recommended for a single user. Standard clusters can run workloads developed in any language: Python, SQL, R, and Scala.High Concurrency clustersA High Concurrency cluster is a managed cloud resource. The key be...

0 kudos

06-14-2021 5:45:52 AM

1 More Replies

by am777 • New Contributor

11-16-2022 7:04:11 PM

9335 Views
1 replies
1 kudos

I am new to Databricks and SQL. My CASE statement is not working and I cannot figure out why. Below is my code and the error message I'm receiving. Grateful for any and all suggestions. I'm trying to put yrs_to_mat into buckets.

SELECT *, yrs_to_mat, CASE WHEN < 3 THEN "under3" WHEN => 3 AND < 5 THEN "3to5" WHEN => 5 AND < 10 THEN "5to10" WHEN => 10 AND < 15 THEN "10to15" WHEN => 15 THEN "over15" ELSE null END AS maturity_bucket FROM mat...

Data Engineering

9335 Views
1 replies
1 kudos

11-16-2022 7:04:11 PM

View Replies

Latest Reply

Pat
Esteemed Contributor

11-17-2022 4:40:44 AM

1 kudos

Hi @Anne-Marie Wood ,I think it's more SQL general issue:you are not comparing any value to `< 3`it should be something like :WHEN X < 3 THEN "under3" SELECT *, yrs_to_mat, CASE WHEN X < 3 THEN "under3" WHEN X => 3 AND <...

1 kudos

11-17-2022 4:40:44 AM

by LukaszJ • Contributor III

08-17-2022 10:26:10 AM

5949 Views
5 replies
4 kudos

Resolved! Mount Azure Blob Storage with Cluster access control

Hello.I want to mount and share for the one group the container from Azure Blob Storage (It could be simple blob storage or Azure Data Lake Storage gen 2). But I am not able to do it because I am using Cluster with Table Access Control.This is my cod...

Data Engineering

5949 Views
5 replies
4 kudos

08-17-2022 10:26:10 AM

View Replies

Latest Reply

LukaszJ
Contributor III

11-17-2022 1:05:16 AM

4 kudos

I have a good solution to the problem:I am using Python library.There are some documentation.Topic to be closed.Best regards,Łukasz

4 kudos

11-17-2022 1:05:16 AM

4 More Replies

by HashStudioz • New Contributor

11-17-2022 12:42:42 AM

798 Views
0 replies
0 kudos

Rs 485 IoT Gateway

RS-485 IoT Gateway is used for transmitting data from one device to another usually far away by using a wired LAN or a Wi-Fi. HashStudioz Technologies Inc. provides Smart IoT Gateway Solutions for Businesses like Pharma industries Etc. Our IoT Gatewa...

Data Engineering

798 Views
0 replies
0 kudos

11-17-2022 12:42:42 AM

by logan0015 • Contributor

10-11-2022 10:19:05 AM

2871 Views
3 replies
3 kudos

How do you access a streaming live table's snapshots?

I have read that delta live tables will keep a history of 7 days. However after creating a streaming live table and using the dlt.apply_changes function. With this codedef run_pipeline(table_name,keys,sequence_by): lower_table_name = table_name.l...

Data Engineering

2871 Views
3 replies
3 kudos

10-11-2022 10:19:05 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 11:19:07 PM

3 kudos

Hi @Logan Nicol Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks...

3 kudos

11-16-2022 11:19:07 PM

2 More Replies

by Muni • New Contributor II

10-10-2022 9:10:16 AM

10260 Views
5 replies
0 kudos

Resolved! Can we expose REST API using Databricks python notebook?

We are trying to expose a REST API endpoint using python notebook. Does databricks allow this?

Data Engineering

10260 Views
5 replies
0 kudos

10-10-2022 9:10:16 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 11:08:18 PM

0 kudos

Hi @Muniyappan Mani Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Th...

0 kudos

11-16-2022 11:08:18 PM

4 More Replies

by ABose8 • Contributor

10-10-2022 7:02:09 AM

2877 Views
3 replies
2 kudos

I attended "The Best Data Warehouse Is a Lakehouse" on September 20 but have not recieved my voucher although I recieved a mail which mentio...

I attended "The Best Data Warehouse Is a Lakehouse" on September 20 but have not recieved my voucher although I recieved a mail which mentioned below details.@Kaniz Fatma Please help me on same.Mail Id: bose.innovator8@gmail.comPlease complete the ...

Data Engineering

2877 Views
3 replies
2 kudos

10-10-2022 7:02:09 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 10:59:53 PM

2 kudos

Hi @Arindam Bose That's great to hear! Please mark the answer as best.Thanks and Regards

2 kudos

11-16-2022 10:59:53 PM

2 More Replies

by Sajid1 • Contributor

10-10-2022 5:34:52 AM

5602 Views
3 replies
6 kudos

Resolved! Photon Acceleration not getting enabled for ML runtime in Azure

I tried to enable the photon acceleration in ML runtime 9.1 LTS ML (Scala 2.12,Spark 3.1.2) but getting error "selected runtime version does not support photon".I tried for other versions of ML runtime with single and multinode , access mode being s...

Data Engineering

5602 Views
3 replies
6 kudos

10-10-2022 5:34:52 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 10:39:35 PM

6 kudos

Hi @Sajid Thavalengal Rahiman Does @Kaniz Fatma response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

6 kudos

11-16-2022 10:39:35 PM

2 More Replies

by 140015 • Databricks Partner

10-10-2022 1:49:43 AM

5258 Views
6 replies
1 kudos

Delta Live Tables @expect compare tables count between two stages

Hello,I'm wondering if there is an option to make an expectation on DLT that will compare the number of records between two stages and e.g. fail if there is a difference between those counts?I mean something like this:@dlt.table()def bronze(): Some...

Data Engineering

5258 Views
6 replies
1 kudos

10-10-2022 1:49:43 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 10:15:06 PM

1 kudos

Hi @Jacek Dembowiak Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Th...

1 kudos

11-16-2022 10:15:06 PM

5 More Replies

by SPuchnanda • New Contributor III

10-08-2022 1:05:47 PM

2341 Views
3 replies
1 kudos

I have cleared Databricks Foundational Accreditation today and received the completion certificate. But I didnt receive the badge for the same. By whe...

I have cleared Databricks Foundational Accreditation today and received the completion certificate. But I didnt receive the badge for the same. By when I will receive the badge ?

Data Engineering

2341 Views
3 replies
1 kudos

10-08-2022 1:05:47 PM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 10:11:42 PM

1 kudos

Hi @Sakshi Puchnanda Does @Hubert Dudek response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

1 kudos

11-16-2022 10:11:42 PM

2 More Replies

by plynton • New Contributor II

10-08-2022 8:07:48 AM

2319 Views
3 replies
1 kudos

Incorrect results with df.query()

I have tried pulling a single row from a .csv using df.query()However, the data being returned doesn't coincide with the data I'm expecting - it is pulling a different row. Here is my code:df = spark.read.option("header",True).csv(data_fldr + "config...

Data Engineering

2319 Views
3 replies
1 kudos

10-08-2022 8:07:48 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 10:06:03 PM

1 kudos

Hi @Peter Ott Does @Hubert Dudek response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

1 kudos

11-16-2022 10:06:03 PM

2 More Replies

by bozhu • Contributor

10-07-2022 2:41:19 PM

3218 Views
2 replies
1 kudos

Multiple DLT Pipelines Sharing a Single Cluster

When you use Workflows to orchestrate standard notebooks, they can share a single cluster. It will be awesome if we can achieve the same for DLT pipelines orchestrated in a Workflows Job.I understand DLT pipelines utilising their own special clusters...

Data Engineering

3218 Views
2 replies
1 kudos

10-07-2022 2:41:19 PM

View Replies

Latest Reply

Anonymous
Not applicable

11-16-2022 9:17:12 PM

1 kudos

Hi @Bo Zhu Does @Hubert Dudek response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

1 kudos

11-16-2022 9:17:12 PM

1 More Replies

Databricks Community

Forum Posts

Create workspace in Databricks deployed in Google Cloud using terraform

DLT Pipeline failed to Start due to "The Execution Contained atleast one disallowed language

Resolved! Date field getting changed when reading from excel file to dataframe

Cluster Modes

I am new to Databricks and SQL. My CASE statement is not working and I cannot figure out why. Below is my code and the error message I'm receiving. Grateful for any and all suggestions. I'm trying to put yrs_to_mat into buckets.

Resolved! Mount Azure Blob Storage with Cluster access control

Rs 485 IoT Gateway

How do you access a streaming live table's snapshots?

Resolved! Can we expose REST API using Databricks python notebook?

I attended "The Best Data Warehouse Is a Lakehouse" on September 20 but have not recieved my voucher although I recieved a mail which mentio...

Resolved! Photon Acceleration not getting enabled for ML runtime in Azure

Delta Live Tables @expect compare tables count between two stages

I have cleared Databricks Foundational Accreditation today and received the completion certificate. But I didnt receive the badge for the same. By whe...

Incorrect results with df.query()

Multiple DLT Pipelines Sharing a Single Cluster

Databricks to Salesforce Core (Not cloud)

Databricks optimization for query perfomance and p...

Parametrize the DLT pipeline for dynamic loading o...

File Arrival Trigger - Multiple tables

Issue while handling Deletes and Inserts in Struct...