Data Engineering

Forum Posts

Sorted by:

by marcuskw • Contributor

4 weeks ago

4155 Views
1 replies
0 kudos

Resolved! Lakehouse Federation for SQL Server and Security Policy

We've been able to setup a Foreign Catalog using the following documentation:https://learn.microsoft.com/en-us/azure/databricks/query-federation/sql-serverHowever the tables that have RLS using a Security Policy appear empty. I imagine that this solu...

Data Engineering

4155 Views
1 replies
0 kudos

4 weeks ago

View Replies

Latest Reply

marcuskw
Contributor

4 weeks ago

0 kudos

Was a bit quick here, found out that the SUSER_NAME() of the query is of course the connection that was setup.So the User/Password defined here:Once I added that same user to the RLS logic I get the correct result.

0 kudos

4 weeks ago

by 64883 • New Contributor

11-30-2022 1:59:51 AM

439 Views
1 replies
0 kudos

Support for Delta tables multicluster writes in Databricks cluster

Hello, We're using Databricks on AWS and we've recently started using Delta tables. We're using R.While the code below[1] works in a notebook, when running it from RStudio on a Databricks cluster we get the following error: java.lang.IllegalStateExce...

Data Engineering

439 Views
1 replies
0 kudos

11-30-2022 1:59:51 AM

View Replies

Latest Reply

NandiniN
Valued Contributor II

a month ago

0 kudos

Sorry, for being very late here - If you can not use multi write to false, we can try to split this table into separate tables for each stream.

0 kudos

a month ago

by _Raju • New Contributor II

a month ago

620 Views
2 replies
0 kudos

Cast string to decimal

Hello, can anyone help me with the below error.I'm trying to cast the string column into decimal. When I try to do that I'm getting the "Py4JJavaError: An error occurred while calling t.addCustomDisplayData. : java.sql.SQLException: Status of query a...

Data Engineering

620 Views
2 replies
0 kudos

a month ago

View Replies

Latest Reply

Kaniz
Community Manager

a month ago

0 kudos

Hi @_Raju, The error message you provided indicates that there’s an issue with the value 'WVL383', which is not recognized as a valid numeric value. Here are some steps to troubleshoot and resolve this issue: First, examine the data in the column yo...

0 kudos

a month ago

1 More Replies

by BeginnerBob • New Contributor III

07-18-2022 8:33:55 AM

15886 Views
6 replies
3 kudos

Convert Date to YYYYMMDD in databricks sql

Hi,I have a date column in a delta table called ADate. I need this in the format YYYYMMDD.In TSQL this is easy. However, I can't seem to be able to do this without splitting the YEAR, MONTH and Day and concatenating them together.Any ideas?

Data Engineering

15886 Views
6 replies
3 kudos

07-18-2022 8:33:55 AM

View Replies

Latest Reply

JayDoubleYou42
New Contributor II

a month ago

3 kudos

I'll share I'm having a variant of the same issue. I have a varchar field in the form YYYYMMDD which I'm trying to join to another varchar field from another table in the form of MM/DD/YYYY. Does anyone know of a way to do this in SPARK SQL without s...

3 kudos

a month ago

5 More Replies

by Cami • Contributor III

a month ago

773 Views
2 replies
0 kudos

VIEW JSON result value in view which based on volume

Hello guys!I have the following case:It has been decided that the json file will be read from a following definition ( from volume) , which more or less looks like this: CREATE OR REPLACE VIEW [catalog_name].[schema_name].v_[object_name] AS SELECT r...

Data Engineering

773 Views
2 replies
0 kudos

a month ago

View Replies

Latest Reply

Kaniz
Community Manager

a month ago

0 kudos

Hi @Cami, It seems you’re encountering an issue with the NULL value in the result column of your view despite having the flag enabled. Let’s address this step by step: View Column Not Null Issue: When creating a view, SQL Server sometimes belie...

0 kudos

a month ago

1 More Replies

by akisugi • New Contributor III

04-06-2024 7:12:37 AM

1410 Views
5 replies
0 kudos

Resolved! Is it possible to control the ordering of the array values created by array_agg()?

Hi! I would be glad to ask you some questions.I have the following data. I would like to get this kind of result. I want `move` to correspond to the order of `hist`.Therefore, i considered the following query.```with tmp as (select * from (values(1, ...

Data Engineering

1410 Views
5 replies
0 kudos

04-06-2024 7:12:37 AM

View Replies

Latest Reply

akisugi
New Contributor III

a month ago

0 kudos

Hi @ThomazRossito This is a great idea. It can solve my problem.Thank you.

0 kudos

a month ago

4 More Replies

by 57410 • New Contributor

a month ago

713 Views
1 replies
0 kudos

Deploy python application with submodules - Poetry library management

Hi,I'm using DBX (I'll soon move to Databricks Asset Bundle, but it doesn't change anything in my situation) to deploy a Python application to Databricks. I'm also using Poetry to manage my libraries and dependencies.My project looks like this :Proje...

Data Engineering

713 Views
1 replies
0 kudos

a month ago

View Replies

Latest Reply

Kaniz
Community Manager

a month ago

0 kudos

Hi @57410, It seems you’re transitioning from DBX to Databricks Asset Bundles (DABs) for managing your complex data, analytics, and ML projects on the Databricks platform. Let’s dive into the details and address the issue you’re facing. Databricks...

0 kudos

a month ago

by cool_cool_cool • New Contributor II

a month ago

294 Views
2 replies
2 kudos

Trigger Dashboard Update At The End of a Workflow

Heya I have a workflow that computes some data and writes to a delta table, and I have a dashboard that is based on the table. How can I trigger refresh on the dashboard once the workflow is finished? Thanks!

Data Engineering

294 Views
2 replies
2 kudos

a month ago

View Replies

Latest Reply

Kaniz
Community Manager

a month ago

2 kudos

Hi @cool_cool_cool, To ensure your dashboard reflects the most up-to-date data after your Databricks workflow completes, consider the following options: Scheduled Notebook Refresh: Dashboards do not automatically live-refresh when presented from ...

2 kudos

a month ago

1 More Replies

by mikeagicman • New Contributor

a month ago

291 Views
1 replies
0 kudos

Handling Unknown Fields in DLT Pipeline

HiI'm working on a DLT pipeline where I read JSON files stored in S3.I'm using the auto loader to identify the file schema and adding schema hints for some fields to specify their type.When running it against a single data file that contains addition...

Data Engineering

291 Views
1 replies
0 kudos

a month ago

View Replies

Latest Reply

Kaniz
Community Manager

a month ago

0 kudos

Hi @mikeagicman, When you encounter the error message 'terminated with exception: [UNKNOWN_FIELD_EXCEPTION.NEW_FIELDS_IN_RECORD_WITH_FILE_PATH] Encountered unknown fields during parsing.', it means that the data file contains fields that are not defi...

0 kudos

a month ago

by EdemSeitkh • New Contributor III

01-23-2024 6:52:08 AM

1541 Views
4 replies
0 kudos

Resolved! Pass catalog/schema/table name as a parameter to sql task

Hi, i am trying to pass catalog name as a parameter into query for sql task, and it pastes it with single quotes, which results in error. Is there a way to pass raw value or other possible workarounds? query:INSERT INTO {{ catalog }}.pas.product_snap...

Data Engineering

1541 Views
4 replies
0 kudos

01-23-2024 6:52:08 AM

View Replies

Latest Reply

lathaniel
New Contributor III

a month ago

0 kudos

@EdemSeitkh can you elaborate on your workaround? Curious how you were able to implement an enum paramter in DBSQL.I'm running into this same issue now.

0 kudos

a month ago

3 More Replies

by 939772 • New Contributor III

a month ago

681 Views
1 replies
0 kudos

Resolved! DLT refresh unexpectedly failing

We're hitting an error with a delta live table refresh since yesterday; nothing has changed in our system yet there appears to be a configuration error: { ... "timestamp": "2024-04-08T23:00:10.630Z", "message": "Update b60485 is FAILED.",...

Data Engineering

681 Views
1 replies
0 kudos

a month ago

View Replies

Latest Reply

939772
New Contributor III

a month ago

0 kudos

Apparently the `custom_tags` of `ResourceClass` is now extraneous -- removing it from config corrected our problem.

0 kudos

a month ago

by brian_zavareh • New Contributor III

04-02-2024 2:48:58 PM

1668 Views
5 replies
4 kudos

Resolved! Optimizing Delta Live Table Ingestion Performance for Large JSON Datasets

I'm currently facing challenges with optimizing the performance of a Delta Live Table pipeline in Azure Databricks. The task involves ingesting over 10 TB of raw JSON log files from an Azure Data Lake Storage account into a bronze Delta Live Table la...

Data Engineering

autoloader

bigdata

delta-live-tables

json

1668 Views
5 replies
4 kudos

04-02-2024 2:48:58 PM

View Replies

Latest Reply

standup1
New Contributor III

a month ago

4 kudos

Hey @brian_zavareh , see this document. I hope this can help.https://learn.microsoft.com/en-us/azure/databricks/compute/cluster-config-best-practicesJust keep in mind that there's some extra cost from Azure VM side, check your Azure Cost Analysis for...

4 kudos

a month ago

4 More Replies

by standup1 • New Contributor III

a month ago

926 Views
2 replies
0 kudos

Recover a deleted DLT pipeline

Hello,does anyone know how to recover a deleted dlt pipeline, or at least recover deleted tables that were managed by the dlt pipeline ? We have a pipeline that stopped working and throwing all kind of errors, so we decided to create a new one and de...

Data Engineering

926 Views
2 replies
0 kudos

a month ago

View Replies

Latest Reply

standup1
New Contributor III

a month ago

0 kudos

Thank you, Kanzi. Just to confirm that I understood you correctly. If the pipeline is deleted [like in our case] without having version control, backup configuration..etc already implemented. There's no way to recover those tables, not the pipeline. ...

0 kudos

a month ago

1 More Replies

by Adrianj • New Contributor III

02-26-2024 11:36:42 AM

2631 Views
9 replies
5 kudos

Databricks Bundles - How to select which jobs resources to deploy per target?

Hello, My team and I are experimenting with bundles, we follow the pattern of having one main file Databricks.yml and each job definition specified in a separate yaml for modularization. We wonder if it is possible to select from the main Databricks....

Data Engineering

2631 Views
9 replies
5 kudos

02-26-2024 11:36:42 AM

View Replies

Latest Reply

HrushiM
New Contributor II

a month ago

5 kudos

Hi @Adrianj , Please refer this medium.com post. I have tried explaining how dynamically you can change the content of the databricks.yml for each of the environment by maintaining single databricks.yml file with adequate level of parameters. In your...

5 kudos

a month ago

8 More Replies

by Shas_DataE • New Contributor II

a month ago

717 Views
2 replies
0 kudos

Alerts and Dashboard

Hi Team,In my Databricks workspace, i have created an alerts using the query in such a way the schedule will run on daily basis and the results will get populated to dashboard. The results from dashboard will be notified via email, but i am seeing re...

Data Engineering

717 Views
2 replies
0 kudos

a month ago

View Replies

Latest Reply

Ayushi_Suthar
Honored Contributor

a month ago

0 kudos

HI @Shas_DataE, Good Day! Could you please check and confirm if there are any special characters in the table column? At this moment, special characters are compatible with Excel. If yes then please drop the column that has that special character a...

0 kudos

a month ago

1 More Replies

User

Count

1603

736

344

284

247

Databricks

Forum Posts

Resolved! Lakehouse Federation for SQL Server and Security Policy

Support for Delta tables multicluster writes in Databricks cluster

Cast string to decimal

Convert Date to YYYYMMDD in databricks sql

VIEW JSON result value in view which based on volume

Resolved! Is it possible to control the ordering of the array values created by array_agg()?

Deploy python application with submodules - Poetry library management

Trigger Dashboard Update At The End of a Workflow

Handling Unknown Fields in DLT Pipeline

Resolved! Pass catalog/schema/table name as a parameter to sql task

Resolved! DLT refresh unexpectedly failing

Resolved! Optimizing Delta Live Table Ingestion Performance for Large JSON Datasets

Recover a deleted DLT pipeline

Databricks Bundles - How to select which jobs resources to deploy per target?

Alerts and Dashboard

Load multiple delta tables at once from Sql server

Starting Serverless sql cluster on GCP

"Can't login to databricks socket is closed" when ...

Temporary views no longer working for Share Comput...

Does DLT use one single SparkSession?