Data Engineering

Forum Posts

Sorted by:

by Ela • New Contributor III

02-08-2023 9:44:48 PM

1380 Views
1 replies
1 kudos

Checking for availability of dynamic data masking functionality in SQL.

I am looking forward for functionality similar to snowflake which allows attaching masking to a existing column. Documents found related to masking with encryption but my use case is on the existing table. Solutions using views along with Dynamic Vie...

Data Engineering

1380 Views
1 replies
1 kudos

02-08-2023 9:44:48 PM

View Replies

Latest Reply

sivankumar86
New Contributor II

03-07-2024 3:26:31 PM

1 kudos

Unity catalog provide similar feature https://docs.databricks.com/en/data-governance/unity-catalog/row-and-column-filters.html

1 kudos

03-07-2024 3:26:31 PM

by elgeo • Valued Contributor II

02-15-2023 5:56:42 AM

19829 Views
3 replies
2 kudos

Data type length enforcement

Hello. Is there a way to enforce the length of a column in SQL? For example that a column has to be exactly 18 characters? Thank you!

Data Engineering

19829 Views
3 replies
2 kudos

02-15-2023 5:56:42 AM

View Replies

Latest Reply

databricks31
New Contributor II

03-03-2024 11:26:29 PM

2 kudos

we are facing similar issues while write into adls location delta format, after that we created on top delta location unity catalog tables. below format of data type length should be possible to change spark sql supported ?Azure SQL Spark ...

2 kudos

03-03-2024 11:26:29 PM

2 More Replies

by Ajay-Pandey • Esteemed Contributor III

02-23-2023 3:30:55 AM

1984 Views
2 replies
7 kudos

docs.databricks.com

Rename and drop columns with Delta Lake column mapping. Hi all,Now databricks started supporting column rename and drop.Column mapping requires the following Delta protocols:Reader version 2 or above.Writer version 5 or above.Blog URL##Available in D...

Data Engineering

1984 Views
2 replies
7 kudos

02-23-2023 3:30:55 AM

View Replies

Latest Reply

Poovarasan
New Contributor III

03-03-2024 9:51:03 PM

7 kudos

Above mentioned feature is not working in the DLT pipeline. if the scrip has more than 4 columns

7 kudos

03-03-2024 9:51:03 PM

1 More Replies

by numersoz • New Contributor III

12-07-2022 7:55:32 PM

4353 Views
3 replies
5 kudos

Resolved! Z-Ordering Timestamp Column

Hi,I've large Delta Table for IoT data for over 10K different sensors with timestamp, sensor name and value columns at 1 second precision.Query pattern is usually random 5-100 sensors at a time. But typically involves specific year/month/day interval...

Data Engineering

4353 Views
3 replies
5 kudos

12-07-2022 7:55:32 PM

View Replies

Latest Reply

Oliver_Angelil
Valued Contributor II

07-29-2023 3:31:31 PM

5 kudos

@numersoz did you z-order on the timestamp column or on less granular columns, like Year, Month, or Day. timestamp column is very granular (high cardinality) since it also includes hour, minute, second...

5 kudos

07-29-2023 3:31:31 PM

2 More Replies

by THIAM_HUATTAN • Valued Contributor

06-19-2023 6:41:58 AM

7223 Views
3 replies
0 kudos

Parquet column cannot be converted. Column: [Rainfall_Value], Expected: DoubleType, Found: INT64

Data Engineering

7223 Views
3 replies
0 kudos

06-19-2023 6:41:58 AM

View Replies

Latest Reply

Lakshay
Databricks Employee

06-20-2023 5:59:55 AM

0 kudos

Hi @THIAM HUAT TAN , The issue is because the schema defined for the column "Rainfall_Value" is of DoubleType and the values present in the data frame are of Integer type. This could be because of one or multiple values. Depending on the data, you ...

0 kudos

06-20-2023 5:59:55 AM

2 More Replies

by Rubens • New Contributor II

06-02-2023 7:17:35 AM

2474 Views
1 replies
3 kudos

how to alter a column into an IDENTITY column

Here's me use case: I'm migrating out of an old DWH, into Databricks. When moving dimension tables into Databricks, I'd like old SKs (surrogate keys) to be maintained, while creating the SKs column as an IDENTITY column, so new dimension values get a...

Data Engineering

2474 Views
1 replies
3 kudos

06-02-2023 7:17:35 AM

View Replies

Latest Reply

Anonymous
Not applicable

06-17-2023 3:03:00 AM

3 kudos

Hi @Ronen Levi Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.

3 kudos

06-17-2023 3:03:00 AM

by guostong • New Contributor III

06-16-2023 2:27:16 PM

5224 Views
1 replies
1 kudos

How to update the items in array of struct column with sql

create table test.json_test_01 ( id int, description string, struct_address STRUCT<street_number: STRING, street_name: STRING, city: STRING, province: STRING>, arrary_phone ARRAY<STRUCT<phone_number: STRING, phone_type: STRING>> ); insert into ...

Data Engineering

5224 Views
1 replies
1 kudos

06-16-2023 2:27:16 PM

View Replies

Latest Reply

Anonymous
Not applicable

06-17-2023 2:29:43 AM

1 kudos

Hi @Richard Guo Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.

1 kudos

06-17-2023 2:29:43 AM

by DB_795688_DB_44 • New Contributor II

06-15-2023 2:28:35 AM

2201 Views
4 replies
2 kudos

error: at least one column must be specified for the table.

Data Engineering

2201 Views
4 replies
2 kudos

06-15-2023 2:28:35 AM

View Replies

Latest Reply

Anonymous
Not applicable

06-15-2023 8:27:16 PM

2 kudos

Hi @anand R Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so we ca...

2 kudos

06-15-2023 8:27:16 PM

3 More Replies

by Jujiro • New Contributor III

12-02-2022 9:25:59 AM

9600 Views
11 replies
7 kudos

Random error: At least one column must be specified for the table?

I have the following code in a notebook. It is randomly giving me the error, "At least one column must be specified for the table." The error occurs (if at all it occurs) only on the first run after attaching to a cluster.Cluster details:Summary5-1...

Data Engineering

9600 Views
11 replies
7 kudos

12-02-2022 9:25:59 AM

View Replies

Latest Reply

Harold
New Contributor II

06-05-2023 6:21:51 PM

7 kudos

Please check if this could help or not:spark.databricks.delta.catalog.update.enabled false

7 kudos

06-05-2023 6:21:51 PM

10 More Replies

by Ajay-Pandey • Esteemed Contributor III

06-01-2023 6:37:30 AM

3612 Views
2 replies
3 kudos

Resolved! Column is accessible after dropping the same column

Hi Today I have seen very Strang behavior of databricks.I have dropped one column from a dataframe and assigned the result to a new dataframe but I am able to use the dropped column in the filter command.In general scenario I should get an error but ...

Data Engineering

3612 Views
2 replies
3 kudos

06-01-2023 6:37:30 AM

View Replies

Latest Reply

Sandeep
Contributor III

06-02-2023 3:44:36 AM

3 kudos

@Ajay Pandey , this is a known behavior. Please refer this JIRA for details: https://issues.apache.org/jira/browse/SPARK-30421

3 kudos

06-02-2023 3:44:36 AM

1 More Replies

by Leszek • Contributor

09-16-2022 3:39:39 AM

2686 Views
1 replies
1 kudos

IDENTITY column duplication when using BY DEFAULT parameter

Hi, I created delta table with identity column using this syntax:Id BIGINT GENERATED BY DEFAULT AS IDENTITYMy steps:1) Created table with Id using syntax above.2) Added two rows with Id = 1 and Id = 2 (BY DEFAULT allows to do that).3) Run Insert (wit...

Data Engineering

2686 Views
1 replies
1 kudos

09-16-2022 3:39:39 AM

View Replies

Latest Reply

dileep_vikram
New Contributor II

05-11-2023 3:01:59 AM

1 kudos

Use below alter command to sync the identity column.alter table table_name change column col_name sync identity

1 kudos

05-11-2023 3:01:59 AM

by RichardDriven • New Contributor III

04-19-2023 7:39:02 PM

8144 Views
2 replies
1 kudos

How to apply a UDF to a property in an array of structs

I have a column that contains an array of structs as follows:"column" : [ { "struct_field1": "struct_value", "struct_field2": "struct_value" }, { "struct_field1": "struct_value", "struct_field2": "struct_value" } ]I want to apply a udf to each f...

Data Engineering

8144 Views
2 replies
1 kudos

04-19-2023 7:39:02 PM

View Replies

by Pawelski • New Contributor

04-18-2023 10:54:38 AM

1409 Views
1 replies
0 kudos

incorrect eventsPath in ASP 1.5 - DataFrame & Column module

Data Engineering

1409 Views
1 replies
0 kudos

04-18-2023 10:54:38 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-23-2023 9:15:31 PM

0 kudos

Hi @Paweł Tomczyk Hope everything is going great.Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so...

0 kudos

04-23-2023 9:15:31 PM

by QuicKick • New Contributor

02-13-2023 10:01:21 AM

7152 Views
2 replies
0 kudos

How do I search for all the columns/field names starting with "XYZ"

I would like to do a big search on all field/columns names that contain "XYZ".I tried below sql but it's giving me an error.SELECT table_name,column_nameFROM information_schema.columnsWHERE column_name like '%<account>%'order by table_name, column_na...

Data Engineering

7152 Views
2 replies
0 kudos

02-13-2023 10:01:21 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-17-2023 2:55:21 AM

0 kudos

Hi @Ian Fox Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your ...

0 kudos

04-17-2023 2:55:21 AM

1 More Replies

by Istuti • Contributor

01-13-2023 8:02:55 PM

2545 Views
1 replies
2 kudos

Please guide on the algorithm for masking of column in databricks which is compatible (can be unmasked) with sqlserver.

Data Engineering

2545 Views
1 replies
2 kudos

01-13-2023 8:02:55 PM

View Replies

Latest Reply

Anonymous
Not applicable

04-10-2023 8:02:26 AM

2 kudos

@Istuti Gupta :There are several algorithms you can use to mask a column in Databricks in a way that is compatible with SQL Server. One commonly used algorithm is called pseudonymization or tokenization.Here's an example of how you can implement pse...

2 kudos

04-10-2023 8:02:26 AM