Hi! I want to migrate all my Databricks-related code from one GitHub repo to another. I knew this wouldn't be straightforward. When I copy my code for one DLT, I get the error org.apache.spark.sql.catalyst.ExtendedAnalysisException: Table 'vessel_batt...
Hi, we tried Delta Sharing to PBI, which worked fine, but we are facing issues while trying to apply row- or column-level filtering or data masking. It fails with an error that it's not supported. Can anyone please confirm if Delta Sharing with masking rules works w...
Hi @Anshul_DBX good day!
The issue you are encountering is due to a limitation in Delta Sharing. As per the provided information, Delta Sharing does not support row-level security or column masks. This means that you cannot apply row- and column-level...
Is there a way that I can set up and configure a Databricks workflow job and its tasks from the Databricks CLI or API tools using Python? Any help would be appreciated. #databricksworkflow #databricks
Hello, and yes, you can set up and configure a Databricks workflow job and tasks using the Databricks CLI or API tools with Python. Here are some resources and steps to guide you:
Create and run Databricks Jobs: this document (https://docs.databrick...
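For the API route, here is a minimal sketch that creates a one-task notebook job through the Jobs API 2.1; the host, token, notebook path, and cluster settings are placeholders:

```python
import requests

# Placeholders: substitute your workspace URL and a personal access token.
HOST = "https://<your-workspace>.cloud.databricks.com"
TOKEN = "<personal-access-token>"

# A one-task job definition: run a notebook on a fresh job cluster.
job_spec = {
    "name": "example-workflow",
    "tasks": [
        {
            "task_key": "ingest",
            "notebook_task": {"notebook_path": "/Workspace/Users/me/ingest"},
            "new_cluster": {
                "spark_version": "14.3.x-scala2.12",
                "node_type_id": "i3.xlarge",
                "num_workers": 1,
            },
        }
    ],
}

resp = requests.post(
    f"{HOST}/api/2.1/jobs/create",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json=job_spec,
)
resp.raise_for_status()
print("Created job:", resp.json()["job_id"])
```

The databricks-sdk Python package wraps the same endpoint, so if you prefer a typed client over raw HTTP, the equivalent can be done with WorkspaceClient().jobs.create(...).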
Hi all! In our project, we're thinking about "Validation, Correction and Enrichment of Postal Addresses" with Databricks. For sure we'd need some kind of batch processing, because we have millions of addresses in our system. I'm aware of Address Valida...
Happy to help. Feel free to reach out https://www.linkedin.com/in/saleh-sultan-143ab036?utm_source=share&utm_campaign=share_via&utm_content=profile&utm_medium=android_app
Hi Team, is there a particular reason why we should avoid using UDFs and instead convert to DataFrame code? Are there any restrictions or limitations (in terms of performance or governance) when using UDFs in Databricks? Regards, Janga
Hello, some of the things you need to take into consideration: UDFs might introduce significant processing bottlenecks into code execution. Databricks uses a number of different optimizers automatically for code written with the included Apache Spark...
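To make the bottleneck concrete, here is a small sketch contrasting a Python UDF with the equivalent built-in function; the DataFrame and column names are illustrative:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StringType

spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame([("alice",), ("bob",)], ["name"])

# Python UDF: each row is serialized out to a Python worker, which is
# opaque to the Catalyst optimizer (and to Photon), hence the slowdown.
upper_udf = F.udf(lambda s: s.upper() if s else None, StringType())
df.withColumn("name_upper", upper_udf("name")).show()

# Built-in equivalent: stays inside the engine and gets fully optimized.
df.withColumn("name_upper", F.upper("name")).show()
```

The usual rule of thumb is to reach for built-in functions first and treat UDFs as a last resort.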
I am trying to pull data into my Databricks workspace via an external SFTP server. I am using Azure for my compute. To access the SFTP server, they need to whitelist my IP address. My IP address in Azure Databricks seems to be constantly changing fro...
Azure Databricks, like many cloud services, does not provide static IP addresses for outbound connections. This is because the compute resources are dynamically allocated and can change over time.
One potential workaround could be to use a Virtual N...
The difference between Global and Temp is how the lifetime of the view is tied to the application: http://spark.apache.org/docs/latest/api/python/reference/api/pyspark.sql.DataFrame.createOrReplaceTempView.html?highlight=createorreplacetempview#pyspar...
Correct. A Temp View is scoped to the SparkSession and dropped when that session closes. Each notebook runs in its own SparkSession. The Global Temp View is scoped to the cluster and dropped when the cluster restarts or you drop it.
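A small sketch of the two scopes side by side (names are illustrative; global temp views live in the reserved global_temp database):

```python
# `spark` is the SparkSession that Databricks notebooks provide.
df = spark.range(5)

# Session-scoped: visible only in this SparkSession, i.e. this notebook.
df.createOrReplaceTempView("my_temp_view")
spark.sql("SELECT * FROM my_temp_view").show()

# Cluster-scoped: registered under the reserved `global_temp` database and
# visible from other SparkSessions on the same cluster until it restarts.
df.createOrReplaceGlobalTempView("my_global_view")
spark.sql("SELECT * FROM global_temp.my_global_view").show()
```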
I want to pass yesterday's date (in the example, 20230115*.csv) into the CSV path. I don't know how to create a parameter and use it here. CREATE OR REPLACE TEMPORARY VIEW abc_delivery_log USING CSV OPTIONS ( header="true", delimiter=",", inferSchema="true", pat...
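If Python is available, one common pattern is to compute yesterday's date and interpolate it into the SQL string; a minimal sketch, assuming a hypothetical storage path:

```python
from datetime import date, timedelta

# Yesterday's date in the yyyyMMdd format used by the file names.
yesterday = (date.today() - timedelta(days=1)).strftime("%Y%m%d")

# `spark` is the SparkSession provided by Databricks notebooks; the path
# below is a placeholder for your actual landing location.
spark.sql(f"""
    CREATE OR REPLACE TEMPORARY VIEW abc_delivery_log
    USING CSV
    OPTIONS (
        header = "true",
        delimiter = ",",
        inferSchema = "true",
        path = "/mnt/landing/abc/{yesterday}*.csv"
    )
""")
```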
@Kaniz @sp1 @Chaitanya_Raju @daniel_sahal Hi everyone, I need the same scenario in SQL code, because my DBR cluster does not allow me to run Python code. Error: "Unsupported cell during execution. SQL warehouses only support executing SQL cells." I appreciate...
Hi, I am having an issue loading source data into a Delta table / Unity Catalog. The error we are receiving is the following: grpc_message:"[DELTA_EXCEED_CHAR_VARCHAR_LIMIT] Exceeds char/varchar type length limitation. Failed check: (isnull(\'metric_...
Hey @Paul92S, looking at the error message, it looks like the column "metric_name" is the culprit here. Understanding the error: character limit violation. The error indicates that values in the metric_name column are exceeding the maximum length allowed fo...
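Two common ways out, sketched below (the DataFrame, table name, and 255-character limit are assumptions; use the length declared in your own schema):

```python
from pyspark.sql import functions as F

# Option 1: truncate values to fit the declared VARCHAR length before writing.
df_fixed = df.withColumn("metric_name", F.substring("metric_name", 1, 255))

# Option 2: widen the column so the length check no longer applies.
# Widening a VARCHAR(n), including to the unbounded STRING type, is the
# kind of type change Delta generally permits, unlike arbitrary changes.
spark.sql("ALTER TABLE my_catalog.my_schema.my_table "
          "ALTER COLUMN metric_name TYPE STRING")
```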
Databricks now supports event-driven workloads, especially for loading cloud files from external locations. This means you can save costs and resources by triggering your Databricks jobs only when new files arrive in your cloud storage instead of mou...
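As an illustration, a file-arrival-triggered job often pairs naturally with Auto Loader, which picks up only the files that arrived since the last run; the bucket paths and table name below are assumptions:

```python
# Incrementally discover new files in the watched location.
df = (spark.readStream
      .format("cloudFiles")
      .option("cloudFiles.format", "csv")
      .option("cloudFiles.schemaLocation", "s3://my-bucket/_schemas/landing")
      .load("s3://my-bucket/landing/"))

# availableNow processes the current backlog and then stops, which suits
# a job that only runs when the file-arrival trigger fires.
(df.writeStream
   .option("checkpointLocation", "s3://my-bucket/_checkpoints/landing")
   .trigger(availableNow=True)
   .toTable("my_catalog.my_schema.landing_table"))
```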
@daniel_sahal I get your point, but if for a scheduled trigger you can get all kinds of attributes about the trigger time (arguably, this is available for all triggers), then why wouldn't the most important attribute of a file event be available ...
I have an idea for sharing and trading IoT data streamed from many data sources on an incentive platform. I would appreciate it if you would discuss the idea with me. Thank you.
Hello @Rene, building an IoT data trading platform using Databricks is indeed a feasible and innovative idea. Databricks provides a unified analytics platform that can handle massive amounts of data processing and advanced analytics, which is essentia...
In the documentation for enabling Iceberg compatibility on Delta tables, it states that the minReaderVersion for IcebergCompatV1 and IcebergCompatV2 is 2 (https://docs.databricks.com/en/delta/uniform.html#requirements). However, when you run the REORG...
@stevenayers-bge I've just checked the source code of Delta and you're right: the documentation states that minReaderVersion should be >= 2, but the source code upgrades it to 3: https://github.com/delta-io/delta/blob/78970abd96dfc0278e21c04cda442bb05ccde4...
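A quick way to see what actually happens to a given table is to run the upgrade and then inspect the protocol properties; a sketch with a hypothetical table name:

```python
# UniForm upgrade syntax as documented for Databricks.
spark.sql("""
    REORG TABLE my_catalog.my_schema.my_table
    APPLY (UPGRADE UNIFORM(ICEBERG_COMPAT_VERSION = 2))
""")

# The reader/writer protocol versions show up as table properties.
props = spark.sql("SHOW TBLPROPERTIES my_catalog.my_schema.my_table")
props.filter("key LIKE 'delta.min%Version'").show(truncate=False)
```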
Hello, we have a Unity Catalog-enabled workspace. To get the completion time of a pipeline that runs multiple times a day, I am checking the system.access.audit table. Comparing the completion time of the pipeline to the times of other pipelines, I am creat...
@angel_ba System tables are still in public preview, so there are some limitations, and one of them is a blocker for your use case: there is currently no support for real-time monitoring. Data is updated throughout the day. If you don't see a log for a recent eve...
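With that caveat in mind, here is a query sketch against system.access.audit for recent events (the columns follow the documented schema, but service and action names are workspace-specific, so inspect the distinct values first):

```python
# Pull the last day's audit events; narrow down by service_name /
# action_name once you know which values your pipeline emits.
events = spark.sql("""
    SELECT event_time, service_name, action_name, request_params
    FROM system.access.audit
    WHERE event_date >= current_date() - INTERVAL 1 DAY
    ORDER BY event_time DESC
""")
events.show(truncate=False)
```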
I am currently trying to use the "Trigger jobs when new files arrive" feature in one of my projects. I have an S3 bucket in which files arrive on random days. So I created a job and set the trigger to the "file arrival" type. And within the no...
It looks like a major oversight not to be able to get the information on which file(s) triggered the job. Anyway, the above explanations given by Anon read like the replies of ChatGPT, especially the scenario where a dataframe is passed to a trigger...
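Since the trigger itself does not expose the triggering file names, one workaround is to have the job rediscover recent arrivals at start time; a sketch assuming a hypothetical bucket path and a one-hour lookback window:

```python
import time

# Placeholder: the location the file-arrival trigger watches.
LANDING = "s3://my-bucket/landing/"
LOOKBACK_MS = 60 * 60 * 1000  # assumed window; tune to your arrival cadence

cutoff = int(time.time() * 1000) - LOOKBACK_MS

# dbutils.fs.ls is available in Databricks notebooks; modificationTime is
# reported in milliseconds since the epoch.
new_files = [f.path for f in dbutils.fs.ls(LANDING) if f.modificationTime > cutoff]
print(new_files)
```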
Hi, I was trying to open Workflows, but there is an error: "An error occurred when loading Jobs and Workflows App." We need help understanding why this happened and how we can resolve it, please.
Same... and the weirdest thing is that all of the services look healthy on https://status.databricks.com/. Region: eu-central-1. Provider: AWS. Could anyone provide some info here?