Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

User16835756816
by Databricks Employee
  • 6034 Views
  • 1 replies
  • 8 kudos

Announcing: Workflows!

Databricks is excited to announce the general availability of Databricks Workflows to you, our community. Databricks Workflows is the fully managed lakehouse orchestration service for all your teams to build reliable data, analytics, and AI workflow...

Latest Reply
PawanShukla
New Contributor III
  • 8 kudos

I am trying to run the Workflow Pipeline with the sample code shared in Getting Started, and am getting the below error: DataPlaneException: Failed to start the DLT service on cluster 0526-084319-7hucy1np. Please check the stack trace below or driver logs fo...

as999
by New Contributor III
  • 2911 Views
  • 2 replies
  • 3 kudos

Terraform: import/copy multiple notebooks from repos?

From the article below, I am able to copy only a single notebook to the Databricks workspace; it does not support copying multiple notebooks using an asterisk (i.e. *), and under the resource databricks_notebook, the for_each statement is not recognizing databricks_n...

Latest Reply
Atanu
Databricks Employee
  • 3 kudos

Hi @as999, is there an error you are getting, or is it simply not copying multiple notebooks? Can you please share your code too so that I can take a look. Thanks.

1 More Replies
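Two patterns are commonly suggested for this thread's question. In Terraform itself, a for_each over fileset() (e.g. for_each = fileset(path.module, "notebooks/*.py")) on the databricks_notebook resource is one approach. Alternatively, notebooks can be pushed with the Workspace API's import endpoint. The sketch below builds request bodies for POST /api/2.0/workspace/import; the notebook paths and contents are hypothetical, and sending the request (with a bearer token) is left as a comment.

```python
import base64

def build_import_payload(workspace_path: str, source_code: str) -> dict:
    """Build the request body for POST /api/2.0/workspace/import,
    which expects the notebook content base64-encoded."""
    return {
        "path": workspace_path,
        "format": "SOURCE",
        "language": "PYTHON",
        "overwrite": True,
        "content": base64.b64encode(source_code.encode("utf-8")).decode("ascii"),
    }

# Hypothetical notebooks; in practice you would glob a local repo checkout.
notebooks = {
    "/Shared/etl/load": "print('load')",
    "/Shared/etl/clean": "print('clean')",
}
payloads = [build_import_payload(p, src) for p, src in notebooks.items()]
# Each payload would then be POSTed (with an auth token) to
# https://<workspace-host>/api/2.0/workspace/import
print(len(payloads))  # 2
```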
Sophia_Ars
by New Contributor II
  • 1851 Views
  • 1 replies
  • 1 kudos

Abrupt Subscription Cancellation Issues

Hello Community, I was informed by the Help Desk to post this issue in the community. We've contacted all supporting entities: the billing team, help desk, and sales team, but the issue hasn't been solved yet. My team (Ars Praxia) has an issue of sudden cancellation of s...

CHANDY
by Databricks Partner
  • 1390 Views
  • 0 replies
  • 0 kudos

real time data processing

Say I am getting a customer record from a website. I want to read the message and then insert/update it into a Snowflake table. Depending on whether the insert/update is successful, I need to respond back with a success/failure message in, say, 1 sec. ...

Sunny
by New Contributor III
  • 976 Views
  • 0 replies
  • 1 kudos

Integrate exe into workflow

We need to execute a long-running exe on a Windows machine and are thinking of ways to integrate it with the workflow. The plan is to include the exe as a task in the Databricks workflow. We are thinking of a couple of approaches: create a DB table and...

timothy_uk
by New Contributor III
  • 3714 Views
  • 2 replies
  • 4 kudos

Resolved! Optimum Standard & Premium Tier Strategy

Hi, I would like to deploy Databricks workspaces to build a delta lakehouse to serve both scheduled jobs/processing and ad-hoc/analytical querying workloads. Databricks users comprise both data engineers and data analysts. In terms of requirements...

Latest Reply
timothy_uk
New Contributor III
  • 4 kudos

Hi all thank you for informative answers!

1 More Replies
edwardh
by New Contributor III
  • 6382 Views
  • 5 replies
  • 6 kudos

How to call Cloud Fetch APIs?

About Cloud Fetch, mentioned in this article: https://databricks.com/blog/2021/08/11/how-we-achieved-high-bandwidth-connectivity-with-bi-tools.html Are there any public APIs that can be called directly, without ODBC or JDBC drivers? Thanks.

Latest Reply
edwardh
New Contributor III
  • 6 kudos

Hi @Kaniz Fatma, can you please help with this question? Thanks.

4 More Replies
Deepak_Bhutada
by Databricks Employee
  • 3479 Views
  • 3 replies
  • 3 kudos

Retrieve workspace instance name on E2 architecture (multi-tenant) in notebook running on job cluster

I have a Databricks job on E2 architecture in which I want to retrieve the workspace instance name within a notebook running in a job cluster context, so that I can use it further in my use case. While the call dbutils.notebook.entry_point.getDbutils(...

Latest Reply
Thomas_B_
New Contributor II
  • 3 kudos

Found a workaround for the Azure Databricks question above: dbutils.notebook.getContext().apiUrl will return the regional URI, but this forwards to the workspace-specific one if the workspace id is specified with o=.

2 More Replies
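The workaround in the last reply can be sketched as a small helper: take the regional apiUrl and append the o=<workspace-id> query parameter so it forwards to one specific workspace. The URL and workspace id below are hypothetical placeholders.

```python
from urllib.parse import urlencode

def workspace_url(api_url: str, workspace_id: str) -> str:
    """Append o=<workspace-id> so the regional URI forwards to a specific
    workspace, per the workaround described in the reply above."""
    return f"{api_url.rstrip('/')}/?{urlencode({'o': workspace_id})}"

# Hypothetical values; in a notebook, api_url would come from
# dbutils.notebook.getContext().apiUrl
url = workspace_url("https://westeurope.azuredatabricks.net", "1234567890123456")
print(url)  # https://westeurope.azuredatabricks.net/?o=1234567890123456
```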
Phani1
by Databricks MVP
  • 2506 Views
  • 1 replies
  • 2 kudos

Resolved! Is it possible to have multiple tabs in a Dashboard? If not, is there any workaround?

Is it possible to have multiple tabs in a Dashboard? If not, is there any workaround for this?

Latest Reply
Prabakar
Databricks Employee
  • 2 kudos

I don't think it will be possible. However, you can raise a feature request via our ideas portal with the requirements so that it might be considered in the future: https://docs.databricks.com/resources/ideas.html

kpendergast
by Contributor
  • 6730 Views
  • 2 replies
  • 2 kudos

Best AWS S3 Bucket Configuration for Auto Loader with Support for Glacier and Future Use Cases

As the title states, I would like to hear how others have set up an AWS S3 bucket to source data with Auto Loader while supporting the ability to archive files into Glacier objects after a certain period of time. We currently have about 20 millio...

Latest Reply
Prabakar
Databricks Employee
  • 2 kudos

@Ken Pendergast To set up Databricks with Auto Loader, please follow the document below: https://docs.databricks.com/spark/latest/structured-streaming/auto-loader.html Fetching data from Glacier is not supported; however, you can try one of the follo...

1 More Replies
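For reference against the doc linked in the reply, a minimal sketch of typical Auto Loader read options follows. The bucket, schema location, and file format are hypothetical assumptions, not taken from the thread; file notifications are shown because the poster mentions a very large bucket.

```python
# A minimal sketch of Auto Loader options, assuming a hypothetical S3 bucket
# and JSON input; see the Auto Loader doc linked in the reply for details.
def autoloader_options(schema_location: str) -> dict:
    return {
        "cloudFiles.format": "json",
        "cloudFiles.schemaLocation": schema_location,
        # File notifications avoid re-listing very large buckets
        # (the poster mentions ~20 million objects).
        "cloudFiles.useNotifications": "true",
    }

opts = autoloader_options("s3://my-bucket/_schemas/orders")
# In a notebook, roughly:
# spark.readStream.format("cloudFiles").options(**opts).load("s3://my-bucket/raw/orders")
print(sorted(opts))
```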
tom_shaffner
by New Contributor III
  • 14903 Views
  • 3 replies
  • 2 kudos

How to take only the most recent record from a variable number of tables in a stream

Short version: I need a way to take only the most recent record from a variable number of tables in a stream. This is a relatively easy problem in SQL or Python pandas (group by and take the newest), but in a stream I keep hitting blocks. I could do i...

Latest Reply
Håkon_Åmdal
New Contributor III
  • 2 kudos

Did you try storing it all to a Delta table with a MERGE INTO [1]? You can optionally specify a condition on "WHEN MATCHED" such that you only insert if the timestamp is newer. [1] https://docs.databricks.com/spark/latest/spark-sql/language-manual/del...

2 More Replies
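The batch logic the poster describes (group by key, keep the newest row) can be sketched in plain Python with a hypothetical (key, timestamp, value) record shape. In a stream, the reply's MERGE INTO suggestion applies the same rule per micro-batch, with a "WHEN MATCHED AND source.ts > target.ts" style condition.

```python
# Keep only the newest record per key; record shape is hypothetical.
def latest_per_key(records):
    newest = {}
    for key, ts, value in records:
        # Replace the stored row only if this row's timestamp is newer.
        if key not in newest or ts > newest[key][0]:
            newest[key] = (ts, value)
    return {k: v for k, (_, v) in newest.items()}

rows = [("a", 1, "old"), ("a", 3, "new"), ("b", 2, "only")]
print(latest_per_key(rows))  # {'a': 'new', 'b': 'only'}
```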
yopbibo
by Contributor II
  • 14541 Views
  • 8 replies
  • 1 kudos

Resolved! Notebook's Widget parameters in SQL cell => howto

dbutils.widgets.text('table', 'product')
%sql select * from ds_data.$table
Hello, the above will work. But how can I do something like:
dbutils.widgets.text('table', 'product')
%sql select * from ds_data.$table_v3
In that example, $table is still my ...

Latest Reply
yopbibo
Contributor II
  • 1 kudos

Maybe I should add that I use DBR 9.1 on a high-concurrency cluster.

7 More Replies
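One workaround sketch for the thread above: read the widget value in Python and build the SQL text yourself, so the "_v3" suffix is not swallowed into the parameter name the way $table_v3 is in a %sql cell. The table name and suffix come from the post; the helper itself is hypothetical.

```python
# Build the query text in Python so the suffix stays separate from the
# widget parameter name.
def versioned_query(table: str, suffix: str = "_v3") -> str:
    return f"SELECT * FROM ds_data.{table}{suffix}"

# In a notebook, table would come from dbutils.widgets.get("table"),
# and the resulting string would be run with spark.sql(...).
print(versioned_query("product"))  # SELECT * FROM ds_data.product_v3
```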
gnosis
by New Contributor II
  • 2102 Views
  • 0 replies
  • 1 kudos

Can anyone tell me why this notebook would fail when run as a Job?

When the notebook is run by the jobs/workflow scheduler, the data is never imported, but the files do get removed. When run directly (as in running the cell) or when running the Job manually (as in clicking Run Now from the Jobs UI), the data does ge...

BorislavBlagoev
by Databricks Partner
  • 8288 Views
  • 3 replies
  • 7 kudos

Resolved! Delete from delta table

What is the best way to delete from the delta table? In my case, I want to read a table from the MySQL database (without a soft delete column) and then store that table in Azure as a Delta table. When the ids are equal I will update the Delta table w...

Latest Reply
Krish-685291
New Contributor III
  • 7 kudos

Hi, I have a similar issue, and I don't see the solution provided here. I want to perform an upsert operation, but along with the upsert I want to delete the records which are missing in the source table but present in the target table. You can think of it as a ma...

2 More Replies
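The last reply's "upsert plus delete rows missing from the source" pattern maps to a single Delta MERGE with a WHEN NOT MATCHED BY SOURCE clause (available on recent Delta Lake / Databricks Runtime versions). The sketch below only builds the statement text; the table and column names are hypothetical, and it would be run with spark.sql(...).

```python
# Build a full-sync MERGE: update matches, insert new rows, and delete target
# rows absent from the source. Names are hypothetical placeholders.
def full_sync_merge(target: str, source: str, key: str) -> str:
    return (
        f"MERGE INTO {target} AS t USING {source} AS s ON t.{key} = s.{key} "
        "WHEN MATCHED THEN UPDATE SET * "
        "WHEN NOT MATCHED THEN INSERT * "
        "WHEN NOT MATCHED BY SOURCE THEN DELETE"
    )

stmt = full_sync_merge("target_tbl", "source_tbl", "id")
print(stmt)
```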