Data Engineering

Forum Posts

Sorted by:

by User16835756816 • Databricks Employee

11-28-2022 12:04:54 PM

8036 Views
4 replies
11 kudos

How can I extract data from different sources and transform it into a fresh, reliable data pipeline?

Tip: These steps are built out for AWS accounts and workspaces that are using Delta Lake. If you would like to learn more watch this video and reach out to your Databricks sales representative for more information.Step 1: Create your own notebook or ...

Data Engineering

8036 Views
4 replies
11 kudos

11-28-2022 12:04:54 PM

View Replies

Latest Reply

Ajay-Pandey
Databricks MVP

12-04-2022 11:02:29 PM

11 kudos

Thanks @Nithya Thangaraj

11 kudos

12-04-2022 11:02:29 PM

3 More Replies

by BigJay • New Contributor II

10-14-2021 6:36:12 PM

7818 Views
5 replies
5 kudos

Capture num_affected_rows in notebooks

If I run some code, say for an ETL process to migrate data from bronze to silver storage, when a cell executes it reports num_affected_rows in a table format. I want to capture that and log it in my logger. Is it stored in a variable or syslogged som...

Data Engineering

7818 Views
5 replies
5 kudos

10-14-2021 6:36:12 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

10-19-2021 1:25:56 AM

5 kudos

Good one Dan! I never thought of using the delta api for this but there you go.

5 kudos

10-19-2021 1:25:56 AM

4 More Replies

by User16826994223 • Databricks Employee

06-17-2021 12:54:41 AM

1673 Views
1 replies
0 kudos

How is the ETL process different than trigger once stream

I am little confused between what to use between structured stream(trigger once) and etl batch jobs, can I get help here on which basis i should make my decision.

Data Engineering

1673 Views
1 replies
0 kudos

06-17-2021 12:54:41 AM

View Replies

Latest Reply

sajith_appukutt
Databricks Employee

06-22-2021 10:39:50 PM

0 kudos

In Structured Streaming, triggers are used to specify how often a streaming query should produce results. A RunOnce trigger will fire only once and then will stop the query - effectively running it like a batch job.Now, If your source data is a strea...

0 kudos

06-22-2021 10:39:50 PM

Databricks Community

How can I extract data from different sources and transform it into a fresh, reliable data pipeline?

Capture num_affected_rows in notebooks

How is the ETL process different than trigger once stream