Data Engineering

Forum Posts

Sorted by:

by Dayaa • New Contributor II

03-13-2023 9:48:54 AM

2718 Views
3 replies
4 kudos

Resolved! Load data into Azure SQL Database from Azure Databricks ( restricted table not a whole workspace tables)

Hi ,I want to share limited tables in my databricks workspace and users will connect to my databricks through Azure Data factory and will load data into Azure SQL. Is this possible using Delta Sharing? Or any other method or tool?

Data Engineering

2718 Views
3 replies
4 kudos

03-13-2023 9:48:54 AM

View Replies

Latest Reply

Anonymous
Not applicable

03-18-2023 8:50:44 PM

4 kudos

Hi @Dayananthan Marimuthu Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your...

4 kudos

03-18-2023 8:50:44 PM

2 More Replies

by JacintoArias • New Contributor III

01-28-2022 2:17:02 AM

8083 Views
5 replies
1 kudos

Spark predicate pushdown on parquet files when using limit

Hi,While developing an ETL for a large dataset I want to get a sample of the top rows to check that my the pipeline "just runs", so I add a limit clause when reading the dataset.I'm surprised to see that instead of creating a single task as in a sho...

Data Engineering

8083 Views
5 replies
1 kudos

01-28-2022 2:17:02 AM

View Replies

Latest Reply

JacekLaskowski
New Contributor III

03-13-2023 6:34:27 AM

1 kudos

It's been a while since the question was asked, and in the meantime Delta Lake 2.2.0 hit the shelves with the exact feature the OP asked about, i.e. LIMIT pushdown:LIMIT pushdown into Delta scan. Improve the performance of queries containing LIMIT cl...

1 kudos

03-13-2023 6:34:27 AM

4 More Replies

by lizou • Contributor II

01-07-2023 6:58:48 AM

4016 Views
3 replies
0 kudos

bug: add csv data UI: missing leading zero

use add data UI, add csv manually, even set data type as string, the leading zero will be missingexample csvval1,val20012345, abcafter load data, 123,abc is stored in table

Data Engineering

4016 Views
3 replies
0 kudos

01-07-2023 6:58:48 AM

View Replies

Latest Reply

lizou
Contributor II

01-08-2023 4:06:01 PM

0 kudos

there are no issues using spark.read in notebooksthe issue is specific to using Add Data User interface and adding a csv file manually.

0 kudos

01-08-2023 4:06:01 PM

2 More Replies

by Anonymous • Not applicable

06-24-2021 10:12:44 PM

1525 Views
2 replies
0 kudos

Is Autoloader an option to load data to Databricks from Azure SQL?

Data Engineering

1525 Views
2 replies
0 kudos

06-24-2021 10:12:44 PM

View Replies

Latest Reply

sajith_appukutt
Honored Contributor II

06-25-2021 12:01:55 PM

0 kudos

If you are looking for incrementally loading data from Azure SQL, checkout one of our technology partners that support change-data-capture or setup debezium for sql-server. These solutions could land data in a streaming fashion to kafka/kinesis/even...

0 kudos

06-25-2021 12:01:55 PM

1 More Replies

by User16137833804 • Databricks Employee

06-23-2021 1:14:27 PM

1665 Views
1 replies
1 kudos

Once I set up the Git Server Proxy, what would be the best way to set alerts in case the Cluster Proxy goes down?

Data Engineering

1665 Views
1 replies
1 kudos

06-23-2021 1:14:27 PM

View Replies

Latest Reply

sajith_appukutt
Honored Contributor II

06-23-2021 7:51:55 PM

1 kudos

You could have the single node cluster where proxy is installed monitored by one of the tools like cloudwatch, azure monitor, datadog etc and have it configured to send alerts on node failure

1 kudos

06-23-2021 7:51:55 PM

by User16826994223 • Honored Contributor III

06-15-2021 9:03:27 AM

5885 Views
1 replies
0 kudos

What does it mean that Delta Lake supports multi-cluster writes

What does it mean that Delta Lake supports multi-cluster writes ,Please explain , Ca we write same delta table with Multiple cluster

Data Engineering

5885 Views
1 replies
0 kudos

06-15-2021 9:03:27 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-15-2021 9:03:40 AM

0 kudos

It means that Delta Lake does locking to make sure that queries writing to a table from multiple clusters at the same time won’t corrupt the table. However, it does not mean that if there is a write conflict (for example, update and delete the same t...

0 kudos

06-15-2021 9:03:40 AM

by Anonymous • Not applicable

06-02-2021 5:01:52 PM

1107 Views
0 replies
0 kudos

Append subset of columns to target Snowflake table

I’m using the databricks-snowflake connector to load data into a Snowflake table. Can someone point me to any example of how we can append only a subset of columns to a target Snowflake table (for example some columns in the target snowflake table ar...

Data Engineering

1107 Views
0 replies
0 kudos

06-02-2021 5:01:52 PM

Databricks Community

Resolved! Load data into Azure SQL Database from Azure Databricks ( restricted table not a whole workspace tables)

Spark predicate pushdown on parquet files when using limit

bug: add csv data UI: missing leading zero

Is Autoloader an option to load data to Databricks from Azure SQL?

Once I set up the Git Server Proxy, what would be the best way to set alerts in case the Cluster Proxy goes down?

What does it mean that Delta Lake supports multi-cluster writes

Append subset of columns to target Snowflake table