Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

wesg2
by New Contributor
  • 1413 Views
  • 1 reply
  • 0 kudos

Programmatically create Databricks Notebook

I am creating a Databricks notebook via string concatenation (sample below): Notebook_Head = """# Databricks notebook source # from pyspark.sql.types import StringType # from pyspark.sql.functions import split # COMMAND ----------""" Full_NB = Notebook_Head + Mi...

Latest Reply
filipniziol
Esteemed Contributor
  • 0 kudos

Hi @wesg2, one needs to be very precise when building this. The code below works: # Define the content of the .py file with cell separators (works!) notebook_content = """# Databricks notebook source # This is the header of the notebook # You can add i...

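The approach discussed in this thread can be sketched as follows. This is a minimal illustration, not the poster's actual code: the workspace path is hypothetical, and the final API call (the Workspace Import endpoint, which expects base64-encoded source) is shown as a comment.

```python
import base64

# The cell separator must be exactly "# COMMAND ----------" and the source
# must begin with "# Databricks notebook source" to be imported as a notebook.
HEADER = "# Databricks notebook source"
SEPARATOR = "\n\n# COMMAND ----------\n\n"

def build_notebook(cells):
    """Join code cells into Databricks notebook source format."""
    return HEADER + "\n" + SEPARATOR.join(cells)

source = build_notebook([
    "from pyspark.sql.types import StringType",
    "df = spark.range(10)  # placeholder cell",
])

# The Workspace Import API takes the source base64-encoded:
payload = {
    "path": "/Users/someone@example.com/generated_nb",  # hypothetical path
    "format": "SOURCE",
    "language": "PYTHON",
    "content": base64.b64encode(source.encode("utf-8")).decode("ascii"),
    "overwrite": True,
}
# requests.post(f"{host}/api/2.0/workspace/import", headers=auth, json=payload)
```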
DBUser2
by New Contributor III
  • 2378 Views
  • 2 replies
  • 0 kudos

How to use transaction when connecting to Databricks using Simba ODBC driver

I'm connecting to a Databricks instance using the Simba ODBC driver (version 2.8.0.1002), and I am able to read and write the Delta tables. But if I want to do INSERT/UPDATE/DELETE operations within a transaction, I get the below error, an...

Latest Reply
florence023
New Contributor III
  • 0 kudos

@DBUser2 wrote: I'm connecting to a Databricks instance using the Simba ODBC driver (version 2.8.0.1002), and I am able to read and write the Delta tables. But if I want to do INSERT/UPDATE/DELETE operations within a transaction, I get the ...

1 More Replies
FabriceDeseyn
by Contributor
  • 12777 Views
  • 6 replies
  • 6 kudos

Resolved! What does autoloader's cloudfiles.backfillInterval do?

I'm using Auto Loader directory listing mode (without incremental file listing) and sometimes new files are not picked up and found in the cloud_files-listing. I have found that using the 'cloudFiles.backfillInterval' option can resolve the detection ...

Latest Reply
822025
New Contributor II
  • 6 kudos

If we set the backfill to 1 week, will it run only once a week, or will it look for old unprocessed files on every trigger? For example, if we set it to 1 day and the job runs every hour, will it look for files in the past 24 hours on a sliding ...

5 More Replies
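For reference, a configuration sketch of the option under discussion. Auto Loader only runs on Databricks, so this just assembles the options; per the cloudFiles documentation, backfillInterval schedules asynchronous backfills at the configured interval (it re-lists the directory to catch missed files at that cadence, not on every micro-batch). The format, paths, and interval value here are illustrative assumptions.

```python
# Option names follow the documented cloudFiles.* configuration keys.
autoloader_opts = {
    "cloudFiles.format": "json",
    "cloudFiles.backfillInterval": "1 day",          # backfill cadence
    "cloudFiles.schemaLocation": "/tmp/schema",      # hypothetical path
}

# Usage on Databricks (sketch):
# df = (spark.readStream.format("cloudFiles")
#       .options(**autoloader_opts)
#       .load("s3://bucket/path"))
```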
jlanglois98
by New Contributor II
  • 4065 Views
  • 2 replies
  • 0 kudos

Bootstrap timeout during cluster start

Hi all, I am getting the following error when I try to start a cluster in our Databricks workspace for East US 2: Bootstrap Timeout: Compute terminated. Reason: Bootstrap Timeout. Please try again later. Instance bootstrap failed c...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @jlanglois98, take a look at the thread below; it's a similar issue: Solved: Re: Problem with spinning up a cluster on a new wo... - Databricks Community - 29996

1 More Replies
ajbush
by New Contributor III
  • 29067 Views
  • 8 replies
  • 3 kudos

Connecting to Snowflake using an SSO user from Azure Databricks

Hi all, I'm just reaching out to see if anyone has information or can point me in a useful direction. I need to connect to Snowflake from Azure Databricks using the connector: https://learn.microsoft.com/en-us/azure/databricks/external-data/snowflakeT...

Latest Reply
BobGeor_68322
New Contributor III
  • 3 kudos

We ended up using device-flow OAuth because, as noted above, it is not possible to launch a browser on the Databricks cluster from a notebook, so you cannot use the "externalBrowser" flow. It gives you a URL and a code, and you open the URL in a new tab an...

7 More Replies
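A sketch of the device-flow approach the reply describes, using MSAL to obtain an Azure AD token without a local browser. The client ID, tenant ID, scope, and the Spark Snowflake connector option names in the comment are assumptions for illustration, not the replier's actual configuration.

```python
def get_token_via_device_flow(client_id, tenant_id, scope):
    """Obtain an access token via the OAuth device-code flow."""
    import msal  # third-party: pip install msal

    app = msal.PublicClientApplication(
        client_id,
        authority=f"https://login.microsoftonline.com/{tenant_id}",
    )
    flow = app.initiate_device_flow(scopes=[scope])
    # Prints the URL and one-time code to enter in a browser tab elsewhere.
    print(flow["message"])
    result = app.acquire_token_by_device_flow(flow)  # blocks until completed
    return result["access_token"]

# The token could then be passed to the Spark Snowflake connector (sketch):
# options = {"sfUrl": ..., "sfUser": ..., "sfAuthenticator": "oauth",
#            "sfToken": token, "sfDatabase": ..., "sfWarehouse": ...}
```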
biafch
by Contributor
  • 13214 Views
  • 2 replies
  • 4 kudos

Resolved! Failure starting repl. Try detaching and re-attaching the notebook

I just started my manual cluster this morning in the production environment to run some code, and it isn't executing, giving me the error "Failure starting repl. Try detaching and re-attaching the notebook." What can I do to solve this? I have tried...

Latest Reply
biafch
Contributor
  • 4 kudos

Just in case anyone needs to know how to solve this in the future: apparently one of my clusters was suddenly having library compatibility issues, mainly between pandas, numpy, and pyarrow. So I fixed this by forcing specific versions in my global init s...

1 More Replies
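For future readers, a hypothetical shape for such a global init script. The pinned versions below are placeholders, not the poster's actual values; align them with the compatibility matrix of your Databricks Runtime before using anything like this.

```shell
#!/bin/bash
# Hypothetical global init script pinning the libraries named above.
# Versions are assumptions; match them to your runtime.
set -euo pipefail

/databricks/python/bin/pip install --force-reinstall \
    "pandas==1.5.3" \
    "numpy==1.23.5" \
    "pyarrow==8.0.0"
```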
biafch
by Contributor
  • 1891 Views
  • 2 replies
  • 0 kudos

Resolved! Runtime 11.3 LTS not working in my production

Hello, I have a cluster with Runtime 11.3 LTS in my production. Whenever I start it up and try to run my notebooks, it gives me the error: "Failure starting repl. Try detaching and re-attaching the notebook." I have a cluster with the same Runtime in my...

Latest Reply
biafch
Contributor
  • 0 kudos

Just in case anyone needs to know how to solve this in the future: apparently one of my clusters was suddenly having library compatibility issues, mainly between pandas, numpy, and pyarrow. So I fixed this by forcing specific versions in my global init s...

1 More Replies
ImAbhishekTomar
by New Contributor III
  • 1086 Views
  • 2 replies
  • 0 kudos

Drop duplicates in 500B records

I'm trying to drop duplicates in a DF with 500B records, deleting based on multiple columns, but this process takes 5h. I've tried a lot of things available on the internet but nothing works for me. My code is like this: df_1=spark....

Latest Reply
filipniziol
Esteemed Contributor
  • 0 kudos

Drop the duplicates from df_1 and df_2 first and then do the join. If the join is just on a city code, then most likely you know which rows in df_1 and df_2 will give you the duplicates in df_join. So drop them in df_1 and in df_2 instead of df_jo...

1 More Replies
dashawn
by New Contributor
  • 5518 Views
  • 3 replies
  • 1 kudos

DLT Pipeline Error Handling

Hello all. We are a new team implementing DLT and have set up a number of tables in a pipeline loading from S3 with UC as the target. I'm noticing that if any of the 20 or so tables fails to load, the entire pipeline fails even when there are no depende...

Data Engineering
Delta Live Tables
Latest Reply
jose_gonzalez
Databricks Employee
  • 1 kudos

Thank you for sharing this, @Retired_mod. @dashawn, were you able to check Kaniz's docs? Do you still need help, or can you accept Kaniz's solution?

2 More Replies
eriodega
by Contributor
  • 3179 Views
  • 1 reply
  • 0 kudos

Resolved! Escaping $ (dollar sign) in a regex backreference in notebook (so not seen as a parameter)

I am trying to do a regular expression replace in a Databricks notebook. The following query works fine as a regular query (i.e. not run in a cell in a notebook): select regexp_replace('abcd', '^(.+)c(.+)$', '$1_$2') -- normally outputs ab_d. H...

Latest Reply
filipniziol
Esteemed Contributor
  • 0 kudos

Hi, just put a backslash before $ as an escape character: 

geronimo_signol
by New Contributor
  • 831 Views
  • 1 replies
  • 0 kudos

ISSUE: PySpark task exception handling on "Shared Compute" cluster

I am experiencing an issue with a PySpark job that behaves differently depending on the compute environment in Databricks, and this is blocking us from deploying the job into the PROD environment for our planned release. Specifically: - When running th...

Latest Reply
filipniziol
Esteemed Contributor
  • 0 kudos

Hi @geronimo_signol, recently another user reported similar behavior on shared clusters, and both issues seem to be related to Spark Connect. To verify whether your cluster is using Spark Connect, please run the following code in your notebook: pri...

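One way to perform the check the reply hints at (an assumption based on its truncated snippet): a Spark Connect session's class lives under the pyspark.sql.connect package rather than the classic session module, so inspecting the module name of the session object is enough.

```python
def is_spark_connect(spark):
    """Return True if the given session object is a Spark Connect session."""
    return "connect" in type(spark).__module__

# In a notebook:
# print(type(spark))  # e.g. pyspark.sql.connect.session.SparkSession
# print(is_spark_connect(spark))
```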
annetemplon
by New Contributor II
  • 1657 Views
  • 3 replies
  • 0 kudos

Explaining the explain plan

Hi all, I am new to Databricks and have recently started exploring Databricks' explain plans to try to understand how queries are executed (and eventually tune them as needed). There are some things that I can somehow "guess" based on what I know ...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @annetemplon, there are plenty of resources about this topic, but they are scattered all over the internet. I like the videos below; pretty informative: https://m.youtube.com/watch?v=99fYi2mopbs https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&u...

2 More Replies
s3
by New Contributor II
  • 16773 Views
  • 4 replies
  • 8 kudos

Resolved! notebook for SFTP server connectivity without password.

I am trying to develop a script in Python to access an SFTP server from a notebook without a password, using valid public/private keys. However, I am not finding any such example; all the examples have a password in them. Can I get some help?

Latest Reply
Atanu
Databricks Employee
  • 8 kudos

This example looks good to me: https://stackoverflow.com/questions/58562744/how-to-upload-text-file-to-ftp-from-databricks-notebook. Or maybe try using the CData libs: https://www.cdata.com/kb/tech/sftp-jdbc-azure-databricks.rst

3 More Replies
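A minimal sketch of password-less SFTP with key-based auth, using paramiko (one common choice; the links in this thread show other routes too). The host, user, and key path are hypothetical placeholders.

```python
def upload_via_sftp(host, username, key_path, local_path, remote_path):
    """Upload a file over SFTP, authenticating with a private key only."""
    import paramiko  # third-party: pip install paramiko

    key = paramiko.RSAKey.from_private_key_file(key_path)
    transport = paramiko.Transport((host, 22))
    try:
        transport.connect(username=username, pkey=key)
        sftp = paramiko.SFTPClient.from_transport(transport)
        sftp.put(local_path, remote_path)
        sftp.close()
    finally:
        transport.close()

# Usage (hypothetical values):
# upload_via_sftp("sftp.example.com", "svc_user",
#                 "/dbfs/keys/id_rsa", "/tmp/out.csv", "/inbox/out.csv")
```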
maafsl
by New Contributor II
  • 1530 Views
  • 1 replies
  • 2 kudos

Vulnerability in the Guava dependency of the Databricks jdbc driver

Good afternoon. I want to report that the JDBC driver incorporates a version of com.google.guava:guava that has two vulnerabilities (image attached). Could the dependency be updated?

Latest Reply
maafsl
New Contributor II
  • 2 kudos

Dnirmania
by Contributor
  • 2457 Views
  • 2 replies
  • 1 kudos

Resolved! Dynamic Python UDF in unity catalog

Hi team, I am trying to create a Python UDF which I want to use for column masking. This function will take 2 input parameters (column name and group names) and return the column value if the user is part of the group, otherwise return a masked value. I wrote fol...

Latest Reply
menotron
Valued Contributor
  • 1 kudos

Hi @Dnirmania, you could achieve something similar using this UDF: %sql CREATE OR REPLACE FUNCTION ryanlakehouse.default.column_masking(column_value STRING, groups_str STRING) RETURNS STRING LANGUAGE SQL COMMENT 'Return the column value if use...

1 More Replies
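A hedged sketch of how such a SQL masking function could be completed, assuming the group list is comma-separated. The body after RETURN is an assumption, not the replier's actual code; is_account_group_member is the Databricks built-in, and exists() is the Spark SQL higher-order array function.

```sql
-- Sketch only: catalog/schema and signature follow the reply; body is assumed.
CREATE OR REPLACE FUNCTION ryanlakehouse.default.column_masking(
  column_value STRING,
  groups_str   STRING
)
RETURNS STRING
LANGUAGE SQL
RETURN IF(
  -- true if the current user belongs to any of the comma-separated groups
  exists(split(groups_str, ','), g -> is_account_group_member(trim(g))),
  column_value,
  '****'
);
```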
