Data Engineering

Forum Posts

Sorted by:

by Gerhard • New Contributor III

11-25-2022 5:59:42 AM

1032 Views
0 replies
1 kudos

Read proprietary files and transform contents to a table - error resilient process needed

We do have data stored in HDF5 files in a "proprietary" way. This data needs to be read, converted and transformed before it can be inserted into a delta table.All of this transformation is done in a custom python function that takes the HDF5 file an...

Data Engineering

1032 Views
0 replies
1 kudos

11-25-2022 5:59:42 AM

by tassiodahora • New Contributor III

05-23-2022 6:00:35 AM

46980 Views
3 replies
8 kudos

Resolved! Failed to merge incompatible data types LongType and StringType

Guys, good morning!I am writing the results of a json in a delta table, only the json structure is not always the same, if the field does not list in the json it generates type incompatibility when I append(dfbrzagend.write .format("delta") .mode("ap...

Data Engineering

46980 Views
3 replies
8 kudos

05-23-2022 6:00:35 AM

View Replies

Latest Reply

Kaniz
Community Manager

06-14-2022 10:05:20 AM

8 kudos

Hi @Tássio Santos , We haven’t heard from you on the last response from @Chetan Kardekar , and I was checking back to see if you have a resolution yet. If you have any solution, please share it with the community as it can be helpful to others. Oth...

8 kudos

06-14-2022 10:05:20 AM

2 More Replies

by Geeta1 • Valued Contributor

11-24-2022 9:30:46 AM

1185 Views
1 replies
8 kudos

Activate gift certificate in Databricks Community Reward store

Hello all,Can anyone let me know about the "Activate Gift Certificate" option in Databricks Community Reward store? What is its purpose and how we can use it?

Data Engineering

1185 Views
1 replies
8 kudos

11-24-2022 9:30:46 AM

View Replies

Latest Reply

yogu
Honored Contributor III

11-25-2022 1:23:33 AM

8 kudos

you earn points with forum interaction. Those points can be exchanged for 'credits'.With those credits you can buy Databricks swag.Your lifetime points (so the cumulated amount of points) are not affected by this.

8 kudos

11-25-2022 1:23:33 AM

by Manjusha • New Contributor II

10-13-2022 5:16:00 AM

1559 Views
1 replies
1 kudos

SocketTimeout exception when running a display command on spark dataframe

I am using runtime 9.1LTSI have a R notebook that reads a csv into a R dataframe and does some transformations and finally is converted to spark dataframe using the createDataFrame function.after that when I call the display function on this spark da...

Data Engineering

1559 Views
1 replies
1 kudos

10-13-2022 5:16:00 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-24-2022 10:36:21 PM

1 kudos

Hi @Manjusha Unnikrishnan Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question first. Or else bricksters will get back to you soon. Thanks.

1 kudos

11-24-2022 10:36:21 PM

by MBV3 • New Contributor III

11-24-2022 3:09:11 PM

1183 Views
1 replies
2 kudos

Delete a file from GCS folder

What is the best way to delete files from the gcp bucket inside spark job?

Data Engineering

1183 Views
1 replies
2 kudos

11-24-2022 3:09:11 PM

View Replies

Latest Reply

Unforgiven
Valued Contributor III

11-24-2022 8:06:03 PM

2 kudos

@M Baig yes you need just to create service account for databricks and than assign storage admin role to bucket. After that you can mount GCS standard way:bucket_name = "<bucket-name>"mount_name = "<mount-name>"dbutils.fs.mount("gs://%s" % bucket_na...

2 kudos

11-24-2022 8:06:03 PM

by User16844487905 • New Contributor III

10-14-2021 3:31:45 PM

2836 Views
4 replies
5 kudos

AWS quickstart - Cloudformation failure When deploying your workspace with the recommended AWS quickstart method, a Cloudformation template will be la...

AWS quickstart - Cloudformation failureWhen deploying your workspace with the recommended AWS quickstart method, a Cloudformation template will be launched in your AWS account. If you experience a failure with the error message along the lines of ROL...

Data Engineering

2836 Views
4 replies
5 kudos

10-14-2021 3:31:45 PM

View Replies

Latest Reply

yalun
New Contributor III

11-24-2022 7:11:54 AM

5 kudos

How do I launch the "Quickstart" again? Where is it in the console?

5 kudos

11-24-2022 7:11:54 AM

3 More Replies

by SM • New Contributor III

01-20-2022 6:32:50 AM

3426 Views
4 replies
10 kudos

How to use Azure Data lake as a storage location to store the Delta Live Tables?

I am trying write data into Azure Datalake. I am reading files from Azure Blob Storage however when I try to create the Delta Live Table to Azure Datalake I get error the following errorshaded.databricks.azurebfs.org.apache.hadoop.fs.azurebfs.contrac...

Data Engineering

3426 Views
4 replies
10 kudos

01-20-2022 6:32:50 AM

View Replies

Latest Reply

RThornton
New Contributor III

10-05-2022 1:29:54 PM

10 kudos

@Kaniz Fatma I don't think you quite understand the question. I'm running into the same problem. When creating a Delta Live Table pipeline to write to Azure Data Lake Storage (abfss://etc...) as the Storage Location, the pipeline fails with the erro...

10 kudos

10-05-2022 1:29:54 PM

3 More Replies

by AmarJT • New Contributor II

11-23-2022 4:41:48 PM

1827 Views
2 replies
6 kudos

Lakehouse Fundamentals Accreditation badge not received

Hi Team,I have successfully passed the test after completion of the course. But i have not received any badge from your side. I have just been provided a certificate. Certificate ID:ID: E-E04YDVAs mentioned in the web portals i tried accessing "http...

Data Engineering

1827 Views
2 replies
6 kudos

11-23-2022 4:41:48 PM

View Replies

Latest Reply

Geeta1
Valued Contributor

11-24-2022 7:28:59 AM

6 kudos

Hi @Amarjeet Kumar , you will receive the badge in a day after completion. Even I received it a day after I cleared the exam. If you don't receive it the next day also, then you can raise a ticket at https://help.databricks.com/s/contact-us?ReqType...

6 kudos

11-24-2022 7:28:59 AM

1 More Replies

by yogu • Honored Contributor III

11-22-2022 7:29:55 AM

1118 Views
2 replies
18 kudos

Resolved! Explain about "Activate Gift Certificate" section in Databricks Community Rewards.

Hello everyone,Can any one explain about the Active Gift Certificate in in Databricks Community Rewards. And how to use it?

Data Engineering

1118 Views
2 replies
18 kudos

11-22-2022 7:29:55 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

11-24-2022 7:05:21 AM

18 kudos

it boils down to this:you earn points with forum interaction. Those points can be exchanged for 'credits'.With those credits you can buy Databricks swag.Your lifetime points (so the cumulated amount of points) are not affected by this.

18 kudos

11-24-2022 7:05:21 AM

1 More Replies

by yalun • New Contributor III

11-24-2022 7:10:22 AM

491 Views
0 replies
4 kudos

I cannot create a workspace, help me please.

They are grey I cannot click them. And if I hover my cursor on top of them, there is no any info.What am I gonna do?

Data Engineering

491 Views
0 replies
4 kudos

11-24-2022 7:10:22 AM

by -werners- • Esteemed Contributor III

10-13-2022 12:20:55 AM

1322 Views
3 replies
22 kudos

Resolved! Package cells (scala), who uses them?

So I was wondering who uses package cells in scala?We have this library (jar) which has some useful functions we use all over the place. But that's about it. So I think we can do the same thing without a jar but with package cells.But I never hear ...

Data Engineering

1322 Views
3 replies
22 kudos

10-13-2022 12:20:55 AM

View Replies

Latest Reply

Anonymous
Not applicable

11-20-2022 1:40:08 AM

22 kudos

Hi @Werner Stinckens Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.T...

22 kudos

11-20-2022 1:40:08 AM

2 More Replies

by Constantino • New Contributor III

11-23-2022 9:05:02 AM

920 Views
1 replies
2 kudos

Is there any way to prevent non-admin users from creating new jobs?

This is specific to creating new jobs, I understand that various permissions can be set on existing jobs using job access control. This seems to suggest no, I can't find anything in the Databricks docs either.

Data Engineering

920 Views
1 replies
2 kudos

11-23-2022 9:05:02 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

11-24-2022 6:33:51 AM

2 kudos

nope.Looked for that too, but it does not seem to be possible. Perhaps with Unity catalog, as there you have more permission controls.But using Unity is not an overnight decision.

2 kudos

11-24-2022 6:33:51 AM

by Anonymous • Not applicable

11-23-2022 6:59:28 PM

1538 Views
3 replies
28 kudos

Resolved! Refresh Dashboard also make all related queried refresh?

Hi all,I have a quick currious. I know both query and dashboard page in Databricks SQL have refresh button to can them refresh. But one question it, when I'm in Dashboard page and click the refesh button. Does this thing also force every related quer...

Data Engineering

1538 Views
3 replies
28 kudos

11-23-2022 6:59:28 PM

View Replies

Latest Reply

Anonymous
Not applicable

11-24-2022 3:37:06 AM

28 kudos

Thanks all your support. It's totally clear for me now!!!

28 kudos

11-24-2022 3:37:06 AM

2 More Replies

by Pradeep_Namani • New Contributor III

11-23-2022 10:40:57 PM

2827 Views
5 replies
2 kudos

Date field getting changed when reading from excel file to dataframe in pyspark

The date field is getting changed while reading data from source .xls file to the dataframe. In the source xl file all columns are strings but i am not sure why date column alone behaves differentlyIn Source file date is 1/24/1947.In pyspark datafram...

Data Engineering

2827 Views
5 replies
2 kudos

11-23-2022 10:40:57 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

11-24-2022 2:37:36 AM

2 kudos

how about using inferschema one single time to create a correct DF, then create a schema from the df-schema.something like this f.e.from pyspark.sql.types import StructType # Save schema from the original DataFrame into json: schema_json = df.s...

2 kudos

11-24-2022 2:37:36 AM

4 More Replies

by JordanYaker • Contributor

09-23-2022 3:16:51 PM

3362 Views
7 replies
8 kudos

Resolved! Is anyone else experiencing intermittent "Failure starting REPL" errors with PySpark Jobs?

I have a Multi-Task Job that is running a bunch of PySpark notebooks and about 30-60% of the time, my jobs fail with the following error:I haven't seen any consistency with this error. I've had as many as all of the tasks in the job giving this error...

Data Engineering

3362 Views
7 replies
8 kudos

09-23-2022 3:16:51 PM

View Replies

Latest Reply

James_Cole
New Contributor III

11-21-2022 4:05:48 AM

8 kudos

Hi. Did you ever got a resolution to this problem outside of rolling back to 10.4? I have recently moved some workloads over to runtime 11.3 and am experiencing intermittent "repl did not start in 30 seconds." errors.I have increased the repl timeout...

8 kudos

11-21-2022 4:05:48 AM

6 More Replies

User

Count

1602

737

348

285

247

Databricks Community

Forum Posts

Read proprietary files and transform contents to a table - error resilient process needed

Resolved! Failed to merge incompatible data types LongType and StringType

Activate gift certificate in Databricks Community Reward store

SocketTimeout exception when running a display command on spark dataframe

Delete a file from GCS folder

AWS quickstart - Cloudformation failure When deploying your workspace with the recommended AWS quickstart method, a Cloudformation template will be la...

How to use Azure Data lake as a storage location to store the Delta Live Tables?

Lakehouse Fundamentals Accreditation badge not received

Resolved! Explain about "Activate Gift Certificate" section in Databricks Community Rewards.

I cannot create a workspace, help me please.

Resolved! Package cells (scala), who uses them?

Is there any way to prevent non-admin users from creating new jobs?

Resolved! Refresh Dashboard also make all related queried refresh?

Date field getting changed when reading from excel file to dataframe in pyspark

Resolved! Is anyone else experiencing intermittent "Failure starting REPL" errors with PySpark Jobs?

Data prefixed by ' > '

Pandas_UDF not working on shared access mode but w...

Databricks Asset Bundle (DAB) from a Git repo?

I am getting NoneType error when running a query f...

How to import a function to another notebook?