Data Engineering

Forum Posts

by data_testing1 • New Contributor III

07-05-2022 5:16:18 PM

29113 Views
7 replies
13 kudos

Can databricks be used locally to learn it or is it cloud only

I'm tired of telling clients or referrals I don't know Databricks, but it seems like the only option is to have a big AWS account and then use Databricks on that data. Can I download it locally for training and upskilling with Python, or is it only for cl...

Latest Reply

Kaniz
Community Manager

07-07-2022 10:59:23 PM

13 kudos

Hi @Andrew Schell, we haven't heard from you since the last response from @Hubert Dudek, and I was checking back to see if his suggestions helped you. Or else, if you have any solution, please share it with the community as it can be helpful to other...

6 More Replies

by Danny_Heinrich • New Contributor

07-07-2022 10:56:11 PM

580 Views
0 replies
0 kudos

Unexpected behaviour when creating multiple Account-Level Accounts

We've had a question regarding possibly unexpected behaviour when creating multiple accounts at the account level on https://accounts.cloud.databricks.com/. Short version: It's possible to create multiple accounts with different letter-cases on https://...

by nadia • New Contributor II

06-12-2022 2:19:33 PM

14635 Views
2 replies
2 kudos

Resolved! Executor heartbeat timed out

Hello, I'm trying to read a table that is located on PostgreSQL and contains 28 million rows. I get the following result: "SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in sta...

Latest Reply

jose_gonzalez
Moderator

07-07-2022 5:26:14 PM

2 kudos

Hi @Boumaza nadia, did you check the executor 3 logs when the cluster was active? If you get this error message again, I highly recommend checking the executor's logs to find out what caused the issue.
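
For readers hitting the same error on a large JDBC read, one thing that is often worth trying (it is not confirmed as the fix in this thread) is splitting the read into partitions so that no single executor task pulls all 28 million rows. Below is a minimal sketch under stated assumptions: the host, database, table name, the numeric key column `id`, and the bounds are hypothetical placeholders, and `jdbc_partition_options` is an illustrative helper, not a Spark API. The option names (`partitionColumn`, `lowerBound`, `upperBound`, `numPartitions`, `fetchsize`) are the standard Spark JDBC data source options.

```python
def jdbc_partition_options(url, table, column, lower, upper,
                           num_partitions, fetch_size=10_000):
    """Build the option dict for a partitioned Spark JDBC read.

    Hypothetical helper: only the option keys belong to Spark;
    the function itself is illustrative.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    if lower >= upper:
        raise ValueError("lower must be smaller than upper")
    # Option values are passed as strings here.
    return {
        "url": url,
        "dbtable": table,
        "partitionColumn": column,
        "lowerBound": str(lower),
        "upperBound": str(upper),
        "numPartitions": str(num_partitions),
        "fetchsize": str(fetch_size),
    }


opts = jdbc_partition_options(
    url="jdbc:postgresql://example-host:5432/exampledb",  # placeholder
    table="public.big_table",                             # placeholder
    column="id",                                          # assumed numeric key
    lower=1,
    upper=28_000_000,
    num_partitions=16,
)

# On a cluster, the dict would be handed to the reader like this:
# df = spark.read.format("jdbc").options(**opts).load()
```

Credentials are left out on purpose; they would be added as the `user` and `password` options. The bounds only decide how rows are spread across partitions, they do not filter rows, so rough values for the minimum and maximum key are enough.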

1 More Replies

by WayneDeleersnyd • New Contributor III

06-17-2022 7:59:15 AM

1537 Views
3 replies
1 kudos

Resolved! ipywidgets not working in DBR 11.0 on Community Edition

I'm looking forward to using ipywidgets which should be working in DBR 11.0 as they provide more options when creating a notebook UI. I saw that DBR 11.0 is available as of yesterday so I created a test cluster in the Databricks Community Edition ju...

Latest Reply

User16752242622
Valued Contributor

06-17-2022 8:40:55 AM

1 kudos

Hi @Wayne Deleersnyder, I was able to import ipywidgets in DBR 11.0. As you can see in the output below, the slider is visible. You are probably facing this issue because the Community Edition has limited access. To get all the features you should at l...

2 More Replies

by THIAM_HUATTAN • Valued Contributor

06-22-2022 12:52:14 AM

664 Views
3 replies
2 kudos

pd.read_csv failed

https://i.imgur.com/PAGzSr9.png

Latest Reply

THIAM_HUATTAN
Valued Contributor

06-24-2022 4:43:38 AM

2 kudos

Thanks for your kind reply. The below works for me: https://imgur.com/BmMzatI But why, as you mentioned, does using the classic path, as below, not work? https://imgur.com/Ba1a4Iv

2 More Replies

by liftndrift • New Contributor

07-07-2022 10:22:03 AM

228 Views
0 replies
0 kudos

How to make a paper airplane

by as999 • New Contributor III

05-12-2022 2:05:50 PM

780 Views
2 replies
0 kudos

DBrick workspace URL block outside the corporate network?

For security reasons, we need to restrict/block the Databricks workspace URL outside the corporate network. We tried the IP access list below; it restricts only user login access from outside the corporate network, but the workspace ID URL is still live outside t...

Latest Reply

Anonymous
Not applicable

07-07-2022 9:40:30 AM

0 kudos

Hi @as999, hope everything is going great! Does @Atanu Sarkar's response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly? Else please let us know if you need more help. We'...

1 More Replies

by flachboard • New Contributor

05-12-2022 9:35:34 AM

2731 Views
4 replies
1 kudos

How do you install R packages?

I've tried this, but it doesn't appear to be working: https://community.databricks.com/s/question/0D53f00001GHVX1CAP/unable-to-install-sf-and-rgeos-r-packages-on-the-cluster When I run the following after that init script, I receive an error. library(r...

Latest Reply

Anonymous
Not applicable

07-07-2022 9:17:45 AM

1 kudos

Hey there @Christopher Flach, hope all is well! Just wanted to check in to see if you were able to resolve your issue, and whether you would be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear fr...

3 More Replies

by joel_iemma • New Contributor III

05-12-2022 5:55:38 AM

2461 Views
5 replies
0 kudos

Resolved! A void column was created after connecting to cosmos

Hi everyone, I have connected to Cosmos using this tutorial: https://github.com/Azure/azure-sdk-for-java/tree/main/sdk/cosmos/azure-cosmos-spark_3_2-12/Samples/DatabricksLiveContainerMigration After creating a table using a simple SQL command: CREATE TA...

Latest Reply

Anonymous
Not applicable

07-07-2022 9:14:26 AM

0 kudos

Hey there @Joel iemma, hope all is well! Just wanted to check in if you would be happy to mark an answer as best for us, please? It would be really helpful for the other members too. Cheers!

4 More Replies

by Ashley1 • Contributor

05-11-2022 5:01:34 PM

1703 Views
2 replies
0 kudos

Resolved! JDBC Connectivity via workspace url when No Public IP selected.

Hi All, I think I might be missing something in regard to No Public IP clusters. I have set this option on a workspace (Azure) and set up the appropriate subnets. To my surprise, when I went to set up a JDBC connection to the cluster the JDBC connec...

Latest Reply

Anonymous
Not applicable

07-07-2022 9:04:07 AM

0 kudos

Hey there @Ashley Betts, hope you are well. Just wanted to see if you were able to find an answer to your question, and whether you would like to mark an answer as best? It would be really helpful for the other members too. Cheers!

1 More Replies

by Kash • Contributor III

06-09-2022 6:49:15 AM

5166 Views
19 replies
13 kudos

Resolved! HELP! Converting GZ JSON to Delta causes massive CPU spikes and ETL's take days!

Hi there, I was wondering if I could get your advice. We would like to create a bronze Delta table using GZ JSON data stored in S3, but each time we attempt to read and write it our cluster's CPU spikes to 100%. We are not doing any transformations but s...

Latest Reply

Kash
Contributor III

06-15-2022 5:47:02 AM

13 kudos

Hi Kaniz, Thanks for the note and thank you everyone for the suggestions and help. @Joseph Kambourakis I added your suggestion to our load but I did not see any change in how our data loads or the time it takes to load data. I've done some additional ...

18 More Replies

by amichel • New Contributor III

02-22-2022 11:40:24 AM

4572 Views
5 replies
4 kudos

Resolved! Recommended way to integrate MongoDB as a streaming source

Current state: Data is stored in MongoDB Atlas, which is used extensively by all services. The data lake is hosted in the same AWS region and connected to MongoDB over private link. Requirements: Streaming pipelines that continuously ingest, transform/analyze and ...

Latest Reply

Kaniz
Community Manager

07-07-2022 2:14:29 AM

4 kudos

Hi @Alex Michel, we haven’t heard from you since the last response from the community members, and I was checking back to see if you have a resolution yet. If you have any solution, please share it with the community as it can be helpful to others. Ot...

4 More Replies

by thushar • Contributor

05-27-2022 1:00:28 AM

4024 Views
2 replies
1 kudos

How to install wheel package from git repo

We use VS Code for development, and a wheel package is created for shipment. We put this wheel package in Azure Data Lake Storage, and an ADB notebook accesses the wheel package and installs it in the cluster. It is working fine. But instead of keeping th...

Latest Reply

Kaniz
Community Manager

07-07-2022 4:54:34 AM

1 kudos

Hi @Thushar R, we haven’t heard from you since my last response, and I was checking back to see if the suggestions helped you. Or else, if you have any solution, please do share it with the community as it can be helpful to others.

1 More Replies

by pbezz • New Contributor III

05-02-2022 1:14:16 AM

3495 Views
12 replies
15 kudos

Visualisation libraries does not work on Databricks

Why is it that certain Python visualisation libraries do not work on Databricks? I am trying to install (via pip) and work with some data visualisation libraries - they work perfectly in a normal Jupyter Notebook but not on a Databricks notebook envi...

Latest Reply

pbezz
New Contributor III

06-14-2022 12:17:12 PM

15 kudos

No, I switched to using HTML widgets.

11 More Replies

by Bittu6084 • New Contributor II

05-04-2022 5:00:20 AM

5752 Views
6 replies
5 kudos

Resolved! How can we alter table with auto increment column for a delta table

How can we alter a table with an auto-increment column for a Delta table? I have tried this, but it is not working: ALTER TABLE dbgtpTest.student ADD COLUMN Student_Id identity(100,1) Any suggestions will be helpful.
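
A sketch of one possible direction, not the accepted answer from this thread: on Databricks, Delta identity columns are declared with `GENERATED ALWAYS AS IDENTITY (START WITH ... INCREMENT BY ...)` when the table is created, rather than with a SQL Server-style `identity(100,1)`. As far as I know, `ALTER TABLE ... ADD COLUMN` does not accept an identity definition, so a common workaround is to create a new table and copy the rows over. The helper `identity_table_ddl`, the new table name `dbgtpTest.student_with_id`, and the `Name STRING` column are hypothetical; only `dbgtpTest.student` and `Student_Id` come from the post.

```python
def identity_table_ddl(table, id_column, other_columns, start=1, step=1):
    """Return a CREATE TABLE statement with a Delta identity column.

    Hypothetical helper for illustration; it only assembles SQL text.
    """
    if step == 0:
        raise ValueError("step must be non-zero")
    columns = [
        f"{id_column} BIGINT GENERATED ALWAYS AS IDENTITY "
        f"(START WITH {start} INCREMENT BY {step})"
    ]
    columns.extend(other_columns)
    body = ",\n  ".join(columns)
    return f"CREATE TABLE {table} (\n  {body}\n) USING DELTA"


ddl = identity_table_ddl(
    table="dbgtpTest.student_with_id",  # new table name is an assumption
    id_column="Student_Id",
    other_columns=["Name STRING"],      # placeholder for the real columns
    start=100,
    step=1,
)
print(ddl)

# In a notebook, the statement would be run with: spark.sql(ddl)
```

After the new table exists, the existing rows could be copied with an `INSERT INTO ... SELECT` that lists only the non-identity columns, so that `Student_Id` is generated automatically.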

Latest Reply

Kaniz
Community Manager

06-02-2022 3:57:38 AM

5 kudos

Hi @Bittu6084 (Customer) , Just a friendly follow-up. Do you still need help?

5 More Replies

Databricks

Forum Posts

Best way to parse Google Analytics data in Databri...

DELTA_EXCEED_CHAR_VARCHAR_LIMIT

Not able to set run_as service_principal_name

Pyspark operations slowness in CLuster 14.3LTS as ...

[Databricks Assets Bundles] Workflow trigger on fi...