Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Hubert-Dudek
by Databricks MVP
  • 25130 Views
  • 23 replies
  • 36 kudos

Resolved! SparkFiles - strange behavior on Azure databricks (runtime 10)

When you use from pyspark import SparkFiles and spark.sparkContext.addFile(url), the file is added to the non-DBFS path /local_disk0/, but when you then try to read the file with spark.read.json(SparkFiles.get("file_name")), Spark wants to read it from /dbfs/local_disk0/. I tried als...

Latest Reply
Hubert-Dudek
Databricks MVP
  • 36 kudos

I confirm that, as @Arvind Ravish said, adding the file:/// prefix solves the problem.
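A minimal sketch of that fix, assuming a Databricks notebook where spark is the active SparkSession; the helper name and the file name are illustrative:

```python
# SparkFiles.get() returns a driver-local path under /local_disk0/;
# prefixing it with file:// stops Spark from resolving it under /dbfs.

def to_local_uri(path: str) -> str:
    """Turn a driver-local path into a file:// URI."""
    return path if path.startswith("file:") else "file://" + path

# In a Databricks notebook (illustrative, not run here):
# from pyspark import SparkFiles
# spark.sparkContext.addFile("https://example.com/file_name.json")
# df = spark.read.json(to_local_uri(SparkFiles.get("file_name.json")))
```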

22 More Replies
Vegard_Stikbakk
by New Contributor II
  • 3801 Views
  • 1 reply
  • 3 kudos

Resolved! External functions on a SQL endpoint

I want to create an external function using CREATE FUNCTION (External) and expose it to users of my SQL endpoint. Although this works from a SQL notebook, when I try to use the function from a SQL endpoint, I get "User defined expression is not supporte...

Latest Reply
Hubert-Dudek
Databricks MVP
  • 3 kudos

SQL endpoints use a separate runtime (https://docs.databricks.com/sql/release-notes/index.html#channels), so it seems this is not yet supported. There is CREATE FUNCTION documentation, but it appears to support only SQL syntax https://docs.databricks.com/sql...

dataguy73
by New Contributor
  • 3833 Views
  • 1 reply
  • 1 kudos

Resolved! spark properties files

I am trying to migrate a Spark job from an on-premises Hadoop cluster to Databricks on Azure. Currently, we keep many values in a properties file. When executing spark-submit we pass the parameter --properties /prop.file.txt, and inside t...

Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

I use JSON files and .conf files which reside on the data lake or in the FileStore of DBFS, then read those files using Python/Scala.

BasavarajAngadi
by Contributor
  • 2499 Views
  • 1 reply
  • 4 kudos

Resolved! Hi Experts: I am new to Databricks, please help me with the below. Question: How is a delta table stored in DBFS?

If I create a delta table, is the table stored in Parquet format in the DBFS location? And please share how the Parquet files support schema evolution if I do DML operations. As per my understanding: we read data from the data lake first into a data frame and try to...

Latest Reply
-werners-
Esteemed Contributor III
  • 4 kudos

Delta Lake is Parquet on steroids. The actual data is stored in Parquet files, but you get a bunch of extra functionality (time travel, ACID, optimized writes, MERGE, etc.). Check this page for lots of info. Delta Lake does support schema evolution...
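A hedged sketch of that schema evolution (the table path is hypothetical): appending a DataFrame that carries new columns works once mergeSchema is enabled on the write.

```python
# Option that lets an append add new columns to an existing Delta table.
delta_write_options = {"mergeSchema": "true"}

# On Databricks (illustrative, not run here):
# (df_with_new_column.write.format("delta")
#      .options(**delta_write_options)
#      .mode("append")
#      .save("/mnt/delta/my_table"))
```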

BarakHav
by New Contributor II
  • 1873 Views
  • 0 replies
  • 3 kudos

Automatically Vacuuming a Delta Table on Databricks

Hi all, I've recently checked my bucket size on AWS and saw that its size doesn't make much sense. I decided to vacuum each delta table with 2 weeks of retention. That shrunk the data from 30TB to around 5TB, though I was wondering, shouldn't default...
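For context, VACUUM's RETAIN clause takes hours, so "2 weeks of retention" translates like this sketch (the table name is hypothetical; the default retention is 7 days, i.e. 168 hours):

```python
def retention_hours(days: int) -> int:
    """Convert a retention window in days to the hours VACUUM expects."""
    return days * 24

# On Databricks (illustrative, table name hypothetical):
# spark.sql(f"VACUUM my_db.my_table RETAIN {retention_hours(14)} HOURS")
```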

Gvsmao
by New Contributor III
  • 12294 Views
  • 7 replies
  • 3 kudos

Resolved! SQL Databricks - Spot VMs (Cost Optimized)

Hello! I want to ask a question, please! Referring to Spot VMs with the "Cost Optimized" setting: in the case of an X-Small endpoint, which has 2 workers, if I send 10 simultaneous queries and a worker is evicted, can I get an error in any of these querie...

6 More Replies
Krish123
by New Contributor
  • 2006 Views
  • 0 replies
  • 0 kudos

mount a Azure DL in Databricks

Hello Team, I am quite new to Databricks and I am learning PySpark and Databricks. I am trying to mount a DL Gen2 in Databricks; as part of that I created an app registration, added the DL into the app registration permissions, created a secret, and also adde...
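For anyone at the same step, the usual OAuth (client credentials) mount looks roughly like this sketch; every ID, the secret, the container, and the storage account name are placeholders:

```python
# OAuth client-credentials configs for mounting ADLS Gen2
# (all angle-bracket values are placeholders).
configs = {
    "fs.azure.account.auth.type": "OAuth",
    "fs.azure.account.oauth.provider.type":
        "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
    "fs.azure.account.oauth2.client.id": "<application-client-id>",
    "fs.azure.account.oauth2.client.secret": "<secret-from-dbutils.secrets.get>",
    "fs.azure.account.oauth2.client.endpoint":
        "https://login.microsoftonline.com/<tenant-id>/oauth2/token",
}

# In a notebook (illustrative, not run here):
# dbutils.fs.mount(
#     source="abfss://<container>@<storage-account>.dfs.core.windows.net/",
#     mount_point="/mnt/datalake",
#     extra_configs=configs)
```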

Hubert-Dudek
by Databricks MVP
  • 2014 Views
  • 0 replies
  • 24 kudos

Delta time travel - recovering an unconditional delete

Delta time travel - recovering an unconditional delete. Recovery is a great feature of Delta. Let's check with a real example how the recovery option works. Please watch my new YouTube video about that topic: https://www.youtube.com/watch?v=TrUT6pvFKic

shan_chandra
by Databricks Employee
  • 5519 Views
  • 1 reply
  • 2 kudos

Resolved! java.lang.ArithmeticException: Casting XXXXXXXXXXX to int causes overflow

My job started failing with the below error when inserting rows (timestamp) into a delta table; it was working well before. java.lang.ArithmeticException: Casting XXXXXXXXXXX to int cau...

Latest Reply
shan_chandra
Databricks Employee
  • 2 kudos

This is because the Integer type represents 4-byte signed integer numbers, with a range from -2147483648 to 2147483647. Kindly use double as the data type to insert the "2147483648" value in the delta table. In the below example, the second ...
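The overflow boundary can be checked directly; a plain sketch:

```python
# Spark's IntegerType is a 4-byte signed int; values outside this range
# overflow on cast.
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1

def fits_int32(value: int) -> bool:
    return INT32_MIN <= value <= INT32_MAX

# fits_int32(2147483647) is True, fits_int32(2147483648) is False,
# so the column needs a wider type (e.g. LongType/BIGINT, or double
# as suggested above).
```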

Bency
by New Contributor III
  • 3039 Views
  • 3 replies
  • 2 kudos

Invalid field schema option provided-DatabricksDeltaLakeSinkConnector

I have configured a Delta Lake Sink connector which reads from an AVRO topic and writes to the Delta lake. I have followed the docs and my config looks like below: { "name": "dev_test_delta_connector", "config": { "topics": "dl_test_avro", "inp...

Latest Reply
Bency
New Contributor III
  • 2 kudos

@Hubert Dudek, should I be configuring anything with respect to schema in the connector config? I did successfully stage some data from another topic of a different format (JSON_SR) into a delta lake table, but it's with the AVRO topic that I ge...

2 More Replies
User16826992666
by Databricks Employee
  • 3965 Views
  • 2 replies
  • 1 kudos

Resolved! As an admin of a Databricks SQL environment, can I cancel long running queries?

I don't want one long or poorly written query to block my entire SQL endpoint for everyone else. Do I have the ability to kill specific queries?

Latest Reply
DevB
New Contributor II
  • 1 kudos

Is there a way to stop the session programmatically, like "kill session_id" or something similar in the API?
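Assuming the statement was submitted through the Databricks SQL Statement Execution API, that API exposes a cancel endpoint; a sketch where the workspace host, token, and statement ID are all placeholders:

```python
def cancel_statement_url(host: str, statement_id: str) -> str:
    """Build the SQL Statement Execution API cancel endpoint URL
    (host and statement_id are caller-supplied placeholders)."""
    return f"https://{host}/api/2.0/sql/statements/{statement_id}/cancel"

# Illustrative call (not run here):
# import requests
# requests.post(cancel_statement_url("<workspace-host>", "<statement-id>"),
#               headers={"Authorization": "Bearer <token>"})
```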

1 More Replies
Bency
by New Contributor III
  • 9365 Views
  • 6 replies
  • 4 kudos

Resolved! Databricks Delta Lake Sink Connector

I am trying to use the Databricks Delta Lake Sink Connector (Confluent Cloud) to write to S3. The connector fails on startup with the following error; any help would be appreciated. org.apache.kafka.connect.errors.ConnectException: java.sql.SQLExcepti...

Latest Reply
Bency
New Contributor III
  • 4 kudos

Hi @Kaniz Fatma, yes we did. Looks like it was indeed a whitelisting issue. Thanks @Hubert Dudek @Kaniz Fatma

5 More Replies
Mike_Gardner
by New Contributor II
  • 2882 Views
  • 1 reply
  • 3 kudos

Resolved! Data Cache in Serverless SQL Endpoints vs Non-Serverless SQL Endpoints

Do Serverless SQL Endpoints benefit from Delta and Spark Cache? If so, does it differ from a non-serverless endpoints? How long does the cache last?

Latest Reply
Hubert-Dudek
Databricks MVP
  • 3 kudos

All SQL endpoints have the delta cache enabled out of the box (in fact, 2X-Small etc. run on E8/E16-series instances, which are delta cache enabled). The delta cache is managed dynamically, so cached data stays as long as there is free RAM for it.

tz1
by New Contributor III
  • 24437 Views
  • 7 replies
  • 6 kudos

Resolved! Problem with Databricks JDBC connection: Error occured while deserializing arrow data

I have a Java program like this to test out the Databricks JDBC connection with the Databricks JDBC driver: Connection connection = null; try { Class.forName(driver); connection = DriverManager.getConnection(url...

Latest Reply
Alice__Caterpil
New Contributor III
  • 6 kudos

Hi @Jose Gonzalez, this similar issue with Snowflake's JDBC driver is a good reference. I was able to get this to work on Java OpenJDK 17 by specifying this JVM option: --add-opens=java.base/java.nio=ALL-UNNAMED. Although I came across another issue with...

6 More Replies
Constantine
by Contributor III
  • 3379 Views
  • 1 reply
  • 4 kudos

Resolved! How to process a large delta table with UDF ?

I have a delta table with about 300 billion rows. Now I am performing some operations on a column using a UDF and creating another column. My code is something like this: def my_udf(data): return pass udf_func = udf(my_udf, StringType()) data...

Latest Reply
Hubert-Dudek
Databricks MVP
  • 4 kudos

A plain Python UDF like that processes rows one at a time with heavy serialization overhead, so better not to use it for such a big dataset. What you need is a vectorized pandas UDF: https://docs.databricks.com/spark/latest/spark-sql/udf-python-pandas.html
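A minimal sketch of the vectorized approach (function and column names are illustrative): the core batch transformation is plain pandas, and it gets wrapped with pandas_udf on the cluster.

```python
import pandas as pd

def upper_batch(col: pd.Series) -> pd.Series:
    """Operates on whole batches of rows at once instead of row-by-row."""
    return col.astype(str).str.upper()

# On Databricks (illustrative, not run here):
# from pyspark.sql.functions import pandas_udf
# from pyspark.sql.types import StringType
# upper_udf = pandas_udf(upper_batch, returnType=StringType())
# data = data.withColumn("new_column", upper_udf("some_column"))
```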
