Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Databrickguy
by New Contributor II
  • 1370 Views
  • 1 reply
  • 0 kudos

How to use Java MaskFormatter in Spark SQL?

I created a function based on the Java MaskFormatter in Databricks/Scala. But when I call it from Spark SQL, I receive the error message: Error in SQL statement: AnalysisException: Undefined function: formatAccount. This function is neither a built-in/t...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Tim zhang: The issue is that the formatAccount function is defined as a Scala function, but Spark SQL is looking for a SQL function. You need to register the Scala function as a SQL function so that it can be called from Spark SQL. You can register t...
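The reply is cut off, so here is a rough sketch of the registration step it describes. It is shown in PySpark to keep the examples in one language (in Scala the analogous call is also spark.udf.register), with a hypothetical stand-in for the MaskFormatter logic and `spark` assumed predefined as in a Databricks notebook:

from pyspark.sql.types import StringType

# Hypothetical stand-in for the Scala MaskFormatter-based logic
def format_account(raw: str) -> str:
    return f"{raw[:4]}-{raw[4:]}" if raw and len(raw) >= 8 else raw

# Registering under a SQL name is what makes the function callable from Spark SQL
spark.udf.register("formatAccount", format_account, StringType())
spark.sql("SELECT formatAccount('12345678') AS masked").show()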

chanansh
by Contributor
  • 1388 Views
  • 1 reply
  • 0 kudos

Stream from Azure with credentials

I am trying to read a stream from Azure: (spark.readStream .format("cloudFiles") .option('cloudFiles.clientId', CLIENT_ID) .option('cloudFiles.clientSecret', CLIENT_SECRET) .option('cloudFiles.tenantId', TENTANT_ID) .option("header", "true") .opti...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Hanan Shteingart: It looks like you're using the Azure Blob Storage connector for Spark to read data from Azure. The error message suggests that the credentials you provided are not being used by the connector. To specify the credentials, you can se...
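Since the reply is truncated, here is a sketch of one common way to pass the credentials: as I understand it, the cloudFiles.clientId/clientSecret/tenantId options cover notification setup, while data access itself goes through the ABFS OAuth settings. The storage account name is a placeholder; the client/secret/tenant variables are assumed defined as in the question:

STORAGE_ACCOUNT = "mystorageaccount"  # placeholder

configs = {
    f"fs.azure.account.auth.type.{STORAGE_ACCOUNT}.dfs.core.windows.net": "OAuth",
    f"fs.azure.account.oauth.provider.type.{STORAGE_ACCOUNT}.dfs.core.windows.net":
        "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
    f"fs.azure.account.oauth2.client.id.{STORAGE_ACCOUNT}.dfs.core.windows.net": CLIENT_ID,
    f"fs.azure.account.oauth2.client.secret.{STORAGE_ACCOUNT}.dfs.core.windows.net": CLIENT_SECRET,
    f"fs.azure.account.oauth2.client.endpoint.{STORAGE_ACCOUNT}.dfs.core.windows.net":
        f"https://login.microsoftonline.com/{TENANT_ID}/oauth2/token",
}
# Session-level configuration; the same keys can also go in the cluster's Spark config
for key, value in configs.items():
    spark.conf.set(key, value)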

fhmessas
by New Contributor II
  • 3328 Views
  • 1 reply
  • 0 kudos

Resolved! Autoloader stream with EventBridge message

Hi All, I have a few streaming jobs running, but we have been facing an issue related to messaging. We have multiple feeds within the same root folder, i.e. logs/{accountId}/CloudWatch|CloudTrail|vpcflow/yyyy-mm-dd/logs. Hence, the SQS allows to setup o...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Fernando Messas: Yes, you can configure Autoloader to consume messages from an SQS queue using EventBridge. Here are the steps you can follow: Create an EventBridge rule to filter messages from the SQS queue based on specific criteria (such as the...
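The steps are cut off; as a sketch of the Auto Loader side, file notification mode can be pointed at an existing SQS queue that your EventBridge rule feeds (queue URL, format, and path below are placeholders):

df = (spark.readStream
      .format("cloudFiles")
      .option("cloudFiles.format", "json")
      .option("cloudFiles.useNotifications", "true")
      # Existing queue populated by the EventBridge rule
      .option("cloudFiles.queueUrl", "https://sqs.us-east-1.amazonaws.com/123456789012/autoloader-queue")
      .load("s3://my-bucket/logs/"))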

bchaubey
by Contributor II
  • 4286 Views
  • 1 reply
  • 0 kudos

Unable to connect to Azure Storage with Scala

Hi Team, I am unable to connect to a Storage account with Scala in Databricks; I am getting the below error. AbfsRestOperationException: Status code: -1 error code: null error message: Cannot resolve hostname: ptazsg5gfcivcrstrlrs.dfs.core.windows.net Caused by: Un...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Bhagwan Chaubey: The error message suggests that the hostname for your Azure Storage account could not be resolved. This could happen if there is a network issue, or if the hostname is incorrect. Here are some steps you can try to resolve the issue:...
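One quick diagnostic to add to the truncated checklist above (a sketch, run from a notebook on the cluster): resolve the hostname from the driver to separate DNS problems from auth problems.

import socket

# Raises socket.gaierror if the cluster cannot resolve the storage endpoint
print(socket.gethostbyname("ptazsg5gfcivcrstrlrs.dfs.core.windows.net"))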

Data_Sam
by New Contributor II
  • 1045 Views
  • 1 reply
  • 1 kudos

Streaming data: apply changes error with incoming files

Hi all, When I designed a streaming data pipeline with incoming moving files and used the apply changes function on the silver table, comparing changes between bronze and silver to remove duplicates based on key columns, do you know why I got ignore change to tr...

Latest Reply
Anonymous
Not applicable
  • 1 kudos

@Raymond Huang: The error message "ignore changes to true" typically occurs when you are trying to apply changes to a table using Delta Lake's change data capture (CDC) feature, but you have set the option ignoreChanges to true. This option tells De...
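For reference, a minimal sketch of where that option sits in a stream (table name is a placeholder); with ignoreChanges enabled the stream proceeds past rewritten files but re-emits their rows, so downstream logic still has to dedupe on the key columns:

df = (spark.readStream
      .format("delta")
      .option("ignoreChanges", "true")
      .table("bronze"))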

NakedSnake
by New Contributor III
  • 1154 Views
  • 1 reply
  • 0 kudos

Connect to resource in another AWS account using transit gateway, not working

I'm trying to reach a service hosted in another AWS account through a transit gateway. The Databricks environment was created using Terraform, from the template available in the official documentation. Placing a VM in Databricks' private subnets makes us ab...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Thomaz Moreira: It sounds like there might be an issue with the network configuration of your Databricks cluster. Here are a few things you can check: Make sure that your Databricks cluster is in the same VPC as your service in the other AWS account...
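A small sketch that can help localize the problem (host and port are hypothetical): if this times out from the Databricks driver but succeeds from the test VM in the same subnets, the gap likely sits in the route tables or security groups applied to the cluster rather than in the service:

import socket

socket.create_connection(("10.1.2.3", 443), timeout=5).close()
print("service reachable")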

anonturtle
by New Contributor
  • 1707 Views
  • 1 reply
  • 0 kudos

How does AutoML classify whether a feature is numeric or categorical?

When running AutoML from its UI, it classifies the feature "local_convenience_store" as both a numeric and a categorical column. This affects the result: for numeric columns a scaler is used, while a categorical column is one-hot encoded. For contex...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@hr then: The approach taken by AutoML to classify features as numeric or categorical depends on the specific AutoML framework or library being used, as different implementations may use different methods or heuristics to make this determination. In ...
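The reply is truncated; one concrete lever worth knowing is Databricks AutoML's semantic-type annotation via column metadata (a sketch; verify the metadata key against the AutoML docs for your runtime, and assume df is the training DataFrame):

metadata = dict(df.schema["local_convenience_store"].metadata)
# Force AutoML to treat the column as categorical instead of detecting both types
metadata["spark.contentAnnotation.semanticType"] = "categorical"
df = df.withMetadata("local_convenience_store", metadata)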

Llop
by New Contributor II
  • 1756 Views
  • 1 reply
  • 0 kudos

Delta Live Tables CDC doubts

We are trying to migrate to Delta Live Tables an Azure Data Factory pipeline which loads CSV files and outputs Delta Tables in Databricks. The pipeline is triggered on demand via an external application which places the files in a Storage folder and t...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Enric Llop: When using Delta Live Tables to perform a "rip and replace" operation, where you want to replace the existing data in a table with new data, there are a few things to keep in mind. First, the apply_changes function is used to apply chang...
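Since the reply is cut off, here is a minimal sketch of the apply_changes shape it starts to describe (table, key, and sequencing column names are placeholders; this runs as part of a DLT pipeline, not a plain notebook):

import dlt
from pyspark.sql.functions import col

# On older runtimes this API is named create_streaming_live_table
dlt.create_streaming_table("customers_silver")

dlt.apply_changes(
    target="customers_silver",
    source="customers_bronze",   # streaming source defined elsewhere in the pipeline
    keys=["customer_id"],
    sequence_by=col("load_ts"),  # column that orders competing changes
)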

190809
by Contributor
  • 2227 Views
  • 1 reply
  • 0 kudos

Trying to figure out what is causing non-null values in my bronze tables to be returned as NULL in silver tables.

I have a process which loads data from JSON to a bronze table. It then adds a couple of columns and creates a silver table. But the silver table has NULL values where there were values in the bronze tables. The process is as follows: def load_to_silver(sourc...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Rachel Cunningham: One possible reason for this issue could be a data type mismatch between the bronze and silver tables. It is possible that the column in the bronze table has a non-null value, but the data type of that column is different from th...
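A small self-contained repro of that failure mode (column name is hypothetical): casting a string column that holds non-numeric text silently produces NULLs rather than an error.

bronze = spark.createDataFrame([("abc",), ("123",)], ["amount"])
silver = bronze.withColumn("amount", bronze["amount"].cast("int"))
silver.show()  # "abc" becomes NULL, "123" becomes 123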

Harsh_Paliwal
by New Contributor
  • 3447 Views
  • 1 reply
  • 0 kudos

java.lang.Exception: Unable to start python kernel for ReplId-79217-e05fc-0a4ce-2, kernel exited with exit code 1.

I am running a parameterized autoloader notebook in a workflow. This notebook is being called 29 times in parallel, and FYI, UC is also enabled. I am facing this error: java.lang.Exception: Unable to start python kernel for ReplId-79217-e05fc-0a4ce-2, ke...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Harsh Paliwal: The error message suggests that there might be a conflict with the xtables lock. One thing you could try is to add the -w option as suggested by the error message. You can add the following command to the beginning of your notebook t...
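The suggested command is cut off; as a hypothetical sketch of the -w idea (this needs root, e.g. via an init script, and is only relevant if concurrent iptables calls are actually the culprit):

import subprocess

# -w makes iptables wait for the xtables lock instead of exiting with an error
subprocess.run(["iptables", "-w", "10", "-L", "-n"], check=True)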

Chris_Konsur
by New Contributor III
  • 2857 Views
  • 1 reply
  • 0 kudos

Unit test with Nutter

When I run the simple test in a notebook, it works fine, but when I run it from the Azure ADO pipeline, it fails with the error. Code: def __init__(self): NutterFixture.__init__(self); from runtime.nutterfixture import NutterFixture, tag; class uTestsDa...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Chris Konsur: The error message suggests that there is an issue with the standard output buffer when the Python interpreter is shutting down, which could be related to daemon threads. This error is not specific to Databricks or Azure ADO pipelines, ...
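Independent of that shutdown quirk, here is a minimal Nutter fixture laid out statement by statement, in case the flattened snippet in the question lost its structure (class and test names are hypothetical):

from runtime.nutterfixture import NutterFixture, tag

class uTestsDataPipeline(NutterFixture):
    def __init__(self):
        NutterFixture.__init__(self)

    def assertion_simple(self):
        assert 1 + 1 == 2

result = uTestsDataPipeline().execute_tests()
print(result.to_string())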

danniely
by New Contributor II
  • 12341 Views
  • 1 reply
  • 2 kudos

PySpark RDD fails with pytest

When I call RDD APIs during pytest, it seems like the module "serializer.py" cannot find any other modules under pyspark. I've already looked it up on the internet, and it seems like pyspark modules are not properly importing the modules they refer to. I see ot...

Latest Reply
Anonymous
Not applicable
  • 2 kudos

@hyunho lee: It sounds like you are encountering an issue with PySpark's serializer not being able to find the necessary modules during testing with pytest. One solution you could try is to set the PYTHONPATH environment variable to include the pat...
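The PYTHONPATH suggestion is truncated; it would typically point at $SPARK_HOME/python and the bundled py4j zip. An alternative sketch that sidesteps path juggling, assuming pyspark is pip-installed in the test environment (the fixture normally lives in conftest.py):

import pytest
from pyspark.sql import SparkSession

@pytest.fixture(scope="session")
def spark():
    session = SparkSession.builder.master("local[2]").appName("rdd-tests").getOrCreate()
    yield session
    session.stop()

def test_rdd_roundtrip(spark):
    # Exercises the RDD serializer path that was failing under pytest
    assert spark.sparkContext.parallelize([1, 2, 3]).sum() == 6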

quakenbush
by Contributor
  • 6966 Views
  • 1 reply
  • 0 kudos

Is there something like Oracle's VPD feature in Databricks?

Since I am porting some code from Oracle to Databricks, I have another specific question. In Oracle there's something called Virtual Private Database, VPD. It's a simple security feature used to generate a WHERE clause which the system will add to a u...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Roger Bieri: In Databricks, you can use the UserDefinedFunction (UDF) feature to create a custom function that will be applied to a DataFrame. You can use this feature to add a WHERE clause to a DataFrame based on the user context. Here's an exampl...
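The UDF example is cut off; a related, commonly used pattern for VPD-style row filtering on Databricks is a view keyed on current_user() (table and column names below are hypothetical):

spark.sql("""
    CREATE OR REPLACE VIEW sales_vpd AS
    SELECT *
    FROM sales
    WHERE region_owner = current_user()
""")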

Fed
by New Contributor III
  • 8048 Views
  • 1 reply
  • 0 kudos

Setting checkpoint directory for checkpointInterval argument of estimators in pyspark.ml

Tree-based estimators in pyspark.ml have an argument called checkpointInterval: checkpointInterval = Param(parent='undefined', name='checkpointInterval', doc='set checkpoint interval (>= 1) or disable checkpoint (-1). E.g. 10 means that the cache will ...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Federico Trifoglio: If sc.getCheckpointDir() returns None, it means that no checkpoint directory is set in the SparkContext. In this case, the checkpointInterval argument will indeed be ignored. To set a checkpoint directory, you can use the SparkC...
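A minimal sketch of that step (the DBFS path is a placeholder); set the directory before fitting so checkpointInterval actually takes effect:

sc = spark.sparkContext
sc.setCheckpointDir("dbfs:/tmp/ml_checkpoints")
print(sc.getCheckpointDir())  # no longer None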

Phani1
by Valued Contributor II
  • 4215 Views
  • 1 reply
  • 0 kudos

Best practices/steps for Hive metastore backup and restore

Hi Team, Could you share with us the best practices/steps for Hive metastore backup and restore? Regards, Phanindra

Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Janga Reddy: Certainly! Here are the steps for Hive metastore backup and restore on Databricks. Backup:
  • Stop all running Hive services and jobs on the Databricks cluster.
  • Create a backup directory in DBFS (Databricks File System) where the metadata fi...
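The steps are truncated; as one crude sketch of the metadata-export idea (database name and backup location are placeholders; dbutils is assumed available as in a notebook):

# Export CREATE TABLE statements for every table in a database to DBFS
for t in spark.catalog.listTables("default"):
    ddl = spark.sql(f"SHOW CREATE TABLE default.{t.name}").first()[0]
    dbutils.fs.put(f"dbfs:/metastore_backup/{t.name}.sql", ddl, overwrite=True)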

