Data Engineering

Forum Posts

Sorted by:

by Chris_Shehu • Valued Contributor III

02-18-2022 12:54:53 PM

8327 Views
8 replies
2 kudos

Resolved! Do compute resources get removed after not being used for x number of days?

Currently we're getting reports of compute resources disappearing from one of our lesser used databricks platforms. I just turned on logging to see if we can find something but I'm wondering if a compute gets removed if it hasn't been used after so l...

Data Engineering

8327 Views
8 replies
2 kudos

02-18-2022 12:54:53 PM

View Replies

Latest Reply

Hanna0805050
New Contributor II

09-05-2022 6:05:20 AM

2 kudos

Pest Control Software to Grow Your Business Choosing the best pest control software for your business can have a powerful impact on your productivity. Fieldwork can help your workforce repel downtime, attract clients, get organized and get everything...

2 kudos

09-05-2022 6:05:20 AM

7 More Replies

by Data_Engineer3 • Contributor III

07-20-2022 9:18:19 PM

7156 Views
4 replies
1 kudos

Unable to read data from Elasticsearch with spark in Databricks.

When I am trying to read data from elasticsearch by spark sql, it throw an error like RuntimeException: Error while encoding: java.lang.RuntimeException: scala.collection.convert.Wrappers$JListWrapper is not a valid external type for schema of string...

Data Engineering

7156 Views
4 replies
1 kudos

07-20-2022 9:18:19 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 5:16:37 AM

1 kudos

Hi there @KARTHICK N Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.T...

1 kudos

09-05-2022 5:16:37 AM

3 More Replies

by rodrigocms • New Contributor

07-19-2022 5:57:44 AM

2617 Views
2 replies
0 kudos

Connect to SSAS

Hello everyone,I need to connect Databricks Pyspark to get information from Power BI XLMA EndPoint - the end point work as an SSAS host.So, I'm trying to find what I need to do to connect to SSAS tabular. Can anyone help?Many thanks.Rodrigo Souza

Data Engineering

2617 Views
2 replies
0 kudos

07-19-2022 5:57:44 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 3:44:10 AM

0 kudos

Hey there @Rodrigo Camara de Souza Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to h...

0 kudos

09-05-2022 3:44:10 AM

1 More Replies

by GoldenTuna • New Contributor II

07-18-2022 7:10:32 AM

3242 Views
3 replies
1 kudos

Bulk removal of inactive users?

To make a long story short, through SCIM we accidentally provisioned 3,000+ users into our Databricks workspace who should not be there. We fixed the SCIM issue but now the workspaces tab is flooded with inactive user workspaces. Is there any way to ...

Data Engineering

3242 Views
3 replies
1 kudos

07-18-2022 7:10:32 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 3:19:47 AM

1 kudos

Hello @David Kruetzkamp Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from yo...

1 kudos

09-05-2022 3:19:47 AM

2 More Replies

by sabooalex • New Contributor II

09-05-2022 3:03:10 AM

1389 Views
0 replies
0 kudos

SCD type2 snowflake

I have monthly files which comes in S3 bucket. I want to implement SCD type2 in snowflake.I am ok to read the new files, clean it.My question is about comparing what I have read from the files, with what is stored in the snowflake table already(milli...

Data Engineering

1389 Views
0 replies
0 kudos

09-05-2022 3:03:10 AM

by Anonymous • Not applicable

01-24-2022 5:14:47 PM

1450 Views
3 replies
0 kudos

The Next Databricks Office HoursOur next Office Hours session is scheduled for January 25, 2022 - 8:00 am PDT Do you have questions about how to set u...

The Next Databricks Office HoursOur next Office Hours session is scheduled for January 25, 2022 - 8:00 am PDTDo you have questions about how to set up or use Databricks? Do you want to get best practices for deploying your use case or tips on data ar...

Data Engineering

1450 Views
3 replies
0 kudos

01-24-2022 5:14:47 PM

View Replies

Latest Reply

Hanna0805050
New Contributor II

09-05-2022 1:50:28 AM

0 kudos

Thank you for the opportunity to communicate. I work at https://www.eliteimagingsystems.com/ and know how important it is for our customers to be able to communicate with us 24/7.

0 kudos

09-05-2022 1:50:28 AM

2 More Replies

by jwilliam • Contributor

08-30-2022 9:25:48 PM

4420 Views
3 replies
4 kudos

Resolved! What is the maximum of concurrent streaming jobs for a cluster?

What is the maximum of concurrent streaming jobs for a cluster? How can I have the right amount of concurrent streaming jobs for different cluster configuration?Should I use multiple cluster for different jobs or combine it into a big cluster to hand...

Data Engineering

4420 Views
3 replies
4 kudos

08-30-2022 9:25:48 PM

View Replies

Latest Reply

Prabakar
Databricks Employee

09-01-2022 5:22:05 AM

4 kudos

Hi @John William it would be better to use different clusters for each streaming jobs.

4 kudos

09-01-2022 5:22:05 AM

2 More Replies

by DipakBachhav • New Contributor III

07-15-2022 2:57:55 PM

15891 Views
5 replies
1 kudos

How to store SQL query result data on a local disk?

I am a newbie to data bricks and trying to write results into the excel/ CSV file using the below command but getting DataFrame' object has no attribute 'to_csv' errors while executing.I am using a notebook to execute my SQL queries and now want to s...

Data Engineering

15891 Views
5 replies
1 kudos

07-15-2022 2:57:55 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 1:23:07 AM

1 kudos

Hi there @Dipak Bachhav Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from yo...

1 kudos

09-05-2022 1:23:07 AM

4 More Replies

by Ruby8376 • Valued Contributor

08-30-2022 1:40:47 PM

4814 Views
8 replies
1 kudos

Resolved! Anti pattern : moving data from cloud to on-prem

Hi there,In my current project, Current status: Az databricks streaming jobs migrate Json file from kafka to raw layer(parquet file), then parsing logic is applied and 8 tables are created in raw standardized layer.Requirement: Business team wants to...

Data Engineering

4814 Views
8 replies
1 kudos

08-30-2022 1:40:47 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

08-31-2022 12:04:29 AM

1 kudos

You could indeed use ADF to copy the data from cloud to on-prem.However, depending on the size of the data, this can take a while.I use the same pattern, but for aggregated processed data, which is not an issue at all.You could also look at Azure Syn...

1 kudos

08-31-2022 12:04:29 AM

7 More Replies

by MrT • New Contributor II

07-14-2022 10:49:00 PM

3738 Views
3 replies
3 kudos

Python databricks-sql-connector TLS issue - client tries to negotiate v1 which fails many times then randomly tries to negotiate v1.3 which works

This issue is oddly only on an Azure Windows 10 VM. I Dont have this on my workstation or my personal computer so it seems to be host config related. The VM where the issue is i have a simple python script that connects to the Azure Databricks SQL en...

Data Engineering

3738 Views
3 replies
3 kudos

07-14-2022 10:49:00 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-04-2022 10:53:26 PM

3 kudos

Hello @Wayne Theron Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Th...

3 kudos

09-04-2022 10:53:26 PM

2 More Replies

by Karthe • New Contributor III

07-14-2022 9:42:32 PM

21391 Views
4 replies
2 kudos

I would like to access S3 data in databricks

Hi all,I am new to the databricks. I am trying to get the data from S3. The video tutoirals from the streaming platforms are accessing via access ID and secret access key. However, databricks is throwing a different options. I dont know what to fill...

Data Engineering

21391 Views
4 replies
2 kudos

07-14-2022 9:42:32 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-04-2022 10:52:14 PM

2 kudos

Hi @Karthikeyan Palanisamy Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from...

2 kudos

09-04-2022 10:52:14 PM

3 More Replies

by Cano • New Contributor III

08-12-2022 9:37:32 AM

16806 Views
15 replies
0 kudos

Connecting Databricks Spark Cluster to Postgresql RDS Instance

I am trying to connect my Spark cluster to a Postgresql RDS instance. The Python notebook code that was used is seen below:df = ( spark.read \ .format("jdbc") \ .option("url", "jdbc:postgresql://<connection-string>:5432/database”)\ .option("dbt...

Data Engineering

16806 Views
15 replies
0 kudos

08-12-2022 9:37:32 AM

View Replies

Latest Reply

User16873043099
Contributor

08-12-2022 9:56:15 AM

0 kudos

"Caused by: java.net.SocketTimeoutException: connect timed out" indicate the network connection between Databricks cluster and the postgress database on 5432 port was not established and eventually timed out.As a first step, please ensure the connect...

0 kudos

08-12-2022 9:56:15 AM

14 More Replies

by aj19 • New Contributor

07-14-2022 9:22:57 AM

5285 Views
1 replies
0 kudos

How to trigger Azure Logic App from Azure Databricks?

I have an Azure Logic app which triggers whenever a HTTP Post request is received. I want to send this request from my notebook present in Azure Databricks workspace using scala and spark. Is it possible? If yes, then please guide on how to do it. T...

Data Engineering

5285 Views
1 replies
0 kudos

07-14-2022 9:22:57 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-04-2022 7:07:42 AM

0 kudos

Hey there @Ayushri Jain Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from yo...

0 kudos

09-04-2022 7:07:42 AM

by andrej • New Contributor II

07-14-2022 7:28:02 AM

3207 Views
4 replies
1 kudos

Partition pruning with generated columns

I have a large table which contains a date_time column.The table contains 2 generated columns year, and month which are extracted from the date_time values and are used for partitioning.I have the following question.If I run the querySELECT *FROM tab...

Data Engineering

3207 Views
4 replies
1 kudos

07-14-2022 7:28:02 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-04-2022 7:04:54 AM

1 kudos

Hi @Andrej Znidarsic Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.T...

1 kudos

09-04-2022 7:04:54 AM

3 More Replies

by VikasSinha • New Contributor

07-13-2022 11:45:55 PM

5956 Views
2 replies
0 kudos

Which is better - Azure Databricks or GCP Databricks?

Which cloud hosting environment is best to use for Databricks? My question pins down to the fact that there must be some difference between the latency, throughput, result consistency & reproducibility between different cloud hosting environments of ...

Data Engineering

5956 Views
2 replies
0 kudos

07-13-2022 11:45:55 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-03-2022 11:59:30 PM

0 kudos

Hi @Vikas Sinha Does @Prabakar Ammeappin response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

0 kudos

09-03-2022 11:59:30 PM

1 More Replies

Databricks Community

Forum Posts

Resolved! Do compute resources get removed after not being used for x number of days?

Unable to read data from Elasticsearch with spark in Databricks.

Connect to SSAS

Bulk removal of inactive users?

SCD type2 snowflake

The Next Databricks Office HoursOur next Office Hours session is scheduled for January 25, 2022 - 8:00 am PDT Do you have questions about how to set u...

Resolved! What is the maximum of concurrent streaming jobs for a cluster?

How to store SQL query result data on a local disk?

Resolved! Anti pattern : moving data from cloud to on-prem

Python databricks-sql-connector TLS issue - client tries to negotiate v1 which fails many times then randomly tries to negotiate v1.3 which works

I would like to access S3 data in databricks

Connecting Databricks Spark Cluster to Postgresql RDS Instance

How to trigger Azure Logic App from Azure Databricks?

Partition pruning with generated columns

Which is better - Azure Databricks or GCP Databricks?

Join Us as a Local Community Builder!

Cognito as IdP provider for Delta Share

How to Retrieve the spark.statistics.createdAt Whe...

Not able to find lab for Data Engineering Learning...

Lakeflow Connect - Postgres connector

Prakash Hinduja Switzerland (Swiss) How do I build...