Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

DK03
by Contributor
  • 1364 Views
  • 2 replies
  • 2 kudos
Latest Reply
UmaMahesh1
Honored Contributor III
  • 2 kudos

As @Werner Stinckens said, it would be OK. Generally, though, joins on decimal columns are not recommended, because other factors such as precision and length come into play... Also, when you are joining on decimal columns, be sure to check the abs value of...

1 More Replies
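The original question in this thread is not shown in the listing, but the reply concerns joining on decimal columns. A minimal PySpark sketch of the idea, with hypothetical DataFrames and column names, aligning precision explicitly and optionally comparing absolute differences instead of exact equality:

```python
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import DecimalType

spark = SparkSession.builder.getOrCreate()

# Hypothetical frames with decimal join keys (names are illustrative only)
orders = spark.createDataFrame([(1, "10.250")], ["id", "amount"]) \
    .withColumn("amount", F.col("amount").cast(DecimalType(10, 3)))
payments = spark.createDataFrame([(1, "10.25")], ["id", "amount"]) \
    .withColumn("amount", F.col("amount").cast(DecimalType(10, 2)))

# Align precision/scale explicitly before joining on the decimal column
joined = orders.join(
    payments.withColumn("amount", F.col("amount").cast(DecimalType(10, 3))),
    on=["id", "amount"],
    how="inner",
)

# Alternative: join on the id and compare the absolute difference
# instead of requiring exact decimal equality
tolerant = orders.alias("o").join(payments.alias("p"), "id") \
    .where(F.abs(F.col("o.amount") - F.col("p.amount")) < 0.001)
```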
fury88
by New Contributor II
  • 1318 Views
  • 1 replies
  • 1 kudos

Does CACHE TABLE/VIEW have a CREATE OR REPLACE option, like a view?

I'm trying to cache data/queries that we normally have as temporary views, which get replaced when the code is run based on dynamic Python. What I'd like to know is: will CACHE TABLE get overwritten each time you run it? Is it smart enough to recognize ...

Latest Reply
UmaMahesh1
Honored Contributor III
  • 1 kudos

Hi @Matt Fury Yes... I believe the cache is overwritten each time you run it, because for me it took nearly the same amount of time to cache 1 million records on each run. However, you can check whether the table is cached or not using the .storageLevel method. E.g. I have...

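A minimal PySpark sketch of the approach in the reply above: re-issue CACHE TABLE after the view is replaced, then verify the cache state through the catalog and .storageLevel (the view name is a placeholder):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical temp view that gets rebuilt by dynamic Python code
spark.range(1_000_000).createOrReplaceTempView("my_view")

# CACHE TABLE can simply be re-issued after the view is replaced;
# the new definition is what ends up cached.
spark.sql("CACHE TABLE my_view")

# Check whether it is cached
print(spark.catalog.isCached("my_view"))    # True / False
print(spark.table("my_view").storageLevel)  # e.g. Disk Memory Deserialized 1x Replicated

# Drop the cached entry explicitly if needed
spark.sql("UNCACHE TABLE my_view")
```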
Durbinar
by New Contributor III
  • 3448 Views
  • 4 replies
  • 4 kudos

Resolved! Azure Databricks Default DNS

My Azure Databricks workspace's default DNS is 168.63.129.16. This DNS doesn't seem to resolve Azure storage accounts that were created a year ago; after tweaking the cluster to use 8.8.8.8, it is able to resolve the desired storage accounts. Is there a d...

Latest Reply
Durbinar
New Contributor III
  • 4 kudos

IP address 168.63.129.16 is a virtual public IP address that is used to facilitate a communication channel to Azure platform resources. Customers can define any address space for their private virtual network in Azure. Therefore, the Azure platform...

3 More Replies
200723
by New Contributor II
  • 1893 Views
  • 4 replies
  • 4 kudos

"No SRV records" intermittent error when running Databricks Pyspark to connect Mongo Atlas

My Mongo Atlas connection URL is like mongodb+srv://<srv_hostname>. I don't want to use a direct URL like mongodb://<hostname1, hostname2, hostname3....> because our Mongo Atlas global clusters have many hosts and it would be hard to maintain. Our Java programs...

Latest Reply
Noopur_Nigam
Valued Contributor II
  • 4 kudos

Hi @Raymond Lai The issue looks to be in the MongoDB connector. The connection is created and maintained by the mongo-spark connector. You can try using the direct mongodb hosts in the connection string instead of SRV to avoid doing DNS lookups, or...

3 More Replies
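A hedged sketch of what the reply suggests: reading from Mongo with direct hosts instead of an SRV record. The hostnames, credentials, and database/collection names are placeholders, and the option names differ between mongo-spark connector versions:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical direct connection string (hosts/credentials are placeholders);
# listing hosts explicitly avoids the SRV DNS lookup mentioned in the reply.
direct_uri = (
    "mongodb://user:password@host1.example.net:27017,"
    "host2.example.net:27017,host3.example.net:27017/"
    "mydb?replicaSet=atlas-shard-0&tls=true&authSource=admin"
)

# Option names depend on the connector version: mongo-spark v10.x uses
# "connection.uri"; the older v3.x connector uses "spark.mongodb.input.uri".
df = (
    spark.read.format("mongodb")          # "mongo" for the v3.x connector
    .option("connection.uri", direct_uri)
    .option("database", "mydb")
    .option("collection", "mycollection")
    .load()
)
df.printSchema()
```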
Dicer
by Valued Contributor
  • 5811 Views
  • 5 replies
  • 7 kudos

Is it reasonable for the process "Determining the location of DBIO file fragments." to take me 7 hours?

I only have 1,000 columns. Each column has 252 rows, so there are only 252,000 data points. How can routing tasks for the best cache locality take 7 hours?

Latest Reply
Noopur_Nigam
Valued Contributor II
  • 7 kudos

Hi @Cheuk Hin Christophe Poon Have you optimized your table at any point since its creation? If not, OPTIMIZE may take some time depending on the number of underlying files. Please try running OPTIMIZE manually as described in the document below: https://docs....

4 More Replies
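A short sketch of the suggestion in the reply: run OPTIMIZE manually on the Delta table. The table name and ZORDER column are placeholders:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical Delta table name; OPTIMIZE compacts small files, which can
# reduce the work done while locating file fragments.
spark.sql("OPTIMIZE my_catalog.my_schema.my_table")

# Optionally co-locate data that is frequently filtered on
spark.sql("OPTIMIZE my_catalog.my_schema.my_table ZORDER BY (event_date)")

# Inspect how many files back the table before/after
spark.sql("DESCRIBE DETAIL my_catalog.my_schema.my_table") \
    .select("numFiles", "sizeInBytes") \
    .show()
```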
shrutis23
by New Contributor III
  • 3104 Views
  • 5 replies
  • 4 kudos

How to use Delta Live Tables with Google Cloud Storage

Hi Team, I have been working on a POC exploring Delta Live Tables with a GCS location. I have some doubts: how do we access the GCS bucket? We have a connection established using a Databricks service account. In normal cluster creation, we go to the cluster page...

Latest Reply
Senthil1
Contributor
  • 4 kudos

Kindly mount the GCS cloud storage to a DBFS location; see below: Mounting cloud object storage on Databricks | Databricks on Google Cloud

4 More Replies
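A rough sketch of the mount approach in the reply, assuming the cluster already has a Google service account with access to the bucket. dbutils and display are Databricks notebook built-ins, and the bucket/mount names are placeholders:

```python
# Assumes the cluster was created with a Google service account that can
# read the bucket (bucket and mount names below are placeholders).
bucket_name = "my-dlt-source-bucket"
mount_name = "dlt_source"

# Mount only if it is not already mounted
if not any(m.mountPoint == f"/mnt/{mount_name}" for m in dbutils.fs.mounts()):
    dbutils.fs.mount(f"gs://{bucket_name}", f"/mnt/{mount_name}")

display(dbutils.fs.ls(f"/mnt/{mount_name}"))

# A DLT table could then read from the mount (or directly from gs://...):
# @dlt.table
# def raw_events():
#     return spark.readStream.format("cloudFiles") \
#         .option("cloudFiles.format", "json") \
#         .load(f"/mnt/{mount_name}/events")
```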
SS2
by Valued Contributor
  • 3510 Views
  • 4 replies
  • 3 kudos

Spark out of memory error

You can resolve this error by increasing the size of the cluster in Databricks.

Latest Reply
DK03
Contributor
  • 3 kudos

Adding some more points to @karthik p's answer: use the Kryo serializer instead of the Java serializer, use an optimised garbage collector such as G1GC, and partition wisely on a field.

3 More Replies
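An illustrative sketch of the settings mentioned in the reply (Kryo serializer, G1GC, partitioning on a field). The values and table names are placeholders, and on Databricks the serializer and JVM options are normally set in the cluster's Spark config rather than in notebook code:

```python
from pyspark.sql import SparkSession

# Illustrative values only; on Databricks these are usually entered in the
# cluster's "Spark config" box, e.g.:
#   spark.serializer org.apache.spark.serializer.KryoSerializer
#   spark.executor.extraJavaOptions -XX:+UseG1GC
spark = (
    SparkSession.builder
    .config("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
    .config("spark.executor.extraJavaOptions", "-XX:+UseG1GC")
    .getOrCreate()
)

# Partition on a field that is commonly filtered or joined on,
# so each task works on a bounded slice of the data.
df = spark.read.table("my_catalog.my_schema.events")   # hypothetical table
df.repartition("event_date").write.partitionBy("event_date") \
    .mode("overwrite").saveAsTable("my_catalog.my_schema.events_partitioned")
```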
cchiulan
by New Contributor III
  • 1948 Views
  • 3 replies
  • 7 kudos

Databricks Log4J Custom Appender Not Working as expected

I'm trying to figure out how a custom appender should be configured in a Databricks environment, but I cannot figure it out. When the cluster is running, in `driver logs`, the time is displayed as 'unknown' for my custom log file, and when the cluster is stopped, c...

Latest Reply
Wolf
New Contributor II
  • 7 kudos

We're having the same problem with 11.3 LTS. Are there any updates? We would like to deliver log4j messages from Databricks Notebooks to custom log files and then upload those to S3 or DBFS. Best

2 More Replies
Mado
by Valued Contributor II
  • 24903 Views
  • 3 replies
  • 10 kudos

Resolved! How to get all occurrences of duplicate records in a PySpark DataFrame based on specific columns?

Hi, I need to find all occurrences of duplicate records in a PySpark DataFrame. The following is the sample dataset: # Prepare Data data = [("A", "A", 1), ("A", "A", 2), ("A", "A", 3), ("A", "B", 4), ("A", "B", 5), ("A", "C", ...

Latest Reply
NhatHoang
Valued Contributor II
  • 10 kudos

Hi, in my experience, if you use dropDuplicates(), Spark will keep an arbitrary row. Therefore, you should define your own logic to remove duplicated rows.

2 More Replies
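A minimal PySpark sketch of one way to return all occurrences of duplicates on specific columns, since dropDuplicates() keeps an arbitrary row as the reply notes. The column names are assumed, because the sample data in the listing is truncated:

```python
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.window import Window

spark = SparkSession.builder.getOrCreate()

# Sample data from the question (truncated in the listing, so partially assumed)
data = [("A", "A", 1), ("A", "A", 2), ("A", "A", 3),
        ("A", "B", 4), ("A", "B", 5), ("A", "C", 6)]
df = spark.createDataFrame(data, ["col1", "col2", "col3"])

# Count rows per (col1, col2) and keep every row whose group appears more than once
w = Window.partitionBy("col1", "col2")
duplicates = df.withColumn("cnt", F.count("*").over(w)) \
               .filter("cnt > 1") \
               .drop("cnt")
duplicates.show()
```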
Shalabh007
by Honored Contributor
  • 3389 Views
  • 5 replies
  • 19 kudos

Practice Exams for Databricks Certified Data Engineer Professional exam

Can anyone help with an official practice exam set for the Databricks Certified Data Engineer Professional exam, like the one we have for the Associate certification: Practice exam for the Databricks Certified Data Engineer Associate exam

Latest Reply
Nayan7276
Valued Contributor II
  • 19 kudos

Hi @Shalabh Agarwal, I am not able to find any official practice paper. It is still not available.

4 More Replies
AnubhavG
by Contributor
  • 1843 Views
  • 1 replies
  • 2 kudos

External APIs

Does Databricks provide a way to integrate with external software/APIs, whether in the form of a UDF or an external function? Can somebody point me to how this can be achieved? My use case is to talk to external APIs from Databricks to perform certain operation...

Latest Reply
daniel_sahal
Esteemed Contributor
  • 2 kudos

You can write your own code to fetch data from an external API. Example: https://insightsndata.com/how-to-call-rest-api-store-data-in-databricks-8383f2458d7d

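A hedged sketch of the do-it-yourself approach in the reply: call a REST API with requests and land the response in a Delta table. The endpoint, secret scope, and table name are placeholders, and the payload is assumed to be a list of records:

```python
import requests
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical endpoint; in practice keep tokens in a Databricks secret scope
API_URL = "https://api.example.com/v1/items"
TOKEN = dbutils.secrets.get(scope="my-scope", key="api-token")  # notebook built-in

resp = requests.get(API_URL, headers={"Authorization": f"Bearer {TOKEN}"}, timeout=30)
resp.raise_for_status()

# Turn the JSON payload (assumed to be a list of records) into a DataFrame
df = spark.createDataFrame(resp.json())
df.write.mode("append").saveAsTable("my_catalog.my_schema.api_items")
```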
Ruby8376
by Valued Contributor
  • 2264 Views
  • 5 replies
  • 0 kudos

Resolved! Is there a way to get CDC data from Salesforce to Databricks? Can a smart pipeline be built to get near-real-time data from Salesforce into Delta Lake?

Currently, we have a daily batch running to extract data from Salesforce into a CSV file (ADLS), which is further copied to Delta tables for transformation. We are now looking to implement a solution that can extract real-time data changes on Salesforce ...

Latest Reply
daniel_sahal
Esteemed Contributor
  • 0 kudos

On Azure you can try using the SAP CDC connector for Data Factory: https://learn.microsoft.com/en-us/azure/data-factory/sap-change-data-capture-introduction-architecture

4 More Replies
Himanshi
by New Contributor III
  • 1074 Views
  • 1 replies
  • 6 kudos

How to exclude existing files when moving a streaming job from one Databricks workspace to another workspace that may not be compatible with the existing checkpoint state for resuming stream processing?

We do not want to process all the old files; we only want to process the latest files. Whenever we use a new checkpoint path in another Databricks workspace, the streaming job processes all the old files as well. Without the Auto Loader feature, is there ...

Latest Reply
Shalabh007
Honored Contributor
  • 6 kudos

@Himanshi Patle In Spark Streaming there is an option, maxFileAge, with which you can control which files are processed based on their timestamp.

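A rough sketch of the maxFileAge option mentioned in the reply, applied to a plain file streaming source. The paths, schema, and table name are placeholders, and the exact first-batch semantics of maxFileAge should be verified for your Spark version:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical source path; per the reply, maxFileAge limits which files the
# file source considers based on their age, which can help avoid replaying
# very old files when starting with a fresh checkpoint.
stream = (
    spark.readStream.format("json")
    .schema("id STRING, ts TIMESTAMP")   # file sources need an explicit schema
    .option("maxFileAge", "1d")
    .load("dbfs:/mnt/raw/events/")
)

query = (
    stream.writeStream
    .option("checkpointLocation", "dbfs:/mnt/checkpoints/events_new_workspace/")
    .toTable("my_catalog.my_schema.events")
)
```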
Harun
by Honored Contributor
  • 2387 Views
  • 1 replies
  • 1 kudos

How to change the number of executor instances in Databricks

I know that Databricks runs one executor per worker node. Can I change the number of executors by adding params (spark.executor.instances) in the cluster's advanced options? And can I also pass this parameter when I schedule a task, so that a particular task wi...

Latest Reply
karthik_p
Esteemed Contributor
  • 1 kudos

@Harun Raseed Basheer Usually there is 1 executor per worker node. If we need to split that executor within the worker node itself, we can do that based on the memory each core has been assigned; the configs below can be used: spark.executor.cores, spa...

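An illustrative sketch of the executor-related configs referenced in the reply. The values are placeholders, and on Databricks they are normally supplied through the cluster's Spark config or a job cluster's spark_conf block rather than in notebook code:

```python
# Illustrative values only; executors are sized at cluster start, so these
# settings belong in the cluster's "Spark config" box or a job cluster definition.
spark_conf = {
    "spark.executor.cores": "2",       # cores per executor
    "spark.executor.memory": "8g",     # heap per executor
    "spark.executor.instances": "4",   # total executors requested
}

# Example of passing the same settings in a hypothetical job cluster payload
# (field names follow the Databricks REST API; node type and runtime are placeholders):
new_cluster = {
    "spark_version": "11.3.x-scala2.12",
    "node_type_id": "Standard_DS4_v2",
    "num_workers": 2,
    "spark_conf": spark_conf,
}
```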
AdamRink
by New Contributor III
  • 1515 Views
  • 2 replies
  • 6 kudos

How to limit batch size from Confluent Kafka

I have a large stream of data read from Confluent Kafka, 500+ million rows. When I initialize the stream I cannot control the batch sizes that are read. I've tried setting options on the readStream - maxBytesPerTrigger, maxOffsetsPerTrigger, fetc...

Latest Reply
UmaMahesh1
Honored Contributor III
  • 6 kudos

Hi @Adam Rink Just checking for further info on your question: how are you deducing that the batch sizes are larger than what you are providing as maxOffsetsPerTrigger?

1 More Replies
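A hedged sketch of capping micro-batch size with maxOffsetsPerTrigger and checking the actual per-batch row count with foreachBatch (which speaks to the follow-up question in the reply). The bootstrap servers, topic, and checkpoint path are placeholders, and SASL credentials are omitted:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Hypothetical Confluent endpoint and topic; maxOffsetsPerTrigger caps the
# number of offsets consumed per micro-batch across all partitions.
raw = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "pkc-xxxxx.confluent.cloud:9092")
    .option("kafka.security.protocol", "SASL_SSL")   # SASL credentials omitted
    .option("subscribe", "orders")
    .option("startingOffsets", "earliest")
    .option("maxOffsetsPerTrigger", 100000)          # ~100k offsets per batch
    .load()
)

# foreachBatch makes the actual per-batch row count easy to verify
def report_batch(batch_df, batch_id):
    print(f"batch {batch_id}: {batch_df.count()} rows")

query = (
    raw.writeStream
    .foreachBatch(report_batch)
    .option("checkpointLocation", "dbfs:/mnt/checkpoints/orders/")
    .start()
)
```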