Data Engineering

Forum Posts

by User16752239289 • Valued Contributor

07-29-2021 11:57:28 AM

922 Views
1 replies
1 kudos

Resolved! Tensorboard Profiler did not work on DBR 8.4 ML

The TensorBoard profiler board did not work. It shows "loading data" forever.

Latest Reply

User16752239289
Valued Contributor

07-29-2021 1:14:06 PM

1 kudos

This is due to an issue reported here: https://github.com/tensorflow/profiler/issues/344. DBR 8.4 ML comes with TensorFlow 2.5, and the latest version of tensorboard-plugin-profile is 2.4. To work around the issue, you can add the option --load_fast=false...
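A minimal sketch of the workaround mentioned in this reply, assuming TensorBoard is started from the command line. The helper name `tensorboard_command`, the log directory, and the port are hypothetical; only the `--load_fast=false` option comes from the reply, and the way it is passed on a Databricks cluster may differ.

```python
import shlex


def tensorboard_command(logdir, port=6006):
    """Build a TensorBoard command line with the fast data loader turned off."""
    return [
        "tensorboard",
        "--logdir", logdir,
        "--port", str(port),
        # Option named in the reply as the workaround for the profiler issue.
        "--load_fast=false",
    ]


# Hypothetical log directory, used only for illustration.
cmd = tensorboard_command("/tmp/tb_logs")
print(shlex.join(cmd))
```

The list form can be handed to `subprocess.run(cmd)` on a machine where TensorBoard is installed.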

by rami1 • New Contributor II

07-29-2021 7:56:45 AM

530 Views
0 replies
0 kudos

Databricks Write Performance

I have a requirement to replay ingestion from landing data and build a silver table. I am trying to write Delta files from raw Avro files in the landing zone. The raw files are located in folders by date. I am currently using streaming to read d...

by TyronZerafa • New Contributor II

07-29-2021 7:07:08 AM

1190 Views
0 replies
2 kudos

Integrating with Prometheus

How can I integrate Databricks clusters with Prometheus? I tried adding the following Spark property to my cluster but cannot find the Prometheus metrics endpoints. Any thoughts? spark.ui.prometheus.enabled = true

by siddesh • New Contributor

07-29-2021 5:21:49 AM

523 Views
0 replies
0 kudos

How to import a Python notebook from S3

I need to import a Python notebook stored in S3.

by SH1966 • New Contributor II

07-28-2021 6:25:04 PM

374 Views
0 replies
1 kudos

Can I duplicate a Delta table just by copying/pasting the whole table folder?

by AbhishekBreeks • New Contributor II

07-28-2021 7:21:57 AM

544 Views
0 replies
0 kudos

Host a Star Schema Data Warehouse on Azure Databricks

Hello, is it a good idea to host a star schema data warehouse on the Azure Databricks database itself? Usually we use Azure Databricks to prep the data and then host it on Azure SQL Database. However, the question is: can we not host the data on Azure Databricks i...

by WhatIsHappening • New Contributor

07-28-2021 6:47:43 AM

465 Views
0 replies
0 kudos

Pandas Forward Fill Based on Keyword

Hello! I am trying to forward fill a column in a Pandas dataframe based on a keyword. I have come up with: pdf_df['EEName_TEST'] = pdf_df['EEName_TEST'].str.contains('Name:').ffill() This gives me a boolean result but I still can't figure out what ...

by Jessy • New Contributor

07-28-2021 6:11:06 AM

457 Views
0 replies
0 kudos

How to send parameter to a widget in called notebook from calling notebook?

I have an Azure notebook that takes a widget input parameter and performs the necessary action. This notebook should be called from another notebook using dbutils.notebook.run. How do I pass a parameter to the widget?
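One possible approach, sketched under assumptions: `dbutils.notebook.run` accepts a notebook path, a timeout in seconds, and a dictionary of arguments, and each argument whose key matches a widget name in the called notebook sets that widget's value. The helper name `run_with_widget_values`, the notebook path, and the widget name below are hypothetical.

```python
def run_with_widget_values(dbutils, notebook_path, widget_values, timeout_seconds=600):
    """Run another notebook and pass values for its widgets.

    dbutils is passed in explicitly so the helper can be exercised outside a
    notebook; inside a Databricks notebook it is already defined.
    """
    # Widget values are strings, so convert every argument before the call.
    arguments = {name: str(value) for name, value in widget_values.items()}
    return dbutils.notebook.run(notebook_path, timeout_seconds, arguments)


# In the calling notebook (hypothetical path and widget name):
#   result = run_with_widget_values(dbutils, "/Shared/child", {"run_date": "2021-07-28"})
# In the called notebook, the value is read back with:
#   dbutils.widgets.get("run_date")
```

The dictionary keys have to match the widget names defined in the called notebook for the values to arrive.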

by stramzik • New Contributor II

07-25-2021 10:24:26 PM

894 Views
1 replies
1 kudos

Unable to mount Data Lake Gen1 to Databricks

I was mounting Data Lake Gen1 to Databricks to access and process files. The code below worked great for the past year, and all of a sudden I'm getting an error. configs = {"df.adl.oauth2.access.token.provider.type": "ClientCredential"...

Latest Reply

stramzik
New Contributor II

07-28-2021 4:01:43 AM

1 kudos

bumping up the thread

by RiyazAli • Contributor III

07-26-2021 5:59:41 AM

778 Views
0 replies
1 kudos

Unable to subset the data using SparkR, using piping convention to execute the commands

I'm operating on some data that looks like the image attached. The command that I'm running is: library(magrittr) subsetting the data for MAC-OS & sorting by event-timestamp. acDF <- eventsDF %>% SparkR::select("device", "event_timestamp...

by akj2784 • New Contributor II

09-19-2019 12:11:21 AM

15175 Views
11 replies
1 kudos

How to connect PostgreSQL from Databricks

I am trying to connect PostgreSQL from Azure Databricks. I am using the below code to connect. jdbcHostname = "Test" jdbcPort = 1234 jdbcDatabase = "Test1" jdbcUrl = "jdbc:postgresql://{0}:{1}/{2}".format(jdbcHostname, jdbcPort, jdbcDatabase) Conn...

Latest Reply

Anonymous
Not applicable

02-13-2020 1:38:15 AM

1 kudos

@Javier De La Torre do you really need two-way SSL (verify-full)? In most cases one way SSL (sslmode=require) should be enough. @akj2784 When you say "Connection was successful", where do you mean you established a successful connection? You might...

10 More Replies
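A small sketch related to the thread above, assuming the PostgreSQL JDBC driver's `sslmode` connection parameter is appended to the URL as a query string. It reuses the placeholder host, port, and database from the question; the helper name `build_jdbc_url` is hypothetical.

```python
def build_jdbc_url(host, port, database, sslmode=None):
    """Build a PostgreSQL JDBC URL, optionally requesting an SSL mode."""
    url = "jdbc:postgresql://{0}:{1}/{2}".format(host, port, database)
    if sslmode is not None:
        # "require" is the one-way SSL mode suggested in the latest reply.
        url += "?sslmode={0}".format(sslmode)
    return url


# Placeholder values taken from the question.
jdbcUrl = build_jdbc_url("Test", 1234, "Test1", sslmode="require")
print(jdbcUrl)
```

Leaving `sslmode` unset reproduces the URL from the original question unchanged.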

by johnsnowZX0298 • New Contributor

07-24-2021 3:44:36 AM

821 Views
0 replies
0 kudos

Is it possible to dynamically create jobs?

Say I have two notebooks A and B. Notebook A generates data for notebook B to process. However, I want multiple B to process the data concurrently. Is this possible?

by alecdavis47 • New Contributor

07-23-2021 9:39:29 PM

363 Views
0 replies
0 kudos

databricks-connect without using cluster

Those of you who use databricks-connect probably know that it’s a great tool for using the power of Spark/Databricks while executing and debugging code (with proper git integration) from your favorite IDE. However, when you want to test somethin...

by ShivamRunthala • New Contributor

07-22-2021 9:13:14 AM

1551 Views
0 replies
0 kudos

All applications stuck in Waiting State on Standalone Spark Cluster

Spark Standalone Cluster Configuration (Spark 3.0.0): 1 Master, 2 Workers (4 cores each). I am using Airflow SparkSubmitOperator to submit the job to the Spark Master in cluster mode. There are multiple (~20) DAGs on Airflow submitting jobs to Spark. These ...

by nolanreilly • New Contributor

07-22-2021 7:08:02 AM

387 Views
0 replies
0 kudos

Impossible to read a custom pipeline? (Scala)

I have created a custom transformer to be used in an ML pipeline. I was able to write the pipeline to storage by extending the transformer class with DefaultParamsWritable. Reading the pipeline back in, however, does not seem possible in Scala. I have...
