Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

WWoman
by Contributor
  • 972 Views
  • 1 reply
  • 1 kudos

Identifying invalid views

Is there a way to identify all invalid views in a schema or catalog without querying each view to see if it succeeds?

Latest Reply
raphaelblg
Databricks Employee
  • 1 kudos

Hello @WWoman, I don't think there's a feature for that. If you think this would be a cool feature, you could submit an idea in the Databricks Ideas Portal.
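Since there is no built-in check, the brute-force probe the poster was hoping to avoid would look roughly like the sketch below; it assumes Unity Catalog, a notebook Spark session, and placeholder catalog/schema names.

    from pyspark.sql.utils import AnalysisException

    # Sketch: list views from the information schema, then probe each one.
    # my_catalog / my_schema are placeholders.
    views = [
        r["table_name"]
        for r in spark.sql(
            "SELECT table_name FROM my_catalog.information_schema.views "
            "WHERE table_schema = 'my_schema'"
        ).collect()
    ]

    invalid = []
    for v in views:
        try:
            # Analysis alone surfaces broken references; nothing is scanned.
            spark.sql(f"SELECT * FROM my_catalog.my_schema.`{v}` LIMIT 0")
        except AnalysisException as err:
            invalid.append((v, str(err).splitlines()[0]))

    for name, reason in invalid:
        print(name, "->", reason)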

NhanNguyen
by Contributor III
  • 1932 Views
  • 3 replies
  • 0 kudos

Resolved! Disk cache for csv file in Databricks

Dear team, I'm investigating how to improve performance when reading a large CSV file as input, and I found this: https://learn.microsoft.com/en-us/azure/databricks/optimizations/disk-cache. I just wonder: does the disk cache also apply to CSV files? Thanks!

Latest Reply
NhanNguyen
Contributor III
  • 0 kudos

Thanks @-werners-, that's right. I tried it and got a significant performance improvement.
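The thread is truncated, so the exact change isn't shown; one common pattern that matches the linked doc (the disk cache accelerates Parquet-based reads, which CSV is not) is to land the CSV as a Delta table once and read that instead. A rough sketch with placeholder paths and names:

    # One-time conversion: CSV -> Delta (Parquet under the hood).
    raw = (
        spark.read
        .option("header", "true")
        .option("inferSchema", "true")
        .csv("/mnt/landing/big_file.csv")
    )
    raw.write.format("delta").mode("overwrite").saveAsTable("my_catalog.my_schema.big_table")

    # Subsequent reads hit Parquet files and can be served from the disk cache.
    spark.table("my_catalog.my_schema.big_table").count()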

2 More Replies
saichandu_25
by New Contributor III
  • 4027 Views
  • 9 replies
  • 0 kudos

Not able to read the file content completely using head

Hi, we want to read the content of a file and encode it into base64. For that we have used the code below:
file_path = "/path/to/your/file.csv"
file_content = dbutils.fs.head(file_path, 512000000)
encode_content = base64.b64encode(file_conten...
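The excerpt is cut off, but a common way around the size limit of dbutils.fs.head is to read the file with plain Python instead. A sketch, assuming the file is reachable through the /dbfs/ FUSE mount (adjust the path for Volumes); the path is a placeholder:

    import base64

    # /dbfs/ is the driver-local mount of DBFS.
    local_path = "/dbfs/path/to/your/file.csv"

    with open(local_path, "rb") as f:   # binary read, not subject to head()'s cap
        file_content = f.read()

    encoded_content = base64.b64encode(file_content).decode("utf-8")
    print(len(encoded_content))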

Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

I am curious what the use case is for wanting to load large files into GitHub, which is a code repo. Depending on the file format, different parsing is necessary; you could foresee logic for that in your program.

8 More Replies
DataEngineer
by New Contributor II
  • 1767 Views
  • 2 replies
  • 0 kudos

AWS Email sending challenge from Databricks with UNITY CATALOG and Multinode cluster

Hi, I have implemented Unity Catalog with a multi-node cluster in Databricks. The workspace instance profile with EC2 access is also created in IAM, but we are still having a challenge sending emails from Databricks using the SES service. The same is working ...

Latest Reply
Babu_Krishnan
Contributor
  • 0 kudos

Hi @DataEngineer, were you able to resolve the issue? We are having the same issue when we try to use a multi-node cluster for Unity Catalog. Email functionality was working fine with a single-node cluster. We are getting "ConnectionRefusedError: [Errno 111]...
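The root cause isn't shown in the thread, but a "Connection refused" error can point at blocked SMTP traffic rather than credentials; a hedged alternative is to call the SES API over HTTPS with boto3, which authenticates through the instance profile. The region, addresses, and the required ses:SendEmail permission are assumptions:

    import boto3

    # Sketch: send through the SES API instead of SMTP. Assumes the cluster's
    # instance profile allows ses:SendEmail and the sender is verified in SES.
    ses = boto3.client("ses", region_name="us-east-1")

    response = ses.send_email(
        Source="sender@example.com",
        Destination={"ToAddresses": ["recipient@example.com"]},
        Message={
            "Subject": {"Data": "Test from Databricks"},
            "Body": {"Text": {"Data": "Hello from a multi-node Unity Catalog cluster."}},
        },
    )
    print(response["MessageId"])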

1 More Replies
Harispap
by New Contributor
  • 1108 Views
  • 0 replies
  • 0 kudos

Different result between manual and automated task run

I have a notebook where I fetch metadata about a previous task run from the API ".... /jobs/runs/get". The response should be a dictionary that contains information such as task key, run ID, run page URL, etc. When I run the notebook as part of ...
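For reference, a minimal call to that endpoint looks roughly like the sketch below; the host, token, and run_id are placeholders, and in a scheduled job the run_id would normally be passed in as a parameter rather than hard-coded:

    import requests

    host = "https://<workspace-host>"
    token = "<access-token>"
    run_id = 123456789   # placeholder

    resp = requests.get(
        f"{host}/api/2.1/jobs/runs/get",
        headers={"Authorization": f"Bearer {token}"},
        params={"run_id": run_id},
        timeout=30,
    )
    resp.raise_for_status()
    run = resp.json()
    print(run.get("run_page_url"), [t.get("task_key") for t in run.get("tasks", [])])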

stevenayers-bge
by Contributor
  • 2529 Views
  • 4 replies
  • 2 kudos

Bug: Shallow Clone `create or replace` causing [TABLE_OR_VIEW_NOT_FOUND]

I am having an issue where, when I do a shallow clone using: create or replace table `catalog_a_test`.`schema_a`.`table_a` shallow clone `catalog_a`.`schema_a`.`table_a`, I get: [TABLE_OR_VIEW_NOT_FOUND] The table or view catalog_a_test.schema_a.table_a...

Latest Reply
Omar_hamdan
Databricks Employee
  • 2 kudos

Hi Steven, this is really a strange issue. First, let's exclude some possible causes. We need to check the following: the permissions on table A and catalog B; take a look at the following link to check what permission is needed: https://docs.d...
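One quick thing to rule out alongside the permission check (a sketch, not a confirmed fix for this bug report) is that the target catalog and schema actually exist before the clone runs; the names below mirror the post and are placeholders:

    spark.sql("CREATE CATALOG IF NOT EXISTS catalog_a_test")
    spark.sql("CREATE SCHEMA IF NOT EXISTS catalog_a_test.schema_a")

    spark.sql("""
        CREATE OR REPLACE TABLE catalog_a_test.schema_a.table_a
        SHALLOW CLONE catalog_a.schema_a.table_a
    """)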

3 More Replies
gauravchaturved
by New Contributor II
  • 1569 Views
  • 1 reply
  • 1 kudos

Resolved! Can I delete specific partition from a Delta Live Table?

If I have created a Delta Live Table partitioned on a column (let's say a date column) from a stream source, can I delete the partition for specific date values later to save on cost and keep the table lean? If I can, then: 1- how do I do it? 2- do I...

Latest Reply
raphaelblg
Databricks Employee
  • 1 kudos

Hello @gauravchaturved , You can remove the partition by filtering it in your source code and triggering a full refresh in your pipeline. There is no need to run vacuum, as DLT has maintenance clusters that perform OPTIMIZE and VACUUM operations on y...
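A rough sketch of that approach, with placeholder table and column names: filter the unwanted dates out of the DLT source query, then trigger a full refresh so the table is rebuilt without those partitions.

    import dlt
    from pyspark.sql import functions as F

    @dlt.table(name="events_clean", partition_cols=["event_date"])
    def events_clean():
        # Rows for excluded dates disappear after a full refresh of the pipeline.
        return (
            spark.readStream.table("my_catalog.my_schema.events_raw")
            .where(F.col("event_date") >= "2024-01-01")
        )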

NarenderKumar
by New Contributor III
  • 3357 Views
  • 3 replies
  • 2 kudos

Unable to connect with Databricks Serverless SQL using Dbeaver

I am trying to connect to a Databricks serverless SQL warehouse using DBeaver as mentioned in the documentation below: https://learn.microsoft.com/en-us/azure/databricks/dev-tools/dbeaver. I am trying to use the browser-based authentication, i.e. (OAuth user-to-...
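For reference, the JDBC URL shape the Databricks JDBC driver expects for browser-based OAuth (user-to-machine) is roughly the following; the hostname and HTTP path are placeholders, and the exact flags are worth double-checking against the driver documentation:

    jdbc:databricks://<server-hostname>:443;httpPath=/sql/1.0/warehouses/<warehouse-id>;AuthMech=11;Auth_Flow=2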

Latest Reply
binsel
New Contributor III
  • 2 kudos

I'm having the same problem. Any update?

2 More Replies
youcanlearn
by New Contributor III
  • 2688 Views
  • 3 replies
  • 2 kudos

Resolved! Databricks Expectations

In the example at https://docs.databricks.com/en/delta-live-tables/expectations.html#fail-on-invalid-records, it says that one is able to query the DLT event log for such expectation violations. In Databricks, I can use expectations to fail or drop r...

Latest Reply
brockb
Databricks Employee
  • 2 kudos

That's right, the "reason" would be "x1 is negative" in your example and "valid_max_length" in the example JSON payload that I shared. If you are looking for a descriptive reason, you would name the expectation accordingly, such as: @Dlt.expect_or_fail...
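A minimal sketch of that naming idea (table and source names are placeholders): the expectation's name is what surfaces in the event log, so make it descriptive.

    import dlt

    @dlt.table(name="orders_validated")
    @dlt.expect_or_fail("x1_is_not_negative", "x1 >= 0")
    def orders_validated():
        return spark.read.table("my_catalog.my_schema.orders_raw")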

2 More Replies
guizsantos
by New Contributor II
  • 2299 Views
  • 2 replies
  • 3 kudos

Resolved! How to obtain a query profile programmatically?

Hi everyone! Does anyone know if there is a way to obtain the data used to create the graph shown in the "Query profile" section? Particularly, I am interested in the rows produced by the intermediate query operations. I can see there is a "Download" ...

Latest Reply
guizsantos
New Contributor II
  • 3 kudos

Hey @raphaelblg, thanks for your input! I understand that some info may be obtained from the `EXPLAIN` command; however, the output is not very clear on its meaning and definitely does not provide what is most interesting to us, which is the rows proces...
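For what it's worth, query-level metrics (not per-operator row counts) can be pulled programmatically from the SQL query history endpoint; a sketch with placeholder host and token, and with response fields worth verifying against the API docs:

    import requests

    host = "https://<workspace-host>"
    token = "<access-token>"

    resp = requests.get(
        f"{host}/api/2.0/sql/history/queries",
        headers={"Authorization": f"Bearer {token}"},
        params={"include_metrics": "true", "max_results": 10},
        timeout=30,
    )
    resp.raise_for_status()
    for q in resp.json().get("res", []):
        print(q.get("query_id"), q.get("metrics"))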

1 More Replies
Sambit_S
by New Contributor III
  • 4141 Views
  • 8 replies
  • 0 kudos

Databricks Autoloader File Notification Not Working As Expected

Hello everyone, in my project I am using Databricks Auto Loader to incrementally and efficiently process new data files as they arrive in cloud storage. I am using file notification mode with Event Grid and a queue service set up in an Azure storage account...
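For context, a minimal file-notification Auto Loader stream on ADLS looks roughly like the sketch below; the storage paths, file format, and target table are placeholders, and the Event Grid/queue wiring described in the post is assumed to already be in place:

    df = (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .option("cloudFiles.useNotifications", "true")
        .option("cloudFiles.schemaLocation",
                "abfss://container@account.dfs.core.windows.net/_schemas/events")
        .load("abfss://container@account.dfs.core.windows.net/landing/events")
    )

    (df.writeStream
       .option("checkpointLocation",
               "abfss://container@account.dfs.core.windows.net/_checkpoints/events")
       .trigger(availableNow=True)
       .toTable("my_catalog.my_schema.events_bronze"))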

Latest Reply
matthew_m
Databricks Employee
  • 0 kudos

Hi @Sambit_S, I misread inputRows as inputFiles, which aren't the same thing. Considering the limitation on Azure queues, if you are already at the limit then you may need to consider switching to an event source such as Kafka or Event Hub to get b...

7 More Replies
Ramana
by Contributor III
  • 2718 Views
  • 3 replies
  • 0 kudos

SHOW GROUPS is not giving groups available at the account level

I am trying to capture all the Databricks groups and their mapping to user/AD group(s). I tried to do this by using SHOW GROUPS, SHOW USERS, and SHOW GRANTS, following the examples mentioned in the article below, but the SHOW GROUPS command only fetc...

Latest Reply
Ramana
Contributor III
  • 0 kudos

Yes, I can use the REST API, but I am looking for a SQL or programming way to do this rather than making the API calls, building the complex-datatype DataFrame, and then saving it as a table. Thanks, Ramana
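One middle ground (still REST underneath, but without hand-rolled calls or response plumbing) is the Databricks Python SDK; a sketch with a placeholder target table, noting that account-level groups would need AccountClient rather than WorkspaceClient:

    from databricks.sdk import WorkspaceClient

    w = WorkspaceClient()  # picks up notebook authentication automatically

    rows = []
    for g in w.groups.list(attributes="displayName,members"):
        for m in (g.members or []):
            rows.append((g.display_name, m.display))

    (spark.createDataFrame(rows, "group_name string, member string")
         .write.mode("overwrite")
         .saveAsTable("my_catalog.my_schema.group_memberships"))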

2 More Replies
kseyser
by New Contributor II
  • 1802 Views
  • 2 replies
  • 1 kudos

Predicting compute required to run Spark jobs

I'm working on a project to predict the compute (cores) required to run Spark jobs. Has anyone worked on this or something similar before? How did you get started?

Latest Reply
Yeshwanth
Databricks Employee
  • 1 kudos

@kseyser good day, this documentation might help you with your use case: https://docs.databricks.com/en/compute/cluster-config-best-practices.html#compute-sizing-considerations Kind regards, Yesh

1 More Replies
