Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Phuonganh
by Databricks Partner
  • 4066 Views
  • 3 replies
  • 4 kudos

Databricks SDK for Python: Errors with parameters for Statement Execution

Hi team, I'm using the Databricks SDK for Python to run SQL queries. I created a variable as below: param = [{'name': 'a', 'value': 'x'}, {'name': 'b', 'value': 'y'}] and passed it to the statement as below: _ = w.statement_execution.execute_statement( warehous...

Latest Reply
vfrcode
New Contributor II
  • 4 kudos

The following works: response = w.statement_execution.execute_statement( statement='ALTER TABLE users ALTER COLUMN :col_name SET NOT NULL', warehouse_id='<warehouseID>', parameters=[sql.StatementParameterListItem(name='col_name', value='u...

2 More Replies
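
A minimal sketch of the approach in the reply above: turning the plain name/value dicts from the question into `sql.StatementParameterListItem` objects before calling `execute_statement`. The helper names `normalize_params` and `execute_with_params` are hypothetical, and the sketch assumes the `databricks-sdk` package is installed and authentication is already configured.

```python
from typing import Any, Optional


def normalize_params(params: list[dict[str, Any]]) -> list[dict[str, Optional[str]]]:
    """Check that each entry has 'name' and 'value'; stringify non-null values."""
    cleaned = []
    for p in params:
        if "name" not in p or "value" not in p:
            raise ValueError(f"parameter needs 'name' and 'value': {p!r}")
        value = None if p["value"] is None else str(p["value"])
        cleaned.append({"name": str(p["name"]), "value": value})
    return cleaned


def execute_with_params(warehouse_id: str, statement: str, params: list[dict[str, Any]]):
    """Run a parameterized statement, passing typed items instead of raw dicts."""
    # Imported lazily so normalize_params can be used without the SDK installed.
    from databricks.sdk import WorkspaceClient
    from databricks.sdk.service import sql

    w = WorkspaceClient()  # assumption: credentials come from the environment
    items = [
        sql.StatementParameterListItem(name=p["name"], value=p["value"])
        for p in normalize_params(params)
    ]
    return w.statement_execution.execute_statement(
        statement=statement,
        warehouse_id=warehouse_id,
        parameters=items,
    )
```

Calling `execute_with_params('<warehouseID>', 'SELECT :a AS a, :b AS b', [{'name': 'a', 'value': 'x'}, {'name': 'b', 'value': 'y'}])` would send both values as named parameters.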
Henry
by New Contributor II
  • 5076 Views
  • 7 replies
  • 0 kudos

Cannot log in to Databricks Community Edition with new account

I cannot log in to Databricks Community Edition. I recently created a new account and had it verified. However, whenever I try to log in, I am redirected to the same page without any error being shown. When I do en...

Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Please check the steps provided in https://community.databricks.com/t5/support-faqs/databricks-community-sso-august-3rd-2024/ta-p/78459. If the issue persists, please reach out to databricks-community@databricks.com.

6 More Replies
lauraxyz
by Contributor
  • 1366 Views
  • 2 replies
  • 0 kudos

dbutils.notebook API: pass data back to caller notebook

Hi all, according to this doc, we can pass data back through temp views, DBFS, or JSON data. However, in my case, I need to pass both a temp view and some metadata in JSON. Is there a way to exit with BOTH a view AND JSON, something like dbuti...

Latest Reply
lauraxyz
Contributor
  • 0 kudos

I can give it a try and see whether it behaves like exit(view_name), in that the view is created in global_temp_db and its lifecycle is tied to the job compute.

1 More Replies
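
One possible way to return both a view and metadata, sketched under assumptions: since `dbutils.notebook.exit` takes a single string, the callee could register a global temp view and exit with a JSON string that carries the view name alongside the metadata. The helper names `build_exit_payload` and `parse_exit_payload`, the view name `my_result`, and the notebook path `callee` are hypothetical. The `dbutils` and `spark` calls appear only as comments because they run only inside a Databricks notebook.

```python
import json


def build_exit_payload(view_name: str, metadata: dict) -> str:
    """Pack the view name and JSON-serializable metadata into one exit string."""
    return json.dumps({"view_name": view_name, "metadata": metadata})


def parse_exit_payload(payload: str) -> tuple[str, dict]:
    """Unpack the string returned by dbutils.notebook.run in the caller."""
    data = json.loads(payload)
    return data["view_name"], data["metadata"]


# Callee notebook (Databricks only):
#   df.createOrReplaceGlobalTempView("my_result")
#   dbutils.notebook.exit(build_exit_payload("my_result", {"row_count": 42}))
#
# Caller notebook (Databricks only):
#   raw = dbutils.notebook.run("callee", 600)
#   view_name, metadata = parse_exit_payload(raw)
#   global_temp_db = spark.conf.get("spark.sql.globalTempDatabase")
#   result_df = spark.table(f"{global_temp_db}.{view_name}")
```

The view itself still lives on the compute that ran the callee, so this only works while caller and callee share that compute.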
dixonantony
by New Contributor III
  • 2219 Views
  • 8 replies
  • 0 kudos

Not able to create table from external Spark

py4j.protocol.Py4JJavaError: An error occurred while calling o123.sql.: io.unitycatalog.client.ApiException: generateTemporaryPathCredentials call failed with: 401 - {"error_code":"UNAUTHENTICATED","message":"Request to generate access credential for...

Latest Reply
NandiniN
Databricks Employee
  • 0 kudos

You need the generateTemporaryPathCredentials API as you are trying to create external tables 

7 More Replies
jeremy98
by Honored Contributor
  • 11300 Views
  • 3 replies
  • 0 kudos

Resolved! token share

Hello community, I want to create a new token that our users can use to interact with the staging workspace. Is it possible to generate a token that can only be used to trigger workflows in the staging Databricks workspace through the Databricks API?

Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

You will need your own personal access token to create one for your SP. In the UI, go to Settings > User > Developer > Personal Access Token. Once you have your own token you can run the API I mentioned in my previous post and you ne...

2 More Replies
jeremy98
by Honored Contributor
  • 1207 Views
  • 5 replies
  • 1 kudos

For each task field

Hi community, I was wondering: after passing a list of dicts between tasks using the taskValues.set() method, how can I keep the same data types in each task? It seems that when I use the for loop and get each element through the parameters of the ...

Latest Reply
Alberto_Umana
Databricks Employee
  • 1 kudos

Yeah, to ensure that the data types are maintained, you can convert the values to the desired types after deserialization. This is necessary because JSON does not distinguish between integers and floats, and all numbers are deserialized as floats. The ...

4 More Replies
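
A small sketch of the "convert after deserialization" idea from the reply, assuming each for-each iteration receives its element as a JSON string. The schema, the field names, and the parameter name `item` are hypothetical and only illustrate the pattern.

```python
import json

# Hypothetical schema: the type each field should have inside the task.
SCHEMA = {"id": int, "ratio": float, "name": str}


def restore_types(raw: str, schema: dict = SCHEMA) -> dict:
    """Deserialize one element and cast every field to its expected type."""
    item = json.loads(raw)
    return {key: cast(item[key]) for key, cast in schema.items()}


# Upstream task (Databricks only):
#   dbutils.jobs.taskValues.set(key="rows", value=[{"id": 1, "ratio": 0.5, "name": "a"}])
#
# Inside the for-each task (Databricks only), with a hypothetical parameter "item":
#   row = restore_types(dbutils.widgets.get("item"))
```

Casting explicitly means the task does not depend on how a number or string happened to be serialized on the way in.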
VJ3
by Contributor
  • 2505 Views
  • 3 replies
  • 0 kudos

Databricks Upload local files (Create/Modify table)

Hello Team, I believe Databricks recently released a feature to create or modify a table using file upload for files smaller than 2 GB (CSV, TSV, JSON, Avro, Parquet, or text files, to create or overwrite a managed Delta Lake table) on Self Se...

Latest Reply
NandiniN
Databricks Employee
  • 0 kudos

For sharing a CSV file containing PII data with another user who should not have access to PII data elements: you can use Databricks' Unity Catalog to manage and govern access to data. Unity Catalog allows you to define fine-grained access controls a...

2 More Replies
alpar
by New Contributor II
  • 5240 Views
  • 4 replies
  • 4 kudos

Merge operation to delta table with new column starting with upper case seems to be not working

Hello, I have a simple Spark DataFrame saved to a Delta table: data = [ (1, "John", "Doe"), (2, "Jane", "Smith"), (3, "Mike", "Johnson"), (4, "Emily", "Davis") ] columns = ["Id", "First_name", "Last_name"] df = spark.createDataFrame(data, sche...

Latest Reply
hari-prasad
Valued Contributor II
  • 4 kudos

I assume you are hitting the error described in this GitHub issue. You can follow it; a fix is expected in a release: [BUG][Spark] issue when merge using autoMerge property · Issue #3336 · delta-io/delta · GitHub

3 More Replies
REM1992
by New Contributor
  • 1049 Views
  • 1 reply
  • 0 kudos

Alert monitoring, not running in schedule

Hello, I think the alert that I set is not running on the schedule that I set, every day at 9 am JST. It shows up as running, with the running symbol moving, but it says since 2025/1/7 while it should have run at 2025/1/8 9:00 am ...

Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

The date shown there is the date of the first execution. If you check the job runs, do you see jobs running every day over that period?

mh7
by New Contributor II
  • 2902 Views
  • 3 replies
  • 0 kudos

spark throws error while using [NOT_IMPLEMENTED] rdd is not implemented.

I am running code on 15.4 LTS and it works fine on an all-purpose cluster: processed_counts = df.rdd.mapPartitions(process_partition).reduce(lambda x, y: x + y). When I run the same code on a job cluster, it throws the error below. I verified the cluster setti...

Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

OK, but your all-purpose cluster is set up in Single User mode, which does support RDDs. Can you confirm your job cluster is also created in Single User mode?

2 More Replies
Databricks_-Dat
by New Contributor II
  • 9595 Views
  • 4 replies
  • 1 kudos

Databricks workflows, sample script/method to deploy jobs.json to other workspace

Could someone point me in the right direction to deploy jobs from one workspace to another using a JSON file in a DevOps CI/CD pipeline? Thanks in advance.

Latest Reply
yuvapraveen_k
New Contributor III
  • 1 kudos

You are welcome. Databricks released a feature that links the workflow definition to Git automatically. Please refer to the link below: https://www.databricks.com/blog/2022/06/21/build-reliable-production-data-and-ml-pipelines-with-git...

3 More Replies
Deepak_Goldwyn
by New Contributor III
  • 9185 Views
  • 5 replies
  • 2 kudos

Resolved! Create Jobs and Pipelines in Workflows using API

I am trying to create Databricks Jobs and Delta Live Tables (DLT) pipelines using the Databricks API. I would like to keep the JSON definitions of the Jobs and DLT pipelines in the repository (to configure them per environment) and execute the Databricks API by passing...

Latest Reply
Deepak_Goldwyn
New Contributor III
  • 2 kudos

Hi Jose, yes, it answered my question. I am indeed using a JSON file to create Jobs and pipelines. Thanks.

4 More Replies
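
A rough sketch of the JSON-in-repository approach described in this thread, using only the Python standard library. It assumes the Jobs API 2.1 create endpoint (`/api/2.1/jobs/create`) and the Pipelines API create endpoint (`/api/2.0/pipelines`). The helper names, the override mechanism, and the file name `jobs.json` are hypothetical, and the thread does not show the poster's actual setup.

```python
import json
import urllib.request

JOBS_CREATE = "/api/2.1/jobs/create"
PIPELINES_CREATE = "/api/2.0/pipelines"


def apply_overrides(definition: dict, overrides: dict) -> dict:
    """Shallow-merge per-environment settings over the definition from the repo."""
    merged = dict(definition)
    merged.update(overrides)
    return merged


def build_create_request(host: str, token: str, endpoint: str, definition: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that creates a job or pipeline."""
    return urllib.request.Request(
        url=f"{host.rstrip('/')}{endpoint}",
        data=json.dumps(definition).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Sending the request from a CI/CD step (needs a real workspace URL and token):
#   with open("jobs.json") as f:
#       job = apply_overrides(json.load(f), {"name": "nightly-load-staging"})
#   req = build_create_request(host, token, JOBS_CREATE, job)
#   with urllib.request.urlopen(req) as resp:
#       print(json.load(resp))
```

Keeping request construction separate from sending makes the per-environment merge easy to check in a pipeline before anything is created.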
Ru
by Databricks Partner
  • 2113 Views
  • 6 replies
  • 2 kudos

Resolved! DLT Databricks Runtime version for the CURRENT channel doesn't match what's in release 2024.49

I'm expecting the Databricks Runtime for the DLT pipeline (CURRENT) to match the 2024.49 release notes. However, this is not the case. We are seeing CURRENT DLT pipelines still using Databricks Runtime 14. Our code depends on Databricks Runtime 15.4,...

Latest Reply
Alberto_Umana
Databricks Employee
  • 2 kudos

Hi @Ru, is it still not on DBR version 15.4? If not, do you have a support plan to open a case?

5 More Replies
Dilorom
by New Contributor
  • 7395 Views
  • 2 replies
  • 0 kudos

How to connect to Dynamics CRM server in Databricks.

Currently I have access to Dynamics CRM backend server via AAD, and I can query tables via XRM tool. I am trying to connect to Dynamics CRM backend server in Databricks, and I am not sure how the connection needs to be set up or if any other access n...

Latest Reply
arijitm
Databricks Employee
  • 0 kudos

Hi @Dilorom @sheridan06 I was wondering if you were able to successfully connect and have some guidance or best practices around this.

1 More Replies