Community Discussions

by anonymous_567 • New Contributor II

yesterday

98 Views
1 replies
0 kudos

Ingesting Non-Incremental Data into Delta

Hello,I have non-incremental data landing in a storage account. This data contains old data from before as well as new data. I would like to avoid doing a complete table deletion and table creation just to upload the data from storage and have an upd...

Community Discussions

Reply

98 Views
1 replies
0 kudos

yesterday

View Replies

Latest Reply

AmanSehgal
Honored Contributor III

yesterday

0 kudos

Well, if you know the conditions to separate new data from old data, then while reading the data in to your dataframe, use filter or where clause to select new data and ingest it in to your delta table.This is how you can do in general. But if you ha...

0 kudos

yesterday

by Mustafa_Kamal • New Contributor II

yesterday

31 Views
1 replies
0 kudos

Parameterizing DLT Pipelines

Hi Everyone,I have DLTP pipeline which I need to execute for difference source systems. Need advise on how to parametrize this.I have gone through many articles on the web, but it seems there is no accurate information available.Can anyone please hel...

Community Discussions

Reply

31 Views
1 replies
0 kudos

yesterday

View Replies

Latest Reply

AmanSehgal
Honored Contributor III

yesterday

0 kudos

You can provide parameters in the configuration section of DLT pipeline and access it in your code using spark.conf.get(<parameter_name>).Parameterize DLT pipelines

0 kudos

yesterday

by databrciks • Visitor

yesterday

154 Views
0 replies
0 kudos

Databrciks: failure logs

Hello Team,I am new to Databrciks. Generally where all the logs will be stored in Databricks. I see if any job fails below the command i could see some error messages.Otherwise in real time how to check the log files/error messages in Databricks UI.T...

Community Discussions

Reply

154 Views
0 replies
0 kudos

yesterday

by k2 • New Contributor

Wednesday

196 Views
1 replies
0 kudos

log delivery are not creating data in s3 bucket

Hiii, Does anyone have an idea about the typical duration for Databricks to create logs in an S3 bucket using the databricks_mws_log_delivery Terraform resource? I've implemented the code provided in the Databricks official documentation, but I've be...

Community Discussions

Reply

196 Views
1 replies
0 kudos

Wednesday

View Replies

Latest Reply

k2
New Contributor

yesterday

0 kudos

The issue has been resolved. There was no problem with the code or the API. However, it took over 12 hours for logs to start appearing in my bucket, despite Databricks documentation indicating that logs should appear within 1 hour..Thank you!

0 kudos

yesterday

by TheIceBrick • New Contributor III

01-19-2024 8:38:16 AM

2391 Views
3 replies
1 kudos

Is there a (request-) size limit for the Databricks Rest Api Sql statements?

When inserting rows through the Sql Api (/api/2.0/sql/statements/), when more than a certain number of records (about 25 records with 8 small columns) are included in the statement, the call fails with the error:"The request could not be processed by...

Community Discussions

REST API

Sql Statements

Reply

2391 Views
3 replies
1 kudos

01-19-2024 8:38:16 AM

View Replies

Latest Reply

ChrisCkx
Visitor

yesterday

1 kudos

@TheIceBrick did you find out anything else about this?I am experiencing exactly the same, I can insert up to 35 rows but break at about 50 rows.The payload size is 42KB, I am passing parameters for each row.@Debayan This is no where near the 16MiB /...

1 kudos

yesterday

2 More Replies

by Ruby8376 • Valued Contributor

a month ago

515 Views
7 replies
1 kudos

Expose delta table data to Salesforce - odata?

HI Looking for suggestiongs to stream on demand data from databricks delta tables to salesforce.Is odata a good option?

Community Discussions

Reply

515 Views
7 replies
1 kudos

a month ago

View Replies

Latest Reply

-werners-
Esteemed Contributor III

4 weeks ago

1 kudos

I see, is there a possibility in SF to define an external location/datasource?Just guessing here, as these type of packages are really good in isolating data, not integrating it.

1 kudos

4 weeks ago

6 More Replies

by jenshumrich • New Contributor III

a week ago

195 Views
2 replies
0 kudos

Long running jobs get lost

Hello,I tried to schedule a long running job and surprisingly it does seem to neither terminate (and thus does not let the cluster shut down), nor continue running, even though the state is still "Running":But the truth is that the job has miserably ...

Community Discussions

Reply

195 Views
2 replies
0 kudos

a week ago

View Replies

Latest Reply

Lakshay
Esteemed Contributor

yesterday

0 kudos

Have you looked at the sql plan to see what the spark job 72 was doing?

0 kudos

yesterday

1 More Replies

by chari • Contributor

yesterday

51 Views
3 replies
0 kudos

Reading csv file with spark throws [insufficient privelage] error

Hello Community,I have some csv files saved in databricks workspace and want to read them with spark. I make use of the commanddf = spark.read.format('csv').load(r'filepath') However, it throws the error.org.apache.spark.SparkSecurityException: [INSU...

Community Discussions

Reply

51 Views
3 replies
0 kudos

yesterday

View Replies

Latest Reply

Lakshay
Esteemed Contributor

yesterday

0 kudos

If this a UC enabled workspace, you need to provide the right access.

0 kudos

yesterday

2 More Replies

by thilanka02 • New Contributor

yesterday

109 Views
2 replies
1 kudos

Resolved! Spark read CSV does not throw Exception if the file path is not available in Databricks 14.3

We were using this method and this was working as expected in Databricks 13.3. def read_file(): try: df_temp_dlr_kpi = spark.read.load(raw_path,format="csv", schema=kpi_schema) return df_temp_dlr_kpi except Exce...

Community Discussions

Reply

109 Views
2 replies
1 kudos

yesterday

View Replies

Latest Reply

thilanka02
New Contributor

yesterday

1 kudos

Thank you @daniel_sahal for the reply

1 kudos

yesterday

1 More Replies

by liormayn • New Contributor

Thursday

141 Views
1 replies
0 kudos

OSError: [Errno 78] Remote address changed

Hello:)as part of deploying an app that previously ran directly on emr to databricks, we are running experiments using LTS 9.1, and getting the following error: PythonException: An exception was thrown from a UDF: 'pyspark.serializers.SerializationEr...

Community Discussions

Reply

141 Views
1 replies
0 kudos

Thursday

View Replies

Latest Reply

shan_chandra
Honored Contributor III

Thursday

0 kudos

@liormayn - could you please let us know if you had a chance to run it on DBR 10.4 LTS?

0 kudos

Thursday

by Ajay-Pandey • Esteemed Contributor III

02-13-2024 9:01:14 PM

668 Views
3 replies
2 kudos

Resolved! Update regarding Community Reward Store

Hi Team,Is there any update on the Community Reward Store, as it's been discontinued from the old portal, and we still can't see the new portal for that.Is there any expected date when this will be available for community members?

Community Discussions

Reply

668 Views
3 replies
2 kudos

02-13-2024 9:01:14 PM

View Replies

Latest Reply

Ajay-Pandey
Esteemed Contributor III

02-16-2024 8:28:37 PM

2 kudos

Thanks for update.

2 kudos

02-16-2024 8:28:37 PM

2 More Replies

by liormayn • New Contributor

Thursday

64 Views
0 replies
0 kudos

Error while encoding: java.lang.RuntimeException: org.apache.spark.sql.catalyst.util.GenericArrayDa

Hello:)we are trying to run an existing working flow that works currently on EMR, on databricks.we use LTS 10.4, and when loading the data we get the following error:at org.apache.spark.api.python.BasePythonRunner$WriterThread.run(PythonRunner.scala:...

Community Discussions

Reply

64 Views
0 replies
0 kudos

Thursday

by anonymous_567 • New Contributor II

Wednesday

176 Views
3 replies
0 kudos

Autoloader update table when new changes are made

Hello,Everyday a new file of the same name gets sent to my storage account with old and new data appended at the end. Columns may also be added during one of these file updates. This file does a complete overwrite of the previous file. Is it possibl...

Community Discussions

Reply

176 Views
3 replies
0 kudos

Wednesday

View Replies

Latest Reply

data-grassroots
New Contributor

Wednesday

0 kudos

This may be helpful - the bit on allow overwritehttps://docs.databricks.com/en/ingestion/auto-loader/faq.html

0 kudos

Wednesday

2 More Replies

by Miguel_Grafana • New Contributor

Wednesday

70 Views
0 replies
0 kudos

Azure Oauth Passthrough with the Go Driver

Can anyone point me towards some resources for achieving this? I already have the token.Trying with: dbsql.WithAccessToken(settings.Token)But I'm getting the following error:Unable to load OAuth Config: request error after 1 attempt(s): unexpected HT...

Community Discussions

Reply

70 Views
0 replies
0 kudos

Wednesday

by Alexandru • New Contributor II

a week ago

278 Views
3 replies
0 kudos

Resolved! vscode python project for development

Hi,I'm trying to set up a local development environment using python / vscode / poetry. Also, linting is enabled (Microsoft pylance extension) and the python.analysis.typeCheckingMode is set to strict.We are using python files for our code (.py) whit...

Community Discussions

Reply

278 Views
3 replies
0 kudos

a week ago

View Replies

Latest Reply

artsheiko
Valued Contributor III

Tuesday

0 kudos

Hi Alexandru, Take a look at VSCode extension for Databricks : https://marketplace.visualstudio.com/items?itemName=databricks.databricks

0 kudos

Tuesday

2 More Replies

Databricks

Forum Posts

Ingesting Non-Incremental Data into Delta

Parameterizing DLT Pipelines

Databrciks: failure logs

log delivery are not creating data in s3 bucket

Is there a (request-) size limit for the Databricks Rest Api Sql statements?

Expose delta table data to Salesforce - odata?

Long running jobs get lost

Reading csv file with spark throws [insufficient privelage] error

Resolved! Spark read CSV does not throw Exception if the file path is not available in Databricks 14.3

OSError: [Errno 78] Remote address changed

Resolved! Update regarding Community Reward Store

Error while encoding: java.lang.RuntimeException: org.apache.spark.sql.catalyst.util.GenericArrayDa

Autoloader update table when new changes are made

Azure Oauth Passthrough with the Go Driver

Resolved! vscode python project for development

Spark read CSV does not throw Exception if the fil...

how to run a group of cells in databricks ?

vscode python project for development

Is it possible to get Azure Databricks cluster met...

Can we get SQL Serverless warehouses monitoring da...