Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Mihai_Cog
by Databricks Partner
  • 10730 Views
  • 5 replies
  • 8 kudos

Resolved! Change Data Feed Databricks

Hello, I am testing the Change Data Feed feature using Databricks and PySpark (with the Delta format, of course), and there is something I don't understand: I created a table; saved some data inside; enabled the Change Data Feed feature; applied a merge with a dat...

Latest Reply
Tharun-Kumar
Databricks Employee
  • 8 kudos

@Mihai_Cog You have to split your merge statement into two parts (Update and Insert/Delete): MERGE INTO test t USING src s ON s.Id = t.Id AND s.date_field = t.date_field AND s.fields <> t.fields WHEN MATCHED THEN UPDATE SET *; MERGE INTO test t USING sr...

  • 8 kudos
4 More Replies
Smitha1
by Databricks Partner
  • 10201 Views
  • 10 replies
  • 9 kudos

Resolved! Request for reattempt voucher. Databricks Certified Associate Developer for Apache Spark 3.0 exam

Hi, I took the Databricks Certified Associate Developer for Apache Spark 3.0 exam today but missed passing by one percent. I got 68.33% and the passing score is 70%. I am planning to reattempt the exam; could you kindly give me another opportunity and provide a reattempt voucher...

Latest Reply
shriya
Databricks Partner
  • 9 kudos

Hi, I took the Databricks Certified Associate Developer for Apache Spark 3.0 Python exam yesterday but missed passing by three percent. I got 66.66% and the passing score is 70%. I am planning to reattempt the exam; could you kindly give me another opportunity and provide reat...

  • 9 kudos
9 More Replies
-werners-
by Esteemed Contributor III
  • 9387 Views
  • 6 replies
  • 12 kudos

Resolved! SSRS (on-prem) on Databricks SQL

Has anybody succeeded in querying Databricks SQL with an on-prem SSRS (so an on-prem Report Server and Report Builder)? I managed to create a connection that works (according to the connection test, anyway), but the moment I try to create a dataset on t...

Latest Reply
Haider93
New Contributor III
  • 12 kudos

Hi @-werners-, I am able to build a connection between Microsoft Visual Studio and Databricks using the Simba Spark ODBC driver. I can query Delta tables in Databricks from Microsoft Visual Studio (SSRS). However, when I am deploying the report t...

  • 12 kudos
5 More Replies
qasimhassan
by Contributor
  • 1062 Views
  • 0 replies
  • 0 kudos

Why no Practice Test for Databricks Professional Exam?

Hi Databricks Team/Community, I'm preparing for the Databricks Data Engineering Professional Certification, but from what I have seen, Databricks doesn't offer a practice test for this professional/advanced certification. I don't know why. If anyone is rea...

andresfp14
by New Contributor II
  • 30297 Views
  • 2 replies
  • 0 kudos

Can you delete a widget, or force a value to it?

Hello everyone! Basically, I have a dropdown widget, and at some point I want to change the options available, but still maintain the same selected option. The problems that I find are: - If I want to delete the widget and create a new one, it seems...

Latest Reply
Gim
Contributor
  • 0 kudos

I know this is 5+ years late, but I would like to add an answer (or workaround) in case anyone else is looking for the same solution. What I did was essentially 'reset' the widget whenever I had to assign (or force) a value to it. #Cell #1 ...
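The 'reset' approach described in this reply might look roughly like the sketch below. It is not the code from the truncated reply: the helper names `capture_and_remove` and `recreate_dropdown` are hypothetical, and `dbutils` is passed in explicitly so the helpers can be read outside a notebook. It uses `dbutils.widgets.get`, `dbutils.widgets.remove`, and `dbutils.widgets.dropdown`, and assumes the two helpers are called from separate notebook cells, since the Databricks widget documentation notes that a widget cannot be created in the same cell that removes one.

```python
def capture_and_remove(dbutils, name):
    """Cell 1: remember the current selection, then remove the widget."""
    current = dbutils.widgets.get(name)
    dbutils.widgets.remove(name)
    return current


def recreate_dropdown(dbutils, name, selected, choices, label=None):
    """Cell 2: re-create the dropdown with new choices, keeping the old selection."""
    choices = list(choices)
    # A dropdown's default must be one of its choices, so fall back to the
    # first choice if the previous selection is no longer offered.
    default = selected if selected in choices else choices[0]
    dbutils.widgets.dropdown(name, default, choices, label or name)
    return default
```

In a notebook, the first cell would call `capture_and_remove(dbutils, "my_widget")` and a later cell would call `recreate_dropdown(...)` with the saved value; `"my_widget"` is a placeholder name.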

  • 0 kudos
1 More Replies
Swostiman
by New Contributor II
  • 8300 Views
  • 5 replies
  • 4 kudos

Consuming data from databricks[Hive metastore] sql endpoint using pyspark

I was trying to read some Delta data from a Databricks [Hive metastore] SQL endpoint using PySpark, but found that all the values fetched from the table are the same as the column names. Even when I try to just show the data it giv...

Latest Reply
sucan
New Contributor II
  • 4 kudos

Encountered the same issue and downgrading to 2.6.22 helped me resolve this issue.

  • 4 kudos
4 More Replies
pg1
by New Contributor II
  • 2334 Views
  • 2 replies
  • 1 kudos

How to expand display of workspace name in top right window?

How to expand display of workspace name in top right window? My workspace name is a little long and gets cut off, and I could not find a way to expand the name window.

Latest Reply
Tharun-Kumar
Databricks Employee
  • 1 kudos

@pg1 Is your workspace Unity Catalog enabled? We have identified this behaviour in Unity Catalog enabled workspaces and are already working on showing the entire workspace name when you hover over it.

  • 1 kudos
1 More Replies
cbabay
by New Contributor II
  • 3455 Views
  • 2 replies
  • 2 kudos

Resolved! Adding Duration Warning During Workflow Deployment

Hi Team, Thank you very much for making it so simple to add notifications on failure and duration warnings to workflows. Each time we deploy to dev through Jenkins, our email is unsubscribed and we stop getting alerts. Is there any documentation on ho...

Latest Reply
alexott
Databricks Employee
  • 2 kudos

It depends heavily on how you perform the deployment: which APIs are used, etc. Most probably you're using the Job Reset API instead of the Job Update API.
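To illustrate the difference this reply points at: in the Jobs API, `reset` overwrites all of a job's settings with the supplied `new_settings`, while `update` changes only the top-level fields that are supplied. The sketch below only builds the two request bodies and makes no HTTP call. The helper names, the job ID, the job name, and the email address are hypothetical, and the deployment in the question may work differently.

```python
import json


def reset_body(job_id, full_settings):
    """Body for POST /api/2.1/jobs/reset: new_settings replaces ALL job
    settings, so notifications missing from full_settings are dropped."""
    return {"job_id": job_id, "new_settings": dict(full_settings)}


def update_body(job_id, changed_settings):
    """Body for POST /api/2.1/jobs/update: only the top-level fields listed
    in new_settings change; other settings are left as they are."""
    return {"job_id": job_id, "new_settings": dict(changed_settings)}


# Hypothetical deployment that only renames the job.
deployed = {"name": "nightly-etl-v2"}

# Hypothetical alert settings that were added to the job in the UI.
alerts = {"email_notifications": {"on_failure": ["team@example.com"]}}

# With reset, the alerts survive only if the deployment sends them too.
print(json.dumps(reset_body(123, {**deployed, **alerts}), indent=2))

# With update, sending just the changed field leaves existing alerts in place.
print(json.dumps(update_body(123, deployed), indent=2))
```

If the deployment tool can only call `reset`, one option under these assumptions is to keep the notification settings in the deployed job definition so they are sent on every deployment.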

  • 2 kudos
1 More Replies
Yasser
by New Contributor
  • 1408 Views
  • 0 replies
  • 0 kudos

[sql warehouse] Invalid configuration value detected for fs.azure.account.key with 'force' = 'true'

Hello, I am getting the following error when trying to copy data to Databricks from ADLS with SQL, using a SAS token: Failure to initialize configuration for storage account <storage account>: Invalid configuration value detected for fs.azure.acco...

Chris_Shehu
by Valued Contributor III
  • 33281 Views
  • 5 replies
  • 5 kudos

Resolved! What is the best way to handle big data sets?

I'm trying to find the best strategy for handling big data sets. In this case I have a data set of 450 million records. I'm pulling the data from SQL Server very quickly, but when I try to push the data to the Delta table or an Azure container the...

Latest Reply
Wilynan
New Contributor II
  • 5 kudos

I think you should consult experts in Big Data for advice on this issue

  • 5 kudos
4 More Replies
parthsalvi
by Contributor
  • 4370 Views
  • 1 replies
  • 2 kudos

Amazon SES : boto3 credentials not found. DBR 11.2 Shared mode

We're trying to send email with Amazon SES using boto3.client in Python. We've added SES full access to the cluster's IAM role. We were able to send email in "No isolation shared" mode in DBR 11.2 using ses = boto3.client('ses', region_name='us-****-2'...

Latest Reply
JameDavi_51481
Contributor
  • 2 kudos

This appears to be an intentional design choice to prevent users from using the credentials of the host machine to carry out arbitrary AWS API calls. I really wish there was a workaround or setting to disable this behavior because we put a lot of wor...

  • 2 kudos
Henrik
by New Contributor III
  • 2242 Views
  • 1 replies
  • 1 kudos

Resolved! Run notebooks on serverless SQL cluster

Is it just me, or am I right that we can't run notebooks on a serverless SQL cluster? It would be a nice feature for SQL-based notebooks.

Latest Reply
Henrik
New Contributor III
  • 1 kudos

I figured it out. I needed to start the cluster first.

  • 1 kudos
Sinthiya
by Databricks Partner
  • 3529 Views
  • 1 replies
  • 1 kudos

Multiple streaming sources to the single delta live table

In our case, we have multiple sources writing to the same target table. A target table can be populated from multiple source tables, each contributing a set of fields. How can we add/update columns in a target table from multiple sources? In a delta live...

Latest Reply
SaiKiranGajjala
New Contributor II
  • 1 kudos

Following.

  • 1 kudos
Mr_K
by New Contributor
  • 11611 Views
  • 2 replies
  • 2 kudos

AnalysisException: [UC_COMMAND_NOT_SUPPORTED] Spark higher-order functions are not supported in Unity Catalog.;

Hello,
forecast_date = '2017-12-01'
spark.conf.set('spark.sql.shuffle.partitions', 500)
# generate forecast for this data
forecasts = (
    history
    .where(history.date < forecast_date)  # limit training data to prior to our forecast date
    .groupBy...

Latest Reply
Tharun-Kumar
Databricks Employee
  • 2 kudos

@Mr_K applyInPandas is a higher-order function in Python. As of now, we do not support higher-order functions in Unity Catalog. We do support direct calls made to Python UDFs. Here is an example of how to reference UDFs in UC: https://docs.databrick...

  • 2 kudos
1 More Replies
schnee1
by New Contributor III
  • 12848 Views
  • 8 replies
  • 0 kudos

Access struct elements inside dataframe?

I have a JSON data set that contains a price in a string like "USD 5.00". I'd like to convert the numeric portion to a Double to use in an MLlib LabeledPoint, and have managed to split the price string into an array of strings. The code below creates a data...
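As a plain-Python illustration of the parsing step (not the Spark code from the question, which is truncated here), a price string such as "USD 5.00" can be split on whitespace and its numeric part converted to a float. The function name `parse_price` is hypothetical, and the sketch assumes the value is always a currency code, a space, and a number.

```python
def parse_price(price):
    """Split a string like 'USD 5.00' into its currency code and numeric amount."""
    # Unpacking raises ValueError if the string does not have exactly two parts.
    currency, amount = price.split()
    return currency, float(amount)


currency, amount = parse_price("USD 5.00")
print(currency, amount)
```

On a Spark DataFrame the same idea would usually be expressed with column functions rather than a Python function, for example by splitting the column and casting the second element to a double; the exact code depends on the schema, which the snippet above does not show.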

Latest Reply
goldentriangle
New Contributor II
  • 0 kudos

Thanks, Golden Triangle Tour

  • 0 kudos
7 More Replies