cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

weldermartins
by Honored Contributor
  • 2833 Views
  • 3 replies
  • 13 kudos

Resolved! SCD type 2

Hey guys. I don't know if I'm tired, I ask for your help, but I don't understand where is the difference in the number of fields.Thanks! I'm replicating SCD type 2 based on this documentation:https://docs.delta.io/latest/delta-update.html#slowly-chan...

SCD 2
  • 2833 Views
  • 3 replies
  • 13 kudos
Latest Reply
weldermartins
Honored Contributor
  • 13 kudos

@Werner Stinckens​ ?

  • 13 kudos
2 More Replies
SparrowDev
by New Contributor II
  • 8199 Views
  • 5 replies
  • 3 kudos

Resolved! New branch has a changes

Hi there, I ran into issue with Databricks repo. When I create new repo, it doesn't pull default branch from GitLab. I have default branch 'development', but Databricks repo pulls other branch.Moreover this just now added repo already contains change...

  • 8199 Views
  • 5 replies
  • 3 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 3 kudos

I had a similar issue, where a few notebooks popped up as changed in every branch I created (even though they were not touched). So first I manually did the discard changes thing, but that was not a solution as with every new branch they were there ...

  • 3 kudos
4 More Replies
Chris_Konsur
by New Contributor III
  • 2417 Views
  • 2 replies
  • 3 kudos

Resolved! to configure Autoloader in File notification mode to access the Premium BlobStorage

First, I tried to configure Autoloader in File notification mode to access the Premium BlobStorage 'databrickspoc1' (PREMIUM , ADLS Gen2). I get this Error: I get this errorcom.microsoft.azure.storage.StorageException: I checked my storage account->N...

  • 2417 Views
  • 2 replies
  • 3 kudos
Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 3 kudos

When you created a premium account, have you chosen "Premium account type" as "File shares"? It should be "Block blobs".

  • 3 kudos
1 More Replies
Priya_Mani
by New Contributor II
  • 1971 Views
  • 3 replies
  • 4 kudos

Databricks Notebook dataframe loading duplicate data in SQL table

Hi, I am trying to load data from datalake into SQL table using "SourceDataFrame.write" operation in a Notebook using apache spark.This seems to be loading duplicates at random times. The logs don't give much information and I am not sure what else t...

  • 1971 Views
  • 3 replies
  • 4 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 4 kudos

can you elaborate a bit more on this notebook?And also what databricks runtime version?

  • 4 kudos
2 More Replies
User16844588229
by New Contributor III
  • 7252 Views
  • 9 replies
  • 4 kudos

docs.databricks.com

Navigate and discover content more efficiently with Search in DatabricksHi all- Justin Kim here, I'm the Databricks product manager responsible for content organization and navigation in our product, which includes Search. Great to see you on the Com...

Search bar Search modal
  • 7252 Views
  • 9 replies
  • 4 kudos
Latest Reply
karthik_p
Esteemed Contributor
  • 4 kudos

@Justin Kim​ Thank you for quick reply, usually Last Modified is Recent changes right (that can be last 24hrs or cap limit that we add), whereas anytime they should show all Notebooks or Tables from start. that is where i got confused

  • 4 kudos
8 More Replies
Sandy21
by New Contributor III
  • 1137 Views
  • 1 replies
  • 3 kudos

Queries with running REST API command in databricks to create a Job

What happens when jobs/create REST API command is run multiple times(say 3 times) with the same JSON configuration? Will 3 jobs are created with the same name or only 1 job will be created?

  • 1137 Views
  • 1 replies
  • 3 kudos
Latest Reply
Debayan
Databricks Employee
  • 3 kudos

Hi @Santhosh Raj​ , logically only one job should be created.

  • 3 kudos
Dicer
by Valued Contributor
  • 6601 Views
  • 2 replies
  • 1 kudos

Resolved! PARSE_SYNTAX_ERROR: Syntax error at or near 'VACUUM'

I tried to VACUUM a delta table, but there is a Syntax error.Here is the code:%sql set spark.databricks.delta.retentionDurationCheck.enabled = False   VACUUM test_deltatable

  • 6601 Views
  • 2 replies
  • 1 kudos
Latest Reply
Ravi
Databricks Employee
  • 1 kudos

@Cheuk Hin Christophe Poon​ Missing semi-colon at end of line 2?%sql set spark.databricks.delta.retentionDurationCheck.enabled = False; VACUUM test_deltatable

  • 1 kudos
1 More Replies
a2_ish
by New Contributor II
  • 1445 Views
  • 2 replies
  • 2 kudos

How to write the delta files for managed table? how can I define the sink

I have tried below code to write data in a delta table and save the delta files in a sink. I tried using azure storage as sink but I get error as not enough access, I can confirm that I have enough access to azure storage, however I can run the below...

  • 1445 Views
  • 2 replies
  • 2 kudos
Latest Reply
Anonymous
Not applicable
  • 2 kudos

Hi @Ankit Kumar​ Does @Hubert Dudek​  response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

  • 2 kudos
1 More Replies
pret
by New Contributor II
  • 3186 Views
  • 4 replies
  • 0 kudos

How can I run a scala command line in databricks?

I wish to run a scala command, which I believe would normally be run from a scala command line rather than from within a notebook. It happens to be:scala [-cp scalatest-<version>.jar:...] org.scalatest.tools.Runner [arguments](scalatest_2.12__3.0.8.j...

  • 3186 Views
  • 4 replies
  • 0 kudos
Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hi @David Vardy​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks...

  • 0 kudos
3 More Replies
hafeez
by New Contributor III
  • 3667 Views
  • 4 replies
  • 7 kudos

Azure Databricks SCIM API GA Availability plans?

When is the Azure SCIM API 2.0 is going General Availability. Currently I see it is on public preview?And also any security concerns for the APIs In Preview if it is being used in our current production?Reference: https://learn.microsoft.com/en-us/az...

  • 3667 Views
  • 4 replies
  • 7 kudos
Latest Reply
Anonymous
Not applicable
  • 7 kudos

Hi @Sultan Mohideen Hafeez Aboobacker Mohammed Shahul​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more h...

  • 7 kudos
3 More Replies
User16826994223
by Honored Contributor III
  • 1248 Views
  • 2 replies
  • 0 kudos

Lakehouse Concept

I want to understand lake house concept in very brief If I have to pitch for a customer in 1 minute

  • 1248 Views
  • 2 replies
  • 0 kudos
Latest Reply
User16826994223
Honored Contributor III
  • 0 kudos

Lakehouse is a concept defined with the following Parameter-Data is stored in an open standard format.Data is stored in a way which support Data Science,ML and BI loads.Delta is just a way or engine on cloud storage that provides control on data and...

  • 0 kudos
1 More Replies
CatherineDalzel
by New Contributor II
  • 1963 Views
  • 2 replies
  • 0 kudos

Sign in to Databricks Partner Academy

A few months ago, I signed up for the Databricks Academy, customer version. Since then, my company has become a Databricks Partner (Go Databricks + Stardog!). My colleagues have successfully registered for Databricks Partner Academy, but I can't get ...

  • 1963 Views
  • 2 replies
  • 0 kudos
Latest Reply
" src="" />
This widget could not be displayed.
This widget could not be displayed.
This widget could not be displayed.
  • 0 kudos

This widget could not be displayed.
A few months ago, I signed up for the Databricks Academy, customer version. Since then, my company has become a Databricks Partner (Go Databricks + Stardog!). My colleagues have successfully registered for Databricks Partner Academy, but I can't get ...

This widget could not be displayed.
  • 0 kudos
This widget could not be displayed.
1 More Replies
User16790091296
by Contributor II
  • 2667 Views
  • 2 replies
  • 2 kudos

How to configure Databricks token inside Docker File?

I have a docker file where I want toDownload the Databricks CLIConfigure the CLI by adding a host and tokenAnd then running a python file that hits the Databricks tokenI am able to install the CLI in the docker image, and I have a working python file...

  • 2667 Views
  • 2 replies
  • 2 kudos
Latest Reply
sachingawade
New Contributor II
  • 2 kudos

Hi I was facing same issue and searching for the solution but didnt get it, and now after working on it i have the solution if you want to access databricks models/download_artifacts using hostname and access token like how you do on databricks cli ...

  • 2 kudos
1 More Replies
Karthikeyan1
by New Contributor II
  • 1118 Views
  • 1 replies
  • 2 kudos

How to get the Databricks Default Pricing through API or any endpoints (*if available)???

Attaching Screenshots FYI from the official site, I've checked the Inspect, but no API calls have been made specifically for this cost default scrapping, Is there any Endpoints available to scrape this?

image.png
  • 1118 Views
  • 1 replies
  • 2 kudos
Latest Reply
Karthikeyan1
New Contributor II
  • 2 kudos

@All Users Group​ Any views on this??

  • 2 kudos

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels