cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

hayden_blair
by New Contributor III
  • 422 Views
  • 3 replies
  • 3 kudos

Delta Live Table automatic table removal and schema update

Hello, I made a delta live table workflow that created 3 streaming tables in unity catalog. I then removed the source code for the 3rd table from the workflow and reran. After about a week, the 3rd streaming table is no longer available in unity cata...

  • 422 Views
  • 3 replies
  • 3 kudos
Latest Reply
hayden_blair
New Contributor III
  • 3 kudos

This makes sense @raphaelblg! Just to confirm my understanding, is the following statement true:If I remove the source code for a unity catalog DLT streaming table from a DLT pipeline and wait 7 days, that table will be dropped from unity catalog, an...

  • 3 kudos
2 More Replies
dpc
by New Contributor III
  • 227 Views
  • 2 replies
  • 0 kudos

Returing and reusing the identity value

Hello I have a table that has a column defined as an identity (BIGINT GENERATED ALWAYS AS IDENTITY)I will be inserting rows into this table in parallelHow can I get the identity and use that within a pipelineParallel is relevant as there will be mult...

  • 227 Views
  • 2 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Contributor
  • 0 kudos

Hi @dpc ,What you're trying to achieve does not make sense in the context of identity columns. Look at below entry from documentation. So, the answer is - if you want to have concurrent transaction, don't use identity columns Declaring an identity co...

  • 0 kudos
1 More Replies
mbaas
by New Contributor III
  • 558 Views
  • 5 replies
  • 5 kudos

Temporary streaming tables (CDC)

I am currently using the `apply_changes` feature. I saw for the regular decorator `dlt.table` you can create temporary tables. I do not see the option you could use this feature with `dlt.create_streaming_table(`, in the sql version it looks it is su...

  • 558 Views
  • 5 replies
  • 5 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 5 kudos

Hi @mbaas, Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedback no...

  • 5 kudos
4 More Replies
joaogilsa
by New Contributor II
  • 563 Views
  • 3 replies
  • 1 kudos

Resolved! Delete folder using Databricks CLI

Hello,I am trying to delete a folder and its content using databricks cli, but I'm getting the following error:databricks workspace delete /Workspace/Users/XXX/XXX --profile DEFAULT --recursive trueError: expected to have the absolute path of the not...

  • 563 Views
  • 3 replies
  • 1 kudos
Latest Reply
joaogilsa
New Contributor II
  • 1 kudos

Thank you for the help, @szymon_dybczak, it worked!

  • 1 kudos
2 More Replies
FerArribas
by Contributor
  • 8571 Views
  • 6 replies
  • 6 kudos

Resolved! Redirect error in access to web app in Azure Databricks with private front endpoint

I have created a workspace with private endpoint in Azure following this guide:https://learn.microsoft.com/en-us/azure/databricks/administration-guide/cloud-configurations/azure/private-linkOnce I have created the private link of type browser_authent...

  • 8571 Views
  • 6 replies
  • 6 kudos
Latest Reply
flomader
New Contributor II
  • 6 kudos

You don't need a CNAME record.Go to your private link resource in Azure and click on Settings > DNS Configuration. Make sure you have created private link A records for all the FQDNs listed under 'Custom DNS records'. You have most likely missed one ...

  • 6 kudos
5 More Replies
yvishal519
by Contributor
  • 508 Views
  • 2 replies
  • 3 kudos

Resolved! Databricks DLT with Hive Metastore and ADLS Access Issues

We are currently working on Databricks DLT tables to transform data from bronze to silver. we are specifically instructed us not to use mount paths for accessing data from ADLS Gen 2. To comply, I configured storage credentials and created an externa...

yvishal519_0-1721908544085.png
  • 508 Views
  • 2 replies
  • 3 kudos
Latest Reply
szymon_dybczak
Contributor
  • 3 kudos

Hi @yvishal519 ,Since you're using hive metastore you have no other option than mount points. Storage credentials and external locations are only supported in Unity Catalog

  • 3 kudos
1 More Replies
helghe
by New Contributor II
  • 363 Views
  • 3 replies
  • 3 kudos

Unavailable system schemas

When I list the available schemas I get the following:{"schemas":[{"schema":"storage","state":"AVAILABLE"},{"schema":"operational_data","state":"UNAVAILABLE"},{"schema":"access","state":"AVAILABLE"},{"schema":"billing","state":"ENABLE_COMPLETED"},{"s...

  • 363 Views
  • 3 replies
  • 3 kudos
Latest Reply
hle
New Contributor II
  • 3 kudos

I have the same issue for the compute schema. Workspace is UC enabled and I'm account admin. 

  • 3 kudos
2 More Replies
Amit_Dass_Chmp
by New Contributor III
  • 246 Views
  • 1 replies
  • 0 kudos

Auto-tuning capability available for external tables?

If I am using Databricks Runtime 11.3 and above to create managed Delta tables cataloged in Unity Catalog (Databricks’ data catalog), I don’t need to worry about optimizing the underlying file sizes or configuring a target file size for my Delta tabl...

  • 246 Views
  • 1 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Contributor
  • 0 kudos

Hi @Amit_Dass_Chmp ,Yep, according to documentation. As of second question, such capability will be available in the future. If you are using Databricks Runtime 11.3 and above to create managed Delta tables cataloged in Unity Catalog (Databricks’ dat...

  • 0 kudos
dpc
by New Contributor III
  • 340 Views
  • 3 replies
  • 4 kudos

Resolved! Approach to monthly data snapshots

HelloI'm building a datawarehouse with all the usual facts and dimensionsIt will flush (truncate) and rebuild on a monthly basisUsers have the need to not only view the data now but also view it historically i.e. what it was a point in timeMy initial...

  • 340 Views
  • 3 replies
  • 4 kudos
Latest Reply
dpc
New Contributor III
  • 4 kudos

Great, thanks

  • 4 kudos
2 More Replies
angel531
by New Contributor II
  • 511 Views
  • 4 replies
  • 3 kudos

Resolved! getting error while accessing dbfs from databricks community account and couldnt upload any files

Hi, I have enabled dbfs in my databricks community account and started the cluster. while accessing dbfs its throwing an error.

doubt.png
  • 511 Views
  • 4 replies
  • 3 kudos
Latest Reply
satyakiguha
New Contributor III
  • 3 kudos

Hi @Kaniz_Fatma I am no longer facing this issue, Thanks to the team for fixing it !  

  • 3 kudos
3 More Replies
fdeba
by New Contributor
  • 269 Views
  • 2 replies
  • 0 kudos

DatabricksSession and SparkConf

Hi,I want to initialize a Spark session using `DatabricksSession`. However, it seems not possible to call `.config()` and pass it a `SparkConf` instance. The following works:# Initialize the configuration for the Spark session confSettings = [ ("...

  • 269 Views
  • 2 replies
  • 0 kudos
Latest Reply
Witold
Contributor III
  • 0 kudos

In almost all cases you don't need to create a new spark session, as Databricks will do it for you automatically.If it's only about spark configurations, there are multiple ways to set it:Cluster settingsspark.conf.set

  • 0 kudos
1 More Replies
mkd
by New Contributor II
  • 4738 Views
  • 4 replies
  • 5 kudos

Resolved! CSV import error

Upload ErrorError occurred when processing file tips1.csv: [object Object].  I've been trying to import a csv file from my local machine to the databricks. The above mentioned error couldn't be resolved. Anyone pls help me in this regard.

  • 4738 Views
  • 4 replies
  • 5 kudos
Latest Reply
clentin
Contributor
  • 5 kudos

@Kaniz_Fatma - this is now fixed. Thank you so much for your prompt action. Appreciate it. 

  • 5 kudos
3 More Replies
aalanis
by New Contributor II
  • 406 Views
  • 4 replies
  • 2 kudos

Issues reading json files with databricks vs oss pyspark

Hi Everyone, I'm currently developing an application in which I read json files with nested structure. I developed my code locally on my laptop using the opensource version of pyspark (3.5.1) using a similar code to this:sample_schema:schema = Struct...

  • 406 Views
  • 4 replies
  • 2 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 2 kudos

Hi @aalanis, Hi, Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedb...

  • 2 kudos
3 More Replies
stefano0929
by New Contributor II
  • 219 Views
  • 1 replies
  • 0 kudos

Error 301 Moved Permanently in cells of plotting

Hi, I created a workbook for academic purposes and had completed it... from one moment to the next all the plot cells of charts (and only those) started returning the following error and I really don't know how to solve it by today.Failed to store th...

  • 219 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @stefano0929, This has been fixed now. Could you please confirm?

  • 0 kudos
Bhabs
by New Contributor
  • 189 Views
  • 2 replies
  • 0 kudos

Replace one tag in a Jason file in the data bricks table .

 There is a column (src_json) in emp_table . I need to replace (ages to age )in each json in the src_json column in emp_table.Can you pls suggest the best way to do it .

  • 189 Views
  • 2 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Bhabs, Thank you for reaching out to our community! We're here to help you. To ensure we provide you with the best support, could you please take a moment to review the response and choose the one that best answers your question? Your feedback no...

  • 0 kudos
1 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels