cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

bojian_tw
by Visitor
  • 19 Views
  • 0 replies
  • 0 kudos

Delta Live Table pipeline hanging at INITIALIZING forever

I have a dlt pipeline haning at INIALIZING forever, it never stops. But I found the Analysis Exeption already happened at beginningpyspark.errors.exceptions.captured.AnalysisException: [UNRESOLVED_COLUMN.WITH_SUGGESTION] A column, variable, or functi...

Screenshot 2024-07-27 at 07.50.31.png
Data Engineering
Delta Live Table
dlt
  • 19 Views
  • 0 replies
  • 0 kudos
NickLee
by Visitor
  • 26 Views
  • 0 replies
  • 0 kudos

How to update num_workers dynamically in a job cluster

I am setting up a workflows with the UI. In the first task, a dynamic value for the next task's num_workers is calculated based on actual data size. In the subsequent task, I'd like to use this calculated num_workers to update the job cluster's defau...

NickLee_0-1722018584496.png
  • 26 Views
  • 0 replies
  • 0 kudos
Miguel_Salas
by Visitor
  • 24 Views
  • 0 replies
  • 1 kudos

Last file in S3 folder using autoloader

Nowadays we already use the autoloader with checkpoint location, but I still wanted to know if it is possible to read only the last updated file within a folder. I know it somewhat loses the purpose of checkpoint locatioAnother question is it possibl...

  • 24 Views
  • 0 replies
  • 1 kudos
YS1
by New Contributor III
  • 104 Views
  • 2 replies
  • 0 kudos

Delta Live Tables and Pivoting

Hello,I'm trying to create a DLT pipeline where I read data as a streaming dataset from a Kafka source, save it in a table, and then filter, transform, and pivot the data. However, I've encountered an issue: DLT doesn't support pivoting, and using fo...

Data Engineering
dlt
streaming
  • 104 Views
  • 2 replies
  • 0 kudos
Latest Reply
Slash
New Contributor II
  • 0 kudos

Hi @YS1 ,As a workaround you can rewrite pivot to sql with case statements.Below Pivot:data = [ ("ProductA", "North", 100), ("ProductA", "South", 150), ("ProductA", "East", 200), ("ProductA", "West", 250), ("ProductB", "North", 30...

  • 0 kudos
1 More Replies
BenDataBricks
by New Contributor II
  • 936 Views
  • 8 replies
  • 4 kudos

OAuth U2M Manual token generation failing

I am writing a frontend webpage that will log into DataBricks and allow the user to select datasets.I am new to front end development, so there may be some things I am missing here, but I know that the DataBricks SQL connector for javascript only wor...

  • 936 Views
  • 8 replies
  • 4 kudos
Latest Reply
MaheshMandlik
  • 4 kudos

@Slash Thank you for your help. Your solution has worked very well for me.

  • 4 kudos
7 More Replies
j_al
by New Contributor II
  • 3088 Views
  • 10 replies
  • 5 kudos

Jobs API 2.1 OpenAPI specification seems broken.

Jobs API 2.1 OpenAPI specification seems broken.The swagger file seems to be invalid.https://docs.databricks.com/_extras/api-refs/jobs-2.1-aws.yaml

  • 3088 Views
  • 10 replies
  • 5 kudos
Latest Reply
JeffShutt_
New Contributor II
  • 5 kudos

@Debayan Mukherjee​ , are you suggesting to revert the openapi version specified in https://docs.databricks.com/_extras/api-refs/jobs-2.1-aws.yaml from 3.1.0 to 3.0.3?

  • 5 kudos
9 More Replies
RishabhGarg
by Visitor
  • 84 Views
  • 2 replies
  • 0 kudos

Keywords and Functions supported in SQL but not in Databricks SQL.

Actually, I have around 2000 SQL queries. I have to convert them in Databricks supported SQLs, so that I can run them in databricks environment. So I want to know the list of all keywords, functions or anything that is different in databricks SQL. Pl...

  • 84 Views
  • 2 replies
  • 0 kudos
Latest Reply
Slash
New Contributor II
  • 0 kudos

Hi @RishabhGarg ,You're saying SQL, but which dialect? Because every provider has its own extension to ANSI SQL standard. So for example, if you're using SQL Server for example, there is a TOP keyword to limit the rows.

  • 0 kudos
1 More Replies
RobsonNLPT
by Contributor
  • 66 Views
  • 6 replies
  • 2 kudos

Databricks Variant Data Type

HiI've tried to enabled a table to test the new variant data type (public preview)I used the alter cmd: ALTER TABLE tablexxxx SET TBLPROPERTIES('delta.feature.variantType-preview' = 'supported')and I have the error[DELTA_UNSUPPORTED_FEATURES_IN_CONFI...

  • 66 Views
  • 6 replies
  • 2 kudos
Latest Reply
RobsonNLPT
Contributor
  • 2 kudos

Yes but I tried using 2 cluster typesSQL Serverless and New Compute Serverless.Error in both

  • 2 kudos
5 More Replies
Patricckk
by New Contributor II
  • 123 Views
  • 3 replies
  • 1 kudos

Attributed-Based Access Control

Hi,Over here they are explaining attribute-based-access-controls, which I want to implement in my project but can't find the documentation or the option to create rules myself. Is this feature already available?https://www.databricks.com/dataaisummit...

  • 123 Views
  • 3 replies
  • 1 kudos
Latest Reply
mhiltner
New Contributor III
  • 1 kudos

Expected for Q3 in preview mode

  • 1 kudos
2 More Replies
KamilK
by Visitor
  • 37 Views
  • 0 replies
  • 1 kudos

Include SPARK-46990 in databricks 15.4 LTS

Hi, could you include fix SPARK-46990 ([SPARK-46990] Regression: Unable to load empty avro files emitted by event-hubs - ASF JIRA (apache.org)) in Databricks 15.4? (15.4 is in the beta stage, so it might be a right time to include fix)

  • 37 Views
  • 0 replies
  • 1 kudos
Kayla
by Contributor III
  • 78 Views
  • 2 replies
  • 0 kudos

Resolved! Compute Policy Does Not Install Libraries

Has anyone else run into the issue where applying libraries through a compute policy just completely does not work?I'm trying to insane some pretty basic Python libraries from PyPI (pytest and paramiko, for example) and it is failing on 13.3 and 14.3...

  • 78 Views
  • 2 replies
  • 0 kudos
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Kayla, In Databricks, compute policies control various aspects of cluster behavior. When you add libraries to a policy: Users can’t install or uninstall compute-scoped libraries on compute that uses this policy.Libraries configured through the UI...

  • 0 kudos
1 More Replies
Join 100K+ Data Experts: Register Now & Grow with Us!

Excited to expand your horizons with us? Click here to Register and begin your journey to success!

Already a member? Login and join your local regional user group! If there isn’t one near you, fill out this form and we’ll create one for you to join!

Labels
Top Kudoed Authors