Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.

Forum Posts

MadelynM
by Databricks Employee
  • 3051 Views
  • 0 replies
  • 0 kudos

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights.  Keynote: Data Warehouse presente...

Warehousing & Analytics
AI BI Dashboards
AI BI Genie
Databricks SQL
tarunnagar
by New Contributor II
  • 51 Views
  • 2 replies
  • 1 kudos

Tips for Streamlining Spark Job Development and Debugging in Databricks

Hi everyone, I’m looking to improve the efficiency of developing and debugging Spark jobs within Databricks and wanted to get insights from the community. Spark is incredibly powerful, but as projects grow in complexity, it can become challenging to m...

Latest Reply
KamalDeepPareek
  • 1 kudos

Use modular, parameterized code with reusable functions and notebooks for faster development. Separate environments for dev, test, and prod ensure stability. Leverage Databricks’ Job clusters, Delta Live Tables, and Auto Loader for efficiency. Enable ...

1 More Replies
mausch
by New Contributor
  • 3140 Views
  • 1 replies
  • 0 kudos

CalledProcessError when running dbt

I've been trying to run a dbt project (sourced in Azure DevOps) in Databricks Workflows, but I get this error message:  CalledProcessError: Command 'b'\nmkdir -p "/tmp/tmp-dbt-run-1124228490001263"\nunexpected_errors="$(cp -a -u "/Workspace/Repos/.in...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

The error you encountered when running your dbt project in Databricks Workflows comes from Databricks trying to copy the entire repository, including the virtual environment (venv) folder and its cached bytecode files (__pycache__), into a temporary ...
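The cause described above (a virtual environment folder and `__pycache__` directories sitting inside the repository) can be checked locally before a run. The following is only a minimal sketch: the function name `find_env_dirs` and the set of folder names are assumptions for illustration, not part of dbt or Databricks.

```python
import os

# Folder names treated as environment/cache directories.
# This set is an assumption for the sketch; adjust it to your project.
ENV_DIR_NAMES = {"venv", ".venv", "__pycache__"}


def find_env_dirs(root):
    """Return sorted paths (relative to root) of directories named in ENV_DIR_NAMES."""
    matches = []
    for current, dirnames, _filenames in os.walk(root):
        # Iterate over a copy so matched folders can be pruned from the walk.
        for name in list(dirnames):
            if name in ENV_DIR_NAMES:
                matches.append(os.path.relpath(os.path.join(current, name), root))
                dirnames.remove(name)  # do not descend into a matched folder
    return sorted(matches)
```

Calling `find_env_dirs(".")` from the root of a checkout lists the folders that would travel along with the project if the whole repository is copied.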

rajanator
by New Contributor
  • 95 Views
  • 1 replies
  • 1 kudos

Intermittent 400 Error with Power BI Desktop - ODBC Connection to SQL Warehouse

Hi all, I'm experiencing an intermittent connection issue between Power BI Desktop and our Azure Databricks SQL Warehouse and am looking for help troubleshooting. Error Message: ODBC: ERROR [HY000] [Microsoft][ThriftExtension] (14) Unexpected response from ...

Latest Reply
mark_ott
Databricks Employee
  • 1 kudos

The intermittent ODBC error you’re seeing in Power BI when connecting to Azure Databricks is a recognized issue related to SSL validation interruptions or proxy interference in the Simba ThriftExtension layer. The behavior—random occurrences, tempora...

Bakkie
by New Contributor III
  • 3770 Views
  • 3 replies
  • 4 kudos

Resolved! Databricks Apps based on Streamlit could not find a valid JAVA_HOME installation

We are launching our first Databricks Apps based on Streamlit. The app works when simply running the notebook in our workspace, but fails after deployment due to "could not find a valid JAVA_HOME installation" when running in the system environment. We...

Latest Reply
NandiniN
Databricks Employee
  • 4 kudos

Databricks Apps (which use a lightweight, container-based runtime) do not automatically include a JVM. The best approach is to use the databricks package to avoid dependency issues.

2 More Replies
playnicekids
by New Contributor
  • 302 Views
  • 2 replies
  • 1 kudos

Resolved! Metric Views

Hi, I think I’ve found a reproducible bug, or am misunderstanding some syntax or capabilities of Metric Views, when joining a calendar scaffold to an SCD2 table. The same SQL query works perfectly, but the Metric View always returns a constant 1 per mont...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hey @playnicekids, I did some digging and have come up with some helpful hints/tips to get you past your issue: This behavior is due to how metric view joins are defined and executed. Diagnosis: The join in your metric view is a many-to-many te...
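For readers unfamiliar with the pattern in the question, here is a rough sketch of what a calendar scaffold joined to an SCD2 table is expected to compute: for each month, the number of row versions whose validity window covers that month. It is plain Python rather than Metric View or SQL syntax, and the column names `valid_from` and `valid_to` are assumptions, not taken from the thread.

```python
from datetime import date


def month_starts(start, end):
    """Yield the first day of each month from start to end, inclusive."""
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        yield date(year, month, 1)
        month += 1
        if month > 12:
            year, month = year + 1, 1


def active_per_month(scd2_rows, start, end):
    """Count SCD2 versions whose validity window covers each month start.

    A row is active when valid_from <= month_start < valid_to;
    valid_to of None marks the current (open-ended) version.
    """
    counts = {}
    for m in month_starts(start, end):
        counts[m] = sum(
            1
            for row in scd2_rows
            if row["valid_from"] <= m
            and (row["valid_to"] is None or m < row["valid_to"])
        )
    return counts
```

Because each calendar month can match several versions and each version can match several months, the join is many-to-many, so the per-month count should vary with the data rather than stay constant.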

1 More Replies
Giktator
by New Contributor II
  • 1313 Views
  • 4 replies
  • 0 kudos

Error FAILED_READ_FILE.NO_HINT When Reading File from R2 Storage

Hi there, I'm encountering the following error while attempting to read a file from R2 storage: [FAILED_READ_FILE.NO_HINT] Error while reading file r2:REDACTED_LOCAL_PART@user_id.r2.cloudflarestorage.com/data/20250128_160228_54805_wpkza_e38751cf-969e-...

Latest Reply
NandiniN
Databricks Employee
  • 0 kudos

But the error is from AWS and is seen when the payload size is incorrectly defined in the contentLength parameter. Caused by: com.amazonaws.SdkClientException: Data read has a different length than the expected: This sounds like a bug. Was this resolved...

3 More Replies
CEH
by New Contributor II
  • 371 Views
  • 4 replies
  • 1 kudos

Union of tiny dataframes exhausts resource, memory error

As part of a function, I create df1 and df2 and aim to stack them and output the results. But the results do not display within the function, nor if I output the results and display them afterwards. results = df1.unionByName(df2, allowMissingColumns=False) displ...

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hey @CEH, What you’re running into looks like a Spark Connect gRPC message-size limit, not a computational failure with the union itself. Even with smallish row counts, the serialized payload (either the inlined query plan or Arrow batch results) can...

3 More Replies
loic
by Contributor
  • 963 Views
  • 2 replies
  • 1 kudos

Resolved! Databricks workspace default catalog not working anymore with JDBC driver

Hello, We recently detected an issue in our product deployment with Terraform. At some point, we have some Java code that creates a schema in the "hive_metastore" catalog. Since the "hive_metastore" catalog is the default one, there should not be any need to sp...

Latest Reply
loic
Contributor
  • 1 kudos

The exact error reported by Databricks is: [RequestId=f27975cd-7589-4463-8c03-6015893ee133 ErrorClass=INVALID_PARAMETER_VALUE] Invalid input: RPC CreateSchema Field managedcatalog.SchemaInfo.catalog_name: name "" is not a valid name

1 More Replies
meljung
by New Contributor II
  • 883 Views
  • 1 replies
  • 0 kudos

Resolved! Moving average calculation in Databricks AI/BI dashboard

So, I can't figure out how to do a moving average as a custom calculation in a Databricks dashboard. I'm applying many different filters, and the denominator of the metric has to change dynamically based on the chosen filters. So, in this case using `Custom...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

Currently, Databricks dashboards do not support applying a moving average “custom calculation” on top of another custom metric that itself is dynamic with respect to the filters. Workarounds: Segmented SQL Datasets: Pre-compute the filtered sets (as ...
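To make the requirement concrete, the sketch below shows the calculation the question seems to be after: a trailing moving ratio whose numerator and denominator are both re-aggregated after a filter is applied. It is plain Python, not dashboard or SQL syntax, and the field names (`period`, `segment`, `num`, `den`) and the function name `moving_ratio` are hypothetical.

```python
def moving_ratio(rows, window, segment=None):
    """Trailing moving ratio sum(num) / sum(den) over `window` periods.

    If `segment` is given, only rows of that segment are aggregated,
    so the denominator changes with the filter.
    """
    # Aggregate numerator and denominator per period after filtering.
    totals = {}
    for row in rows:
        if segment is not None and row["segment"] != segment:
            continue
        num, den = totals.get(row["period"], (0.0, 0.0))
        totals[row["period"]] = (num + row["num"], den + row["den"])

    periods = sorted(totals)
    result = {}
    for i, period in enumerate(periods):
        # Trailing window: the current period plus up to window - 1 earlier ones.
        span = periods[max(0, i - window + 1): i + 1]
        num = sum(totals[p][0] for p in span)
        den = sum(totals[p][1] for p in span)
        result[period] = num / den if den else None
    return result
```

Pre-computing in a dataset amounts to running this kind of aggregation once per filter combination, which is why it only scales to a modest number of segments.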

der
by Contributor
  • 585 Views
  • 4 replies
  • 6 kudos

Resolved! Dashboard choropleth map with geometry

We are currently building our dashboards in Apache Superset. With the Git integration in Databricks AI/BI Dashboards, the development process has improved a lot. So we are thinking about switching to Databricks AI/BI Dashboards. One pain point in Databric...

Latest Reply
NandiniN
Databricks Employee
  • 6 kudos

Closing the loop here. Update: I have updated the PMs, and they are aware of the use case and this request (it will help hasten prioritization). Thanks!

3 More Replies
leo-machado
by New Contributor III
  • 4606 Views
  • 10 replies
  • 5 kudos

(Big) Problem with SQL Warehouse Auto stop

Long story short, I'm not sure if this is an already known problem, but the Auto Stop feature on SQL Warehouses after minutes of inactivity is not working properly. We started using SQL Warehouses more aggressively this December when we scaled up one ...

Latest Reply
HNguyen
New Contributor II
  • 5 kudos

This is a good catch. Auto termination is something you tend to set and trust to do the right thing. Wondering if the Databricks team managed to fix this, seeing it has been half a year since the problem was raised. This is also important for ou...

9 More Replies
mrp
by New Contributor
  • 987 Views
  • 3 replies
  • 2 kudos

Resolved! Lakebase Scale-to-Zero Behavior: Automatic or Application-Controlled?

Hi all, Lakebase is currently advertised as a database system that can scale down to zero: https://www.databricks.com/blog/what-is-a-lakebase Does anyone know if this scale-to-zero behavior is handled automatically by Databricks when the database is idl...

Latest Reply
Louis_Frolio
Databricks Employee
  • 2 kudos

Hey @mrp, Some features of Lakebase are still in public preview, so not all functionality is available yet. @ilorus is correct that “scale to zero” is not currently part of the product. However, it is on the roadmap and should be available early nex...

2 More Replies
yshah
by New Contributor II
  • 651 Views
  • 3 replies
  • 2 kudos

Resolved! Behaviour of ANALYZE command varying when using different clusters and table types.

Certain tables have this configuration enabled, whereas others do not: delta.checkpointPolicy=v2. This affects the behavior of the ANALYZE command. If the flag is enabled: table stats are not visible after running the DESCRIBE command using SINGLE...

Latest Reply
Louis_Frolio
Databricks Employee
  • 2 kudos

Greetings @yshah, here are some helpful hints/tips/tricks to guide you. To access table column statistics when checkpoint V2 is enabled, you can follow these guidelines: Utilize Databricks Runtime 13.3 LTS or Higher: Ensure that you are using Datab...

2 More Replies
BillSundwall
by New Contributor II
  • 2616 Views
  • 4 replies
  • 3 kudos

Resolved! Maps in AI/BI Dashboards?

Is there any official word as to when we can expect Choropleth or Marker Map visuals in AI/BI Dashboards? I realize Legacy Dashboards are still supported, but it feels uncertain to build new ones with AI/BI Dashboards in GA.

Latest Reply
NandiniN
Databricks Employee
  • 3 kudos

Choropleth Maps - https://docs.databricks.com/aws/en/dashboards/visualizations/maps#choropleth-options

3 More Replies
Shivaprasad
by New Contributor III
  • 323 Views
  • 1 replies
  • 1 kudos

Commentary functionality through Databricks

On-prem, we currently create dashboards that provide year-over-year or quarter-over-quarter comparisons. When the variance exceeds a certain threshold for a particular data intersection, the business has the option to add comments, also so...

Latest Reply
nayan_wylde
Honored Contributor III
  • 1 kudos

There is no built-in “commentary workflow” feature like your Java app, but you can build a custom one. Here are the steps: 1. Use Databricks SQL Dashboards or Lakeview Dashboards for your YoY/QoQ variance analysis. 2. Create a D...
