cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

MadelynM
by Databricks Employee
  • 3129 Views
  • 0 replies
  • 0 kudos

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights.  Keynote: Data Warehouse presente...

Screenshot 2024-07-03 at 10.15.26 AM.png
Warehousing & Analytics
AI BI Dashboards
AI BI Genie
Databricks SQL
  • 3129 Views
  • 0 replies
  • 0 kudos
Torch3333
by New Contributor II
  • 1363 Views
  • 1 replies
  • 1 kudos

Resolved! Linearizability on Delta Lake table

Hi all, Does Delta Lake table guarantee linearizability for the following operations on a single record:- SELECT- UPDATE and DELETE with condition (WHERE clauses) If linearizability is not guaranteed, what consistency model does Delta Lake provide fo...

  • 1363 Views
  • 1 replies
  • 1 kudos
Latest Reply
Advika
Databricks Employee
  • 1 kudos

Hello @Torch3333! Delta Lake does not guarantee linearizability for single record operations. In Delta Lake, isolation levels ensure consistency guarantees in transactions. By default, SELECT operations follow snapshot isolation, ensuring that reads ...

  • 1 kudos
edejong1980
by New Contributor III
  • 3433 Views
  • 1 replies
  • 1 kudos

Resolved! Databricks SQL Warehouse does not scale down to 0

Hey Support Team,We are experiencing an issue with our Databricks warehouse where the auto-stop feature does not seem to be working as expected. Despite setting an idle timeout, the warehouse continues running after the configured auto-stop time has ...

  • 3433 Views
  • 1 replies
  • 1 kudos
Latest Reply
edejong1980
New Contributor III
  • 1 kudos

We were able to diagnose and resolve the problem. The problem was caused due to a cube.js JDBC connection repeatedly connecting to our Databricks SQL Warehouse. Databricks SQL Warehouse does not scale down when there are repeated new connections made...

  • 1 kudos
sunnypol
by New Contributor
  • 3707 Views
  • 1 replies
  • 0 kudos

SQL Warehouse: INVALID_PARAMETER_VALUE when starting

Hi Team,Suddenly our SQL warehouse stopped running, can you please help us how can we check why it stopped.  

  • 3707 Views
  • 1 replies
  • 0 kudos
Latest Reply
Ismael-K
Databricks Employee
  • 0 kudos

A similar error was reported, please check this thread to see if it helps resolve the issue.

  • 0 kudos
Akshay_Petkar
by Valued Contributor
  • 1470 Views
  • 2 replies
  • 2 kudos

Resolved! Row-Level Security Not Working in Published Databricks AI/BI Dashboard

I have applied row-level security (RLS) on the department column in the table so that users can only see data related to their own department. The security policy works perfectly when I query the table in Databricks SQL.Now, I have built a Databricks...

  • 1470 Views
  • 2 replies
  • 2 kudos
Latest Reply
koji_kawamura
Databricks Employee
  • 2 kudos

Hi @Akshay_Petkar , which credential mode did you use to publish the dashboard? In order to make access control work based on the viewer's credential, "Don't embed credentials" mode should be used. If you published the dashboard with "Embed credentia...

  • 2 kudos
1 More Replies
VCA50380
by Contributor II
  • 1725 Views
  • 3 replies
  • 0 kudos

Resolved! Equivalent of Oracle's CLOB in Databricks

Dear all,(migrating for an on-premise Oracle ...)The question is in the subject: "What is the equivalent of Oracle's CLOB in Databricks" ?I saw that the "string" type can go up to 50 thousands characters, which is quite good in most of our cases, but...

  • 1725 Views
  • 3 replies
  • 0 kudos
Latest Reply
VCA50380
Contributor II
  • 0 kudos

Hello;Thanks for the answer.For the concatenation itself, it is not an issue.My question is "is Databricks supporting something bigger than the 'string' data-type" ? Thanks

  • 0 kudos
2 More Replies
VCA50380
by Contributor II
  • 3821 Views
  • 10 replies
  • 3 kudos

Resolved! Efficacy of PySpark in Databricks

Hi all,- migrating from an on-premise Oracle -Currently on Oracle, I have a "library" of let's say 300 tables to load, sequentially, based on views (some tables being fed potentially by several views, therefore the number of underlying views is highe...

  • 3821 Views
  • 10 replies
  • 3 kudos
Latest Reply
MariuszK
Valued Contributor III
  • 3 kudos

The common scenario for data processing in Oracle is based on PL/SQL and cursors. In the case of PySpark we don't have such concept as cursors and iteration on data frames can lead to poor performance.I migrated Oracle to Databricks and I learned tha...

  • 3 kudos
9 More Replies
bcb44
by New Contributor
  • 501 Views
  • 1 replies
  • 0 kudos

How to write a custom datasource for shared (standard access mode)

Hello,I maintain a spark plugin with users who are moving from spark to databricks and want to use shared access mode. My library has a bunch of custom datasources and users are seeing errors when using them. Is there a way to write a custom datasour...

  • 501 Views
  • 1 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @bcb44, Thanks for your question.  Data Source V2 Relations: These are not supported in clusters that are configured with Table ACL or Credential Passthrough. Can you validate if ACL or Passthrough are enabled on the cluster? Also this should w...

  • 0 kudos
Paddy_chu
by New Contributor III
  • 595 Views
  • 1 replies
  • 1 kudos

run all below on a notebook cell isn't working

Hi All,Does anyone noticed if you "run all" the first time on the notebook, later if you click "run all below" on a cell, that wouldn't work anymore, and require to click "run all" again.It doesn't happen to me about couple of weeks ago, I used to ru...

  • 595 Views
  • 1 replies
  • 1 kudos
Latest Reply
Advika_
Databricks Employee
  • 1 kudos

Hello @Paddy_chu! This doesn't happen with every notebook, but it's likely due to cell dependencies. When you Run All, the final cell might modify or clear the data. Later, if you use Run All Below from a middle cell, it may not work if the required ...

  • 1 kudos
yuvala
by New Contributor II
  • 636 Views
  • 2 replies
  • 0 kudos

Discrepancies between query and dashboard results

Hey all,I have encountered a strange issue while validating my dashboard visual against the source table.The visual shows the count(distinct VAL) per day and when I ran a query that does the same calculation I get a difference of 84 (on average) and ...

query active ads.png dashboard active ads.png
  • 636 Views
  • 2 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @yuvala, Are you using same query for the dashboard? could you please share it?

  • 0 kudos
1 More Replies
VCA50380
by Contributor II
  • 3783 Views
  • 6 replies
  • 4 kudos

Resolved! Write-back functionality from PowerBi into Databricks

Hello,I'm not sure if it is the correct place to post this, sorry.Migrating from an on-premise Oracle to Databricks, we are wondering about the following functionality:. From the reporting tool in place (currenly, PowerBI), users are able to send bac...

  • 3783 Views
  • 6 replies
  • 4 kudos
Latest Reply
VCA50380
Contributor II
  • 4 kudos

Hello Mantu,Thanks for your answer.This is clear, and as our colleague on PowerBI is already dealing with Power Automate, he should be able to test this.If we will be allowed to use Databricks REST API (our infra guys will tell us), I guess we will b...

  • 4 kudos
5 More Replies
larsbbb
by New Contributor III
  • 805 Views
  • 2 replies
  • 3 kudos

Unable to create serverless warehouse

We are unable to create a Serverless Warehouse Cluster at our own databricks workspace. The same settings do work on other Azure tenants that I have access to.The workspace is running in Azure on a Premium Plan in West Europe.Features enabled:Automat...

larsbbb_2-1738837572271.png larsbbb_3-1738837675322.png larsbbb_4-1738837733428.png larsbbb_5-1738837821146.png
  • 805 Views
  • 2 replies
  • 3 kudos
Latest Reply
Walter_C
Databricks Employee
  • 3 kudos

Was this workspace moved to Premium plan recently or have been Premium since creation?

  • 3 kudos
1 More Replies
HeathDG1
by New Contributor II
  • 940 Views
  • 1 replies
  • 0 kudos

Row filtering based on condition not working

Hi-We have a delta table in our unity catalog called dream_team.stern_portfolio.location_info.We are trying to use row level security to filter our data based on a users group membership. This way when users look at out dashboard they can only see th...

  • 940 Views
  • 1 replies
  • 0 kudos
Latest Reply
Isi
Honored Contributor III
  • 0 kudos

Hey @HeathDG1 I think your function isn’t behaving the way you expect because of how the logic is set up:•If the user is in Stern MA, they only see rows where state = 'MA'  (which is good).•BUT for everyone else, the function returns true, meaning th...

  • 0 kudos
fishingrod
by New Contributor II
  • 1566 Views
  • 3 replies
  • 0 kudos

How to implement automatic scaling of cluster size in Serverless Warehouse

I would like to know if the cluster size of a Serverless Warehouse can automatically scale up and down, and what determines the number of workers used when executing queries. Does it use all workers within the cluster size fixedly, or does it use par...

  • 1566 Views
  • 3 replies
  • 0 kudos
Latest Reply
Takuya-Omi
Valued Contributor III
  • 0 kudos

@fishingrod My understanding is that Intelligent Workload Management (IWM) in Serverless SQL Warehouses adjusts the number of clusters, but it does not automatically scale the cluster size.This means that if you need to improve the execution performa...

  • 0 kudos
2 More Replies
THIAM_HUATTAN
by Valued Contributor
  • 3161 Views
  • 3 replies
  • 0 kudos

Visualization from Python dataframe?

I notice it is very easily to get visualization from sql language inside Databricks. Say you run a SQL query which gives you a table, and you can easily use that table to do its visualization in terms of plots.   How about in Python language when we ...

  • 3161 Views
  • 3 replies
  • 0 kudos
Latest Reply
KenChase99
New Contributor II
  • 0 kudos

 Yes! You can visualize a Python DataFrame in Databricks easily using: display(df)This works like SQL visualizations, offering built-in charts. For more customization, Matplotlib, Seaborn, or Plotly can be used. Would love to see even more native sup...

  • 0 kudos
2 More Replies
Akshay_Petkar
by Valued Contributor
  • 1119 Views
  • 1 replies
  • 0 kudos

How to get the size of selected rows in bytes using a single SQL query?

Hi all,I have a table named employee in Databricks. I ran the following query to filter out rows where the salary is greater than 25000.This query returns 10 rows. I want to find the size of these 10 rows in bytes, and I would like to calculate or re...

  • 1119 Views
  • 1 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Akshay_Petkar, You can try with this query: SELECT SUM(LENGTH(CAST(employee.* AS STRING))) AS total_size_in_bytesFROM employeeWHERE salary > 25000;

  • 0 kudos