cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

MadelynM
by Databricks Employee
  • 1111 Views
  • 0 replies
  • 0 kudos

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights.  Keynote: Data Warehouse presente...

Screenshot 2024-07-03 at 10.15.26 AM.png
Warehousing & Analytics
AI BI Dashboards
AI BI Genie
Databricks SQL
  • 1111 Views
  • 0 replies
  • 0 kudos
VCA50380
by New Contributor III
  • 200 Views
  • 6 replies
  • 2 kudos

Efficacy of PySpark in Databricks

Hi all,- migrating from an on-premise Oracle -Currently on Oracle, I have a "library" of let's say 300 tables to load, sequentially, based on views (some tables being fed potentially by several views, therefore the number of underlying views is highe...

  • 200 Views
  • 6 replies
  • 2 kudos
Latest Reply
MariuszK
Contributor II
  • 2 kudos

The common scenario for data processing in Oracle is based on PL/SQL and cursors. In the case of PySpark we don't have such concept as cursors and iteration on data frames can lead to poor performance.I migrated Oracle to Databricks and I learned tha...

  • 2 kudos
5 More Replies
jhrcek
by New Contributor III
  • 7141 Views
  • 13 replies
  • 7 kudos

Misleading UNBOUND_SQL_PARAMETER even though parameter specified

Hello. Please forgive me if this is not the right place to ask, but I'm having issues with databricks' statement execution api. I'm developing Haskell client for this api.I managed to implement most of it, but I'm running into issues with using named...

  • 7141 Views
  • 13 replies
  • 7 kudos
Latest Reply
vivily
New Contributor II
  • 7 kudos

Not sure why but I still get the error, even with a simple select when trying to run parametrized sql in a notebook cell:This fails:%pythontable_name = "my_table"%sqlselect org_idfrom identifier(:table_name)   While this succeeds:%sqlselect org_idfro...

  • 7 kudos
12 More Replies
VCA50380
by New Contributor III
  • 264 Views
  • 6 replies
  • 4 kudos

Resolved! Write-back functionality from PowerBi into Databricks

Hello,I'm not sure if it is the correct place to post this, sorry.Migrating from an on-premise Oracle to Databricks, we are wondering about the following functionality:. From the reporting tool in place (currenly, PowerBI), users are able to send bac...

  • 264 Views
  • 6 replies
  • 4 kudos
Latest Reply
VCA50380
New Contributor III
  • 4 kudos

Hello Mantu,Thanks for your answer.This is clear, and as our colleague on PowerBI is already dealing with Power Automate, he should be able to test this.If we will be allowed to use Databricks REST API (our infra guys will tell us), I guess we will b...

  • 4 kudos
5 More Replies
larsbbb
by New Contributor III
  • 147 Views
  • 2 replies
  • 3 kudos

Unable to create serverless warehouse

We are unable to create a Serverless Warehouse Cluster at our own databricks workspace. The same settings do work on other Azure tenants that I have access to.The workspace is running in Azure on a Premium Plan in West Europe.Features enabled:Automat...

larsbbb_2-1738837572271.png larsbbb_3-1738837675322.png larsbbb_4-1738837733428.png larsbbb_5-1738837821146.png
  • 147 Views
  • 2 replies
  • 3 kudos
Latest Reply
Walter_C
Databricks Employee
  • 3 kudos

Was this workspace moved to Premium plan recently or have been Premium since creation?

  • 3 kudos
1 More Replies
HeathDG1
by New Contributor
  • 104 Views
  • 1 replies
  • 0 kudos

Row filtering based on condition not working

Hi-We have a delta table in our unity catalog called dream_team.stern_portfolio.location_info.We are trying to use row level security to filter our data based on a users group membership. This way when users look at out dashboard they can only see th...

  • 104 Views
  • 1 replies
  • 0 kudos
Latest Reply
Isi
New Contributor III
  • 0 kudos

Hey @HeathDG1 I think your function isn’t behaving the way you expect because of how the logic is set up:•If the user is in Stern MA, they only see rows where state = 'MA'  (which is good).•BUT for everyone else, the function returns true, meaning th...

  • 0 kudos
fishingrod
by New Contributor
  • 174 Views
  • 3 replies
  • 0 kudos

How to implement automatic scaling of cluster size in Serverless Warehouse

I would like to know if the cluster size of a Serverless Warehouse can automatically scale up and down, and what determines the number of workers used when executing queries. Does it use all workers within the cluster size fixedly, or does it use par...

  • 174 Views
  • 3 replies
  • 0 kudos
Latest Reply
Takuya_Omi
Valued Contributor II
  • 0 kudos

@fishingrod My understanding is that Intelligent Workload Management (IWM) in Serverless SQL Warehouses adjusts the number of clusters, but it does not automatically scale the cluster size.This means that if you need to improve the execution performa...

  • 0 kudos
2 More Replies
THIAM_HUATTAN
by Valued Contributor
  • 2207 Views
  • 3 replies
  • 0 kudos

Visualization from Python dataframe?

I notice it is very easily to get visualization from sql language inside Databricks. Say you run a SQL query which gives you a table, and you can easily use that table to do its visualization in terms of plots.How about in Python language when we hav...

  • 2207 Views
  • 3 replies
  • 0 kudos
Latest Reply
KenChase99
New Contributor
  • 0 kudos

 Yes! You can visualize a Python DataFrame in Databricks easily using: display(df)This works like SQL visualizations, offering built-in charts. For more customization, Matplotlib, Seaborn, or Plotly can be used. Would love to see even more native sup...

  • 0 kudos
2 More Replies
DataFarmer
by New Contributor II
  • 5294 Views
  • 5 replies
  • 1 kudos

Resolved! How to let Business Users edit tables in Databricks

Hi Community!I have the requirement that business users shall be able to edit/update tables in Unity Catalog, e.g. master data records, mapping tables. I also want thes actions to be logged for auditing/troubleshooting.Is there any simple solution to...

  • 5294 Views
  • 5 replies
  • 1 kudos
Latest Reply
kenwong
Databricks Employee
  • 1 kudos

We do have a few partners that offer solutions in this space (e.g. Retool).  Recently, Sigma added support for their InputTable feature which was designed for this use case: https://www.sigmacomputing.com/blog/bring-your-own-data-to-databricks-with-s...

  • 1 kudos
4 More Replies
Akshay_Petkar
by Contributor II
  • 997 Views
  • 5 replies
  • 2 kudos

How to Display Top Categories in Databricks AI/BI Dashboard?

In a Databricks AI/BI dashboard, I have a field with multiple categories (e.g., district-wise sales with 50 districts). How can I display only the top few categories (like the top 10) based on a specific metric such as sales?

  • 997 Views
  • 5 replies
  • 2 kudos
Latest Reply
Mo
Databricks Employee
  • 2 kudos

hey @migq2 , @maks  in the AI/BI dashboards in your data, add a limit parameter like:select all from my_table limit :limit_number to all your tables. when you're on canvas and adding visualizations, add a filter and create a parameter with single val...

  • 2 kudos
4 More Replies
mtreigelman
by New Contributor
  • 493 Views
  • 3 replies
  • 0 kudos

Re-Using Datasets inside the Same SQL Dashboard

Hi folks, I am creating a SQL dashboard and want to know if I can re-use datasets within the same dashboard. The screenshot below captures what I would like to do pretty well, but to summarize... I need to run an computationally expensive query and w...

databricks_dash_dataset_recycle_question.png
  • 493 Views
  • 3 replies
  • 0 kudos
Latest Reply
sonynbcu
New Contributor II
  • 0 kudos

Bumping this. I also have a use case where it would be beneficial to reference one dataset from another dataset.

  • 0 kudos
2 More Replies
Akshay_Petkar
by Contributor II
  • 105 Views
  • 1 replies
  • 0 kudos

How to get the size of selected rows in bytes using a single SQL query?

Hi all,I have a table named employee in Databricks. I ran the following query to filter out rows where the salary is greater than 25000.This query returns 10 rows. I want to find the size of these 10 rows in bytes, and I would like to calculate or re...

  • 105 Views
  • 1 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Akshay_Petkar, You can try with this query: SELECT SUM(LENGTH(CAST(employee.* AS STRING))) AS total_size_in_bytesFROM employeeWHERE salary > 25000;

  • 0 kudos
vidya_kothavale
by New Contributor III
  • 219 Views
  • 1 replies
  • 1 kudos

Resolved! Insufficient Permissions Error When Reading Data from S3 in Shared Databricks Compute

I am using a Shared Databricks Compute and trying to read data from an S3 bucket via an Instance Profile. However, I am encountering the following error: [INSUFFICIENT_PERMISSIONS] Insufficient privileges: User does not have permission SELECT on any ...

  • 219 Views
  • 1 replies
  • 1 kudos
Latest Reply
Ayushi_Suthar
Databricks Employee
  • 1 kudos

Hi @vidya_kothavale , Greetings! Can you please refer to this article and check if it helps you to resolve your issue : https://kb.databricks.com/en_US/data/user-does-not-have-permission-select-on-any-file Please note that these permissions are only ...

  • 1 kudos

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels
Top Kudoed Authors