cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

MadelynM
by Databricks Employee
  • 1819 Views
  • 0 replies
  • 0 kudos

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights.  Keynote: Data Warehouse presente...

Screenshot 2024-07-03 at 10.15.26 AM.png
Warehousing & Analytics
AI BI Dashboards
AI BI Genie
Databricks SQL
  • 1819 Views
  • 0 replies
  • 0 kudos
VCA50380
by Contributor II
  • 664 Views
  • 3 replies
  • 0 kudos

Resolved! Equivalent of Oracle's CLOB in Databricks

Dear all,(migrating for an on-premise Oracle ...)The question is in the subject: "What is the equivalent of Oracle's CLOB in Databricks" ?I saw that the "string" type can go up to 50 thousands characters, which is quite good in most of our cases, but...

  • 664 Views
  • 3 replies
  • 0 kudos
Latest Reply
VCA50380
Contributor II
  • 0 kudos

Hello;Thanks for the answer.For the concatenation itself, it is not an issue.My question is "is Databricks supporting something bigger than the 'string' data-type" ? Thanks

  • 0 kudos
2 More Replies
VCA50380
by Contributor II
  • 1824 Views
  • 10 replies
  • 3 kudos

Resolved! Efficacy of PySpark in Databricks

Hi all,- migrating from an on-premise Oracle -Currently on Oracle, I have a "library" of let's say 300 tables to load, sequentially, based on views (some tables being fed potentially by several views, therefore the number of underlying views is highe...

  • 1824 Views
  • 10 replies
  • 3 kudos
Latest Reply
MariuszK
Contributor III
  • 3 kudos

The common scenario for data processing in Oracle is based on PL/SQL and cursors. In the case of PySpark we don't have such concept as cursors and iteration on data frames can lead to poor performance.I migrated Oracle to Databricks and I learned tha...

  • 3 kudos
9 More Replies
Kirki
by New Contributor II
  • 864 Views
  • 0 replies
  • 0 kudos

MongoDB Spark Connection Issues

Hi. I have a local MongoDB running on an EC2 instance in the same AWS VPC as my Databricks cluster but cannot get Databricks to talk to MongoDB. I've followed the guide at https://docs.databricks.com/aws/en/connect/external-systems/mongodb and have a...

  • 864 Views
  • 0 replies
  • 0 kudos
vidya_kothavale
by New Contributor III
  • 1023 Views
  • 3 replies
  • 3 kudos

Resolved! Issue with MongoDB Spark Connector in Databricks

 I followed the official Databricks documentation("https://docs.databricks.com/en/_extras/notebooks/source/mongodb.html")to integrate MongoDB Atlas with Spark by setting up the MongoDB Spark Connector and configuring the connection string in my Datab...

  • 1023 Views
  • 3 replies
  • 3 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 3 kudos

Hi @vidya_kothavale ,Could you try to change "spark.mongodb.input.uri" to following?spark.read.format("mongodb").option("spark.mongodb.read.connection.uri" 

  • 3 kudos
2 More Replies
bcb44
by New Contributor
  • 185 Views
  • 1 replies
  • 0 kudos

How to write a custom datasource for shared (standard access mode)

Hello,I maintain a spark plugin with users who are moving from spark to databricks and want to use shared access mode. My library has a bunch of custom datasources and users are seeing errors when using them. Is there a way to write a custom datasour...

  • 185 Views
  • 1 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @bcb44, Thanks for your question.  Data Source V2 Relations: These are not supported in clusters that are configured with Table ACL or Credential Passthrough. Can you validate if ACL or Passthrough are enabled on the cluster? Also this should w...

  • 0 kudos
Paddy_chu
by New Contributor III
  • 225 Views
  • 1 replies
  • 1 kudos

run all below on a notebook cell isn't working

Hi All,Does anyone noticed if you "run all" the first time on the notebook, later if you click "run all below" on a cell, that wouldn't work anymore, and require to click "run all" again.It doesn't happen to me about couple of weeks ago, I used to ru...

  • 225 Views
  • 1 replies
  • 1 kudos
Latest Reply
Advika_
Databricks Employee
  • 1 kudos

Hello @Paddy_chu! This doesn't happen with every notebook, but it's likely due to cell dependencies. When you Run All, the final cell might modify or clear the data. Later, if you use Run All Below from a middle cell, it may not work if the required ...

  • 1 kudos
yuvala
by New Contributor II
  • 294 Views
  • 2 replies
  • 0 kudos

Discrepancies between query and dashboard results

Hey all,I have encountered a strange issue while validating my dashboard visual against the source table.The visual shows the count(distinct VAL) per day and when I ran a query that does the same calculation I get a difference of 84 (on average) and ...

query active ads.png dashboard active ads.png
  • 294 Views
  • 2 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @yuvala, Are you using same query for the dashboard? could you please share it?

  • 0 kudos
1 More Replies
patilsuhasv
by New Contributor
  • 646 Views
  • 0 replies
  • 0 kudos

Dela Table and history

Hi All,How can I maintain 7 years of transactional data in delta table? Can I have log retention of 7 days, but data retention of 7 years?Appreciate your response.Thanks and regards Suhas

  • 646 Views
  • 0 replies
  • 0 kudos
ZoraidaHS
by New Contributor
  • 746 Views
  • 0 replies
  • 0 kudos

Disjunction on static filters for a Widget

I would like to be able to express something like: WHERE columnA = "valueA" OR columnB = "valueB" but on the Widget. I only see the possibility that chaining filters that are processed as AND operator. Am I missing something?

  • 746 Views
  • 0 replies
  • 0 kudos
VCA50380
by Contributor II
  • 1211 Views
  • 6 replies
  • 4 kudos

Resolved! Write-back functionality from PowerBi into Databricks

Hello,I'm not sure if it is the correct place to post this, sorry.Migrating from an on-premise Oracle to Databricks, we are wondering about the following functionality:. From the reporting tool in place (currenly, PowerBI), users are able to send bac...

  • 1211 Views
  • 6 replies
  • 4 kudos
Latest Reply
VCA50380
Contributor II
  • 4 kudos

Hello Mantu,Thanks for your answer.This is clear, and as our colleague on PowerBI is already dealing with Power Automate, he should be able to test this.If we will be allowed to use Databricks REST API (our infra guys will tell us), I guess we will b...

  • 4 kudos
5 More Replies
larsbbb
by New Contributor III
  • 344 Views
  • 2 replies
  • 3 kudos

Unable to create serverless warehouse

We are unable to create a Serverless Warehouse Cluster at our own databricks workspace. The same settings do work on other Azure tenants that I have access to.The workspace is running in Azure on a Premium Plan in West Europe.Features enabled:Automat...

larsbbb_2-1738837572271.png larsbbb_3-1738837675322.png larsbbb_4-1738837733428.png larsbbb_5-1738837821146.png
  • 344 Views
  • 2 replies
  • 3 kudos
Latest Reply
Walter_C
Databricks Employee
  • 3 kudos

Was this workspace moved to Premium plan recently or have been Premium since creation?

  • 3 kudos
1 More Replies
HeathDG1
by New Contributor II
  • 385 Views
  • 1 replies
  • 0 kudos

Row filtering based on condition not working

Hi-We have a delta table in our unity catalog called dream_team.stern_portfolio.location_info.We are trying to use row level security to filter our data based on a users group membership. This way when users look at out dashboard they can only see th...

  • 385 Views
  • 1 replies
  • 0 kudos
Latest Reply
Isi
Contributor III
  • 0 kudos

Hey @HeathDG1 I think your function isn’t behaving the way you expect because of how the logic is set up:•If the user is in Stern MA, they only see rows where state = 'MA'  (which is good).•BUT for everyone else, the function returns true, meaning th...

  • 0 kudos
fishingrod
by New Contributor II
  • 470 Views
  • 3 replies
  • 0 kudos

How to implement automatic scaling of cluster size in Serverless Warehouse

I would like to know if the cluster size of a Serverless Warehouse can automatically scale up and down, and what determines the number of workers used when executing queries. Does it use all workers within the cluster size fixedly, or does it use par...

  • 470 Views
  • 3 replies
  • 0 kudos
Latest Reply
Takuya-Omi
Valued Contributor III
  • 0 kudos

@fishingrod My understanding is that Intelligent Workload Management (IWM) in Serverless SQL Warehouses adjusts the number of clusters, but it does not automatically scale the cluster size.This means that if you need to improve the execution performa...

  • 0 kudos
2 More Replies