Databricks Platform Discussions
Dive into comprehensive discussions covering various aspects of the Databricks platform. Join the conversation to deepen your understanding and maximize your usage of the Databricks platform.

Browse the Community

Data Engineering

Join discussions on data engineering best practices, architectures, and optimization strategies with...

9760 Posts

Data Governance

Join discussions on data governance practices, compliance, and security within the Databricks Commun...

427 Posts

Generative AI

Explore discussions on generative artificial intelligence techniques and applications within the Dat...

117 Posts

Machine Learning

Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithm...

865 Posts

Warehousing & Analytics

Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Communi...

521 Posts

Databricks Free Trial Help

Engage in discussions about the Databricks Free Trial within the Databricks Community. Share insight...

30 Posts

Activity in Databricks Platform Discussions

Deloitte_DS
New Contributor II
  • 3823 Views
  • 5 replies
  • 0 kudos

Resolved! Unable to install poppler-utils

Hi, I'm trying to install the system-level package "poppler-utils" on the cluster. I added the following line to the init.sh script: `sudo apt-get -f -y install poppler-utils`. I got the following error: PDFInfoNotInstalledError: Unable to get page count. Is ...

Latest Reply
Raghavan93513
Databricks Employee
  • 0 kudos

Hi Team, if you use a single-user cluster with the init script below, it will work:
sudo rm -r /var/lib/apt/lists/*
sudo apt clean && sudo apt update --fix-missing -y
sudo apt-get install poppler-utils tesseract-ocr -y
But if you are using a shared...

4 More Replies
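
The `PDFInfoNotInstalledError` quoted in this thread is raised by the pdf2image package when it cannot find poppler's `pdfinfo` binary on the PATH. A minimal sketch for checking from a notebook cell whether the init script actually installed poppler-utils on the node running the cell; the helper name `poppler_available` is hypothetical, and the check says nothing about other nodes in the cluster:

```python
import shutil


def poppler_available() -> bool:
    """Return True if poppler's `pdfinfo` binary is on this node's PATH."""
    # poppler-utils ships the `pdfinfo` executable; shutil.which returns
    # None when no such executable can be found on the PATH.
    return shutil.which("pdfinfo") is not None


if __name__ == "__main__":
    print("pdfinfo found:", poppler_available())
```

If this prints `False` after the cluster starts, the init script most likely did not complete the installation on that node.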
Meenambigai
Visitor
  • 5 Views
  • 0 replies
  • 0 kudos

Link for webinar Get Started with Databricks for Data Engineering session

Where can I find the link for the "Get Started with Databricks for Data Engineering" webinar session?

Daithi
New Contributor II
  • 18111 Views
  • 16 replies
  • 3 kudos

Unity Catalog - Error getting sample data in data explorer

I get an error message saying "Error getting sample data" when I try to view sample data from a table in a schema I created in a Unity Catalog. I dropped the schema and table and got a colleague to recreate them, and still got the same message. We are both Uni...

image.png
Latest Reply
AnnexPetal
Visitor
  • 3 kudos

It sounds like a common issue with access control that can arise in database management systems or applications with user permissions. Checking table-level ACLs (Access Control Lists) is a great first step, as it ensures that the user has the proper ...

15 More Replies
nhakobian
Visitor
  • 15 Views
  • 0 replies
  • 0 kudos

Python Artifact Installation Error on Runtime 16.1 on Shared Clusters

I've run into an issue with no clear path to resolution. Due to various integrations we have in Unity Catalog, we have to run some jobs in a Shared Cluster environment in order to authenticate properly to the underlying data resource. When setting up ...

Harikrish
New Contributor
  • 366 Views
  • 0 replies
  • 0 kudos

Data Governance

If I grant all privileges on my schema, does that automatically give users access to all underlying objects? Or should I grant access separately for each object?

VCA50380
New Contributor II
  • 19 Views
  • 2 replies
  • 0 kudos

Equivalent of Oracle's CLOB in Databricks

Dear all, (migrating from an on-premises Oracle ...) The question is in the subject: "What is the equivalent of Oracle's CLOB in Databricks?" I saw that the "string" type can go up to 50 thousand characters, which is quite good in most of our cases, but...

Latest Reply
VCA50380
New Contributor II
  • 0 kudos

Hello, thanks for the answer. The concatenation itself is not an issue. My question is: does Databricks support anything bigger than the 'string' data type? Thanks

1 More Replies
subhas_hati
Visitor
  • 51 Views
  • 0 replies
  • 0 kudos

Partition Size:

Hi, I have chosen the default partition size of 128 MB. I am reading a 3.8 GB file and checking the size of the partitions using df.rdd.getNumPartitions() as given below. I find the partition size is 159 MB. Why does the partition size differ after reading the file? ...
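
Note that `df.rdd.getNumPartitions()` returns the number of partitions rather than their size, so any size figure has to be derived from it. A rough sketch of that arithmetic, assuming the total input size in bytes is known; `average_partition_mb` is a hypothetical helper and reports only the average, not the size of any individual partition:

```python
def average_partition_mb(total_bytes: int, num_partitions: int) -> float:
    """Average partition size in MiB for a given input size and partition count."""
    if num_partitions <= 0:
        raise ValueError("num_partitions must be positive")
    # Bytes per partition, converted to MiB (1 MiB = 1024 * 1024 bytes).
    return total_bytes / num_partitions / (1024 * 1024)


if __name__ == "__main__":
    # Example values only: a 256 MiB input read into 2 partitions.
    print(average_partition_mb(256 * 1024 * 1024, 2))  # 128.0
```

In a notebook, `num_partitions` would come from `df.rdd.getNumPartitions()` and `total_bytes` from the size of the file being read.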

EWhitley
New Contributor III
  • 4305 Views
  • 3 replies
  • 3 kudos

Resolved! Retrieve task name within workflow task (notebook, python)?

Using workflows, is there a way to obtain the task name from within a task? Example: I have a workflow with a notebook task. From within that notebook task I would like to retrieve the task name so I can use it for a variety of purposes. Currently, we're re...

Latest Reply
ttamas
New Contributor III
  • 3 kudos

Hi @EWhitley, would {{task.name}} help in getting the current task name? https://docs.databricks.com/en/workflows/jobs/parameter-value-references.html Pass context about job runs into job t

2 More Replies
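
One way to use the `{{task.name}}` reference suggested in this reply is to pass it as a task parameter and read it inside the notebook. A minimal sketch, assuming the notebook task defines a parameter under the hypothetical key `task_name` whose value is `{{task.name}}`; the hypothetical helper `read_task_name` takes the widget getter as an argument so it can be exercised outside Databricks:

```python
from typing import Callable


def read_task_name(get_widget: Callable[[str], str], key: str = "task_name") -> str:
    """Read the task name that the job passed in as a task parameter."""
    value = get_widget(key)
    # If the reference was not substituted, the raw "{{...}}" text comes through.
    if value.startswith("{{") and value.endswith("}}"):
        raise ValueError(f"parameter {key!r} was not resolved: {value}")
    return value


if __name__ == "__main__":
    # Stand-in for notebook widgets, for illustration only.
    fake_widgets = {"task_name": "ingest_orders"}
    print(read_task_name(fake_widgets.__getitem__))  # ingest_orders
```

Inside a Databricks notebook the call would be `read_task_name(dbutils.widgets.get)`.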
mangosta
Visitor
  • 20 Views
  • 0 replies
  • 0 kudos

Query text truncated for queries longer than 153,596 characters

Hi, when using the `query_history.list` function of the Python SDK workspace client, queries longer than 153,596 characters are truncated. I could not find this limit anywhere in the documentation, so I wanted to know if this is documented s...

sensanjoy
Contributor
  • 60 Views
  • 5 replies
  • 0 kudos

Performance issue when reading data from a view.

Hi All, we are facing a performance issue and I need your help to find the best approach here. Existing: for each region, we have a view (Reg1_View, Reg2_View, ...) to pull data from a table (we don't have direct access to the table). And ...

View_Vs_Staging_Table.png
Latest Reply
SharathAbh93
New Contributor II
  • 0 kudos

Does any table hold data for all regions?
1. If yes, get a materialized view created (replacing all_reg_view).
2. I see you already tried creating a staging table to replace all_reg_view. Try creating a cluster key along with the partition. Cluster key on the...

4 More Replies
Brad
Contributor II
  • 59 Views
  • 4 replies
  • 0 kudos

How to add shared libs

Hi team, I want to add some shared libs that might be used by many repos, e.g. some util functions that any repo might use.
1. What is the recommended way to add those libs? E.g. create a separate repo and reference it in another repo?
2. How ...

Latest Reply
notauser
Visitor
  • 0 kudos

2

3 More Replies
vvk
New Contributor II
  • 1908 Views
  • 2 replies
  • 0 kudos

Unable to upload a wheel file in Azure DevOps pipeline

Hi, I am trying to upload a wheel file to a Databricks workspace using an Azure DevOps release pipeline so that I can use it on an interactive cluster. I tried the "databricks workspace import" command, but it looks like it does not support .whl files. Hence, I tried to u...

Latest Reply
Satyadeepak
Databricks Employee
  • 0 kudos

Hi @vvk - The HTTP 403 error typically indicates a permissions issue. Ensure that the SP has the necessary permissions to perform the fs cp operation on the specified path. Verify that the path specified in the fs cp command is correct and that the v...

1 More Replies
stvayers
New Contributor
  • 1718 Views
  • 1 replies
  • 0 kudos

How to mount AWS EFS via NFS on a Databricks Cluster

I'm trying to read ~500 million small JSON files into a Spark Auto Loader pipeline, and I seem to be slowed down massively by S3 request limits, so I want to explore using AWS EFS instead. I found this blog post: https://www.databricks.com/blog/20...

Latest Reply
Satyadeepak
Databricks Employee
  • 0 kudos

Hi @stvayers, please refer to this doc: https://docs.databricks.com/api/workspace/clusters/create It has instructions on how to mount using EFS.

leo-machado
New Contributor III
  • 243 Views
  • 5 replies
  • 2 kudos

(Big) Problem with SQL Warehouse Auto stop

Long story short, I'm not sure if this is an already known problem, but the Auto Stop feature on SQL Warehouses after minutes of inactivity is not working properly. We started using SQL Warehouses more aggressively this December when we scaled up one ...

image (2).png Screenshot 2025-01-02 at 10.31.27.png
Latest Reply
pdiamond
New Contributor II
  • 2 kudos

Is this still being investigated by Databricks? I'm seeing similar behavior that's costing us a lot of money.

4 More Replies
Bepposbeste1993
New Contributor III
  • 288 Views
  • 4 replies
  • 0 kudos

Resolved! select 1 query not finishing

Hello, I have an issue where even a query like "select 1" does not finish. The SQL warehouse runs indefinitely. I have no idea where to look for problems because I can't see any errors in the Spark UI. What is interesting is that all-purpose clusters also (...

Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Bepposbeste1993, Do you have the case ID raised for this issue? 

3 More Replies