cancel
Showing results for 
Search instead for 
Did you mean: 
Discussions
Engage in dynamic conversations covering diverse topics within the Databricks Community. Explore discussions on data engineering, machine learning, and more. Join the conversation and expand your knowledge base with insights from experts and peers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Browse the Community

Community Discussions

Engage in vibrant discussions covering diverse learning topics within the Databricks Community. Expl...

4276 Posts

Activity in Discussions

Suheb
by > New Contributor II
  • 6 Views
  • 0 replies
  • 0 kudos

How can I create my first notebook and run a Spark job in Databricks?

How do I start in Databricks by creating a notebook and use it to run a simple data processing task (a Spark job)?

  • 6 Views
  • 0 replies
  • 0 kudos
dbernstein_tp
by > New Contributor III
  • 22 Views
  • 1 replies
  • 0 kudos

Resolved! Failed to edit ingestion pipeline PostgreSQL slot name cannot be empty or null

I'm trying to add tables to an existing SQL server CDC ingestion pipeline and today am getting this mysterious error message. Failed to edit ingestion pipelinePostgreSQL slot name cannot be empty or null Have not encountered this before. Is this simp...

  • 22 Views
  • 1 replies
  • 0 kudos
Latest Reply
dbernstein_tp
New Contributor III
  • 0 kudos

After I posted this I noticed that the gateway compute for this pipeline was repeatedly failing and retrying. This was resolved by increasing our quota of "Standard FS Family" compute on Azure. And when that was resolved the above error also disappea...

  • 0 kudos
Shivaprasad
by > New Contributor III
  • 29 Views
  • 1 replies
  • 0 kudos

Error while creating databricks custom app

I am trying to create a simple databricks custom app but I am getting Error: Could not import 'app'. error.app.yaml fileenv: - name: FLASK_APP   value: '/Workspace/Users/sam@xxx.com/databricks_apps/hello-world_2025_11_13-16_19/Gaap_commentry/app'comm...

  • 29 Views
  • 1 replies
  • 0 kudos
Latest Reply
Raman_Unifeye
Contributor III
  • 0 kudos

Seems like you are combining a file path for FLASK_APP with running a file using command. When FLASK_APP is set to a full path, Flask expects that path to point to a Python file (e.g., app.py) or package that contains the application instance.​Correc...

  • 0 kudos
sparkplug
by > New Contributor III
  • 873 Views
  • 12 replies
  • 5 kudos

Resolved! I need a switch to turn off Data Apps in databricks workspaces

HiHow do I disable Data Apps on my workspace. This is really annoying that Databricks pushes new features without any option to disable them. At least you should have some tools to control access before rolling it out. It seems you only care about fe...

  • 873 Views
  • 12 replies
  • 5 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 5 kudos

@Raman_Unifeye , I don't have visibility into the roadmap. However, if you are a customer you can always log a feature request. Cheers, Louis.

  • 5 kudos
11 More Replies
Carsten03
by > New Contributor III
  • 24677 Views
  • 11 replies
  • 6 kudos

Resolved! Run workflow using git integration with service principal

Hi,I want to run a dbt workflow task and would like to use the git integration for that. Using my personal user I am able to do so but I am running my workflows using a service principal.I added git credentials and the repository using terraform. I a...

  • 24677 Views
  • 11 replies
  • 6 kudos
Latest Reply
Coffee77
Contributor III
  • 6 kudos

On the other hand, another approach you could use. Configure your tasks with relative paths to notebooks and deploy all of them with DAB. Your job will reference directly the deployed notebook, no need to access GIT from jobs/notebooks. That is deleg...

  • 6 kudos
10 More Replies
smoortema
by > Contributor
  • 20 Views
  • 1 replies
  • 1 kudos

how to know which join type was used (broadcast, shuffle hash or sort merge join) for a query?

What is the best way to know what kind of join was used for a SQL query between broadcast, shuffle hash and sort merge? How can the spark UI or the query plan be interpreted?

  • 20 Views
  • 1 replies
  • 1 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hello @smoortema , here are some helpful tips and tricks.   Here’s how to quickly determine which join strategy Spark used—between broadcast hash join, shuffle hash join, and sort-merge join—and how to read both the query plan and the Spark UI to ver...

  • 1 kudos
DataRabbit
by > New Contributor II
  • 21880 Views
  • 5 replies
  • 0 kudos

Resolved! py4j.security.Py4JSecurityException: Constructor public org.apache.spark.ml.feature.VectorAssembler(java.lang.String) is not whitelisted.

Hello, i have a problem.When I try to run the MLlib Assembler (from pyspark.ml.feature import VectorAssembler) I get this error and I don't know what to do anymore. Please help.

  • 21880 Views
  • 5 replies
  • 0 kudos
Latest Reply
VenuG
New Contributor III
  • 0 kudos

Do you plan to support this in Serverless Free Edition? Migration from Community Edition to Serveless has been fraught with these limitations.

  • 0 kudos
4 More Replies
shivamrai162
by > New Contributor III
  • 49 Views
  • 1 replies
  • 0 kudos

Agent Bricks Multi Agent Supervisor failing update

Hello,I recently created a Multi Agent supervisor yesterday and it is working fine till date but when i created a second Multi Agent Supervisor assistant, im facing below error: Endpoint update failedFailed to deploy : Quota Exceeded: You've hit the ...

shivamrai162_0-1763468341971.png
  • 49 Views
  • 1 replies
  • 0 kudos
Latest Reply
Advika
Databricks Employee
  • 0 kudos

Hello @shivamrai162! The error is from a workspace quota, not billing. You’ve hit the Model Serving provisioned concurrency quota, which is enforced independently of your remaining trial credits. That’s why you can still have $200 left and see a quot...

  • 0 kudos
Pratikmsbsvm
by > Contributor
  • 43 Views
  • 2 replies
  • 1 kudos

How to Design a Data Quality Framework for Medallion Architecture Data Pipeline

Hello,I am building a Data Pipeline which extract data from Oracle Fusion and Push it to Databricks Delta lake.I am using Bronze, Silver and Gold Approach.May someone please help me how to control all three segment that is Bronze, Silver and Gold wit...

  • 43 Views
  • 2 replies
  • 1 kudos
Latest Reply
nayan_wylde
Esteemed Contributor
  • 1 kudos

Here’s how you can implement DQ at each stage:Bronze LayerChecks:File format validation (CSV, JSON, etc.).Schema validation (column names, types).Row count vs. source system.Tools:Use Databricks Autoloader with schema evolution and badRecordsPathImpl...

  • 1 kudos
1 More Replies
Shalabh007
by > Honored Contributor
  • 8888 Views
  • 6 replies
  • 19 kudos

Practice Exams for Databricks Certified Data Engineer Professional exam

Can anyone help with official Practice Exams set for Databricks Certified Data Engineer Professional exam, like we have below for Databricks Certified Data Engineer AssociatePractice exam for the Databricks Certified Data Engineer Associate exam

  • 8888 Views
  • 6 replies
  • 19 kudos
Latest Reply
JOHNBOSCOW23
  • 19 kudos

I Passed my Exam today thanks

  • 19 kudos
5 More Replies
peternagy
by > New Contributor
  • 42 Views
  • 1 replies
  • 0 kudos

Databricks Model Serving Endpoint Fails: “_USER not found for feature table”

Hi Databricks Community,I’m trying to deploy a model serving endpoint that uses Databricks Feature Store (Unity Catalog, online tables).My offline and online feature tables are created and visible in Databricks.The model is logged with FeatureEnginee...

  • 42 Views
  • 1 replies
  • 0 kudos
Latest Reply
AbhaySingh
Databricks Employee
  • 0 kudos

Root cause in plain English The lookup client is trying to read SQL-style credentials like PREFIX_USER/PREFIX_PASSWORD for a third‑party online store and the “prefix” is empty, so it searches for “_USER” and fails. That auth scheme applies only to th...

  • 0 kudos
Mario_D
by > New Contributor III
  • 278 Views
  • 4 replies
  • 1 kudos

Resolved! Agentic AI Workload Analyzer

Hello Databricks experts,In Automating Best Practices with Agentic AI Workload Analyzer, Krishna Satyavarapu and Nikhil Mishra mentioned  "The Agentic AI Workload Analyzer". Is it known when this will be available.cheersMario  

  • 278 Views
  • 4 replies
  • 1 kudos
Latest Reply
Thompson2345
New Contributor II
  • 1 kudos

Generative AI is transforming how we handle data and automation. The key challenge now is balancing model creativity with control — ensuring reliable outputs while keeping innovation at the core.

  • 1 kudos
3 More Replies
anhnnguyen
by > Visitor
  • 40 Views
  • 2 replies
  • 1 kudos

Adding maven dependency to ETL pipeline

Hello guys,I'm building ETL pipeline and need to access HANA data lake file system. In order to do that I need to have sap-hdlfs library in compute environment, library is available in maven repository.My job will have multiple notebook task and ETL ...

anhnnguyen_0-1763437214864.png
  • 40 Views
  • 2 replies
  • 1 kudos
Latest Reply
nayan_wylde
Esteemed Contributor
  • 1 kudos

DLT doesn’t have a UI for library installation, but you can:Use libraries configuration in the pipeline JSON or YAML spec:{ "libraries": [ { "maven": { "coordinates": "com.sap.hana.hadoop:sap-hdlfs:<version>" } } ] }Or...

  • 1 kudos
1 More Replies
Charuvil
by > New Contributor III
  • 34 Views
  • 2 replies
  • 1 kudos

Data Profile on a table is not securable object

Data Profile on a table is not a securable object in Unity Catalog or at Workspace level. This make the management of Data Profiles difficult for workspace admins.Why isn’t “profile” a securable object in Databricks? It makes sense to require “Manage...

  • 34 Views
  • 2 replies
  • 1 kudos
Latest Reply
Raman_Unifeye
Contributor III
  • 1 kudos

well no concrete answer on why, perhaps Data Profile is treated as ephemeral, computed metadata or a snapshot of summary statistics (like min/max, distinct counts, etc.). It is created by a user's compute job within a specific workspace environment. ...

  • 1 kudos
1 More Replies
mishrash12n
by > New Contributor II
  • 565 Views
  • 6 replies
  • 1 kudos

Resolved! Request for certification discount voucher

Hello,My name is Shubham. I recently watched your video and found it very informative. I have completed my B.Tech and am in the process of joining a company that requires a Databricks certification.I am reaching out to request a voucher for the Datab...

  • 565 Views
  • 6 replies
  • 1 kudos
Latest Reply
jagruti105
Visitor
  • 1 kudos

When is the next festival date ? I missed it

  • 1 kudos
5 More Replies