Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices.
Since we moved from Azure to AWS, a specific job has extremely long VACUUM runs. Is there a specific flag/configuration for the S3 storage that is needed to support faster vacuum? How can I research what's going on? Note, it's not ALL jobs, but a sp...
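In case it helps the investigation, here is a minimal sketch of one documented Delta Lake knob for vacuum on object stores, which moves file deletion off the driver and onto the cluster; the table name and retention window are placeholders, not values from the original post:

```python
# A hedged sketch, assuming the slow step is S3 object deletion rather than
# file listing, and assuming a Databricks notebook where `spark` is predefined.
from delta.tables import DeltaTable

# Documented Delta Lake setting: delete tombstoned files in parallel across
# the cluster instead of serially on the driver.
spark.conf.set("spark.databricks.delta.vacuum.parallelDelete.enabled", "true")

dt = DeltaTable.forName(spark, "main.schema.events")  # hypothetical table name
dt.vacuum(168)  # retention in hours; 168 h = 7 days
```

Comparing the vacuum duration before and after flipping this setting (on the one slow job) would at least tell you whether deletion throughput is the bottleneck.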
I am trying to read a Vertica table into a Spark DataFrame using JDBC in Databricks. Here is my sample code:

hostname = ""
username = ""
password = ""
database_port = ""
database_name = ""
qry_col_level = f"""SELECT * FROM analytics_DS.ansh_units_cum_dash""...
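For reference, a complete read along these lines might look like the sketch below; the connection values and table name are placeholders, and it assumes the Vertica JDBC driver JAR is attached to the cluster:

```python
# Hypothetical connection values; replace with your own.
hostname = "vertica.example.com"
database_port = "5433"
database_name = "analytics"
username = "user"
password = "secret"

jdbc_url = f"jdbc:vertica://{hostname}:{database_port}/{database_name}"
query = "SELECT * FROM analytics_DS.ansh_units_cum_dash"

# Standard Spark JDBC read; `spark` is predefined in a Databricks notebook.
df = (
    spark.read.format("jdbc")
    .option("url", jdbc_url)
    .option("query", query)
    .option("user", username)
    .option("password", password)
    .option("driver", "com.vertica.jdbc.Driver")  # requires the Vertica JDBC JAR
    .load()
)
df.show(5)
```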
Today, we are announcing the industry's first Generative AI Engineer learning pathway and certification to help ensure that data and AI practitioners have the resources to be successful with generative AI. At Databricks, we recognize that generative ...
Hi, I am working for a large company that is implementing a Databricks solution. We have multiple domains, each responsible for its own data products, following a data mesh approach. As part of a federated governance model, we need a way to communicate...
Hi @Tuno986, you can implement a notification-driven system by having each domain team register new data products in a Delta table or send an event. This triggers automated compliance checks (schema, lineage, ACLs) using Databricks Workflows. Governan...
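A minimal sketch of the registration side of that pattern, assuming a central Delta table; the `governance.data_product_registry` name and its columns are hypothetical:

```python
# Each domain appends a row when it publishes a new data product; a triggered
# workflow can then pick up new rows and run the compliance checks.
from datetime import datetime, timezone
from pyspark.sql import Row

registration = [
    Row(
        domain="sales",                     # publishing domain
        product="orders_curated",           # data product name
        table_name="sales.curated.orders",  # Unity Catalog three-level name
        registered_at=datetime.now(timezone.utc),
    )
]

(
    spark.createDataFrame(registration)
    .write.format("delta")
    .mode("append")
    .saveAsTable("governance.data_product_registry")
)
```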
Hi Team, I'm unable to access the below-mentioned URL page: https://customer-academy.databricks.com/learn/course/external/view/elearning/2916/data-analysis-with-databricks. Can you please help?
I am trying to run a piece of code, and every time I get "Notebook detached: Exception when creating execution context: java.net.SocketTimeoutException: Connect Timeout". Can anyone please tell me if there is any issue going on with the Databricks com...
I am running into the same issue. I am just trying to initialize a Spark session. Please let me know if there is a fix.
Hello, I hope you're well. I'd like to report a bug encountered when entering the verification code to log into the platform. When I type the code without Caps Lock enabled, the input field displays the characters in uppercase, but the code isn't acc...
"SELECT * FROM' data call on my table in PROD is giving all the rows of data (historical data), but a call on my table in DEV is giving me just one row of data (current one row of historical data). what could be the problem??
Is it possible to restore a deleted catalog and schema? If CASCADE is used, the catalog will be dropped even though schemas and tables are present in it. Is it possible to restore the catalog, or is it possible to restrict the use of the CASCADE command? Thank you.
It is not possible to directly restore a deleted catalog or schema if they were dropped with the CASCADE option, especially in Databricks Unity Catalog. When a catalog or schema is dropped with CASCADE, all its dependent objects, such as schemas and ...
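One partial mitigation that may be worth verifying against your workspace (this is an assumption about your Unity Catalog setup, not a guarantee): managed tables dropped within the retention window can sometimes be recovered with UNDROP TABLE once the parent catalog and schema exist again. A hedged sketch with placeholder names:

```python
# UNDROP applies only to Unity Catalog *managed* tables, and only within the
# drop retention window (7 days at the time of writing). Names are hypothetical.
spark.sql("CREATE CATALOG IF NOT EXISTS my_catalog")
spark.sql("CREATE SCHEMA IF NOT EXISTS my_catalog.my_schema")
spark.sql("UNDROP TABLE my_catalog.my_schema.my_table")
```

For prevention, restricting who holds ownership or MANAGE-level privileges on the catalog is the practical way to limit who can issue a destructive DROP ... CASCADE in the first place.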
Is it possible to expand/extend the subnet CIDR of an existing Azure Databricks workspace? Our workspace is currently maxed out. Can the subnet CIDR be expanded/extended without having to create a new workspace?
Yes, it is possible to expand or extend the subnet CIDR of an existing Azure Databricks workspace without creating a new one, but this capability is specifically applicable if the workspace is deployed with VNet injection. For workspaces that use V...
Hi there! I am calling the job list API (via the Python SDK): GET /api/2.2/jobs/list (docs.databricks.com/api/workspace/jobs/list). Does anyone know what ordering is applied/calculated for the list of jobs? Is it consistent or random? Is it by creation tim...
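If ordering matters downstream, one defensive approach (a sketch, not a statement about what the API guarantees) is to sort client-side on the created_time field that the SDK exposes on each job:

```python
# Sketch using databricks-sdk; assumes ambient auth (environment variables
# or a configured profile).
from databricks.sdk import WorkspaceClient

w = WorkspaceClient()

# jobs.list() paginates transparently; sorting locally avoids relying on any
# particular server-side ordering.
jobs = sorted(w.jobs.list(), key=lambda j: j.created_time or 0)

for job in jobs:
    name = job.settings.name if job.settings else None
    print(job.job_id, job.created_time, name)
```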
Thanks both - this was very helpful!
Dear all, I wanted to check if anyone has implemented a solution for capturing information from the Databricks status page in real time, 24x7, and loading it into a log or table... (https://learn.microsoft.com/en-us/azure/databricks/resources/status). What is the be...
It seems that the webhook is the way! There is nothing about system status in the Databricks REST API, and there is nothing about system status in the System Tables schema.
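For a polling fallback alongside the webhook, the status page appears to be a standard Statuspage-style site; if that assumption holds, something like the sketch below could land the current status in a Delta table. Both the endpoint path and the table name are assumptions to verify:

```python
# Assumed Statuspage-style JSON endpoint; confirm the URL before relying on it.
import requests
from datetime import datetime, timezone

resp = requests.get("https://status.databricks.com/api/v2/summary.json", timeout=10)
resp.raise_for_status()
status = resp.json().get("status", {})

row = [(
    datetime.now(timezone.utc).isoformat(),
    status.get("indicator"),     # e.g. "none", "minor", "major"
    status.get("description"),
)]

(
    spark.createDataFrame(row, "polled_at string, indicator string, description string")
    .write.format("delta")
    .mode("append")
    .saveAsTable("ops.databricks_status_log")  # hypothetical table
)
```

Scheduled as a small job every few minutes, this gives a queryable history even if a webhook delivery is missed.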
I am using Delta Live Tables and Pub/Sub to ingest messages from 30 different topics in parallel. I noticed that initialization time can be very long, around 15 minutes. Does someone know how to reduce initialization time in DLT? Thank you.
Classic clusters can take up to seven minutes to be acquired, configured, and deployed, with most of this time spent waiting for the cloud service to allocate virtual machines. In contrast, serverless clusters typically start in under eight seconds. ...
I’d love to build a quick "art of the possible" demo showing how easy it is to query unstructured PDFs using natural language. In Snowflake, I wired up a similar solution in ~2 hours just by following their tutorial guide. Does anyone know the best wa...
To query unstructured PDF files using natural language in Databricks, you can leverage an approach similar to the "Retrieval Augmented Generation (RAG) and DBRX" demo. Although the specific demo you referenced (https://notebooks.databricks.com/demos/...
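The ingestion half of that RAG pattern is small enough to sketch here; this assumes the pypdf package and a placeholder volume path and table name, with embedding and indexing left to whichever vector search setup you use:

```python
# Minimal PDF-to-chunks ingestion sketch (assumes `pip install pypdf`).
from pypdf import PdfReader

def pdf_to_chunks(path: str, chunk_chars: int = 1500) -> list[str]:
    """Extract text from a PDF and split it into fixed-size chunks."""
    reader = PdfReader(path)
    text = "\n".join(page.extract_text() or "" for page in reader.pages)
    return [text[i : i + chunk_chars] for i in range(0, len(text), chunk_chars)]

chunks = pdf_to_chunks("/Volumes/main/default/docs/report.pdf")  # hypothetical path

# Land the chunks in a Delta table so a vector search index can sync from it.
(
    spark.createDataFrame(list(enumerate(chunks)), "chunk_id int, text string")
    .write.format("delta")
    .mode("overwrite")
    .saveAsTable("main.default.pdf_chunks")  # hypothetical table
)
```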
Hi everyone, I'm attempting to use MLflow experiment tracking from a local machine, but I'm encountering difficulties in uploading artifacts. I've tried sample code as simple as the following:

import mlflow
import os
os.environ["DATABRICKS_HOST"] = "...
A couple of things:
1. If you don't own the MLflow experiment, you need to have edit permissions on the experiment (needed for logging). Default artifact locations in DBFS (`dbfs:/databricks/mlflow-tracking/`) require explicit write permissions.
2. The lo...
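For completeness, a hedged sketch of remote tracking from a laptop; the experiment path is a placeholder, and it assumes a personal access token in the usual environment variables:

```python
# Assumes DATABRICKS_HOST and DATABRICKS_TOKEN are set in the environment and
# that you have edit permission on the experiment path below.
import mlflow

mlflow.set_tracking_uri("databricks")
mlflow.set_experiment("/Users/someone@example.com/local-test")  # hypothetical path

with mlflow.start_run():
    mlflow.log_param("alpha", 0.5)
    mlflow.log_metric("rmse", 0.12)

    # Artifact upload is the step that fails without write access to the
    # experiment's artifact location.
    with open("note.txt", "w") as f:
        f.write("hello from a local machine")
    mlflow.log_artifact("note.txt")
```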