- 1376 Views
- 1 replies
- 1 kudos
Resolved! Addressing Memory Constraints in Scaling XGBoost and LGBM: A Comprehensive Approach for High-Volume
Scaling XGBoost and LightGBM models to handle exceptionally large datasets—those comprising billions to tens of billions of rows—presents a formidable computational challenge, particularly when constrained by the limitations of in-memory processing o...
Hi @fiverrpromotion, As you mention, scaling XGBoost and LightGBM for massive datasets has its challenges, especially when trying to preserve critical training capabilities such as early stopping and handling of sparse features / high-cardinality cat...
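The out-of-core pattern behind this answer can be sketched without Databricks at all: XGBoost's external-memory training consumes data through a chunk iterator (the `xgboost.DataIter` / `QuantileDMatrix` machinery) rather than one giant in-memory matrix. The stdlib sketch below only illustrates the chunking idea; the file contents and chunk size are invented, and real training would hand each chunk to XGBoost instead of parsing floats.

```python
# Stdlib sketch of the external-memory idea: yield the dataset in fixed-size
# chunks instead of materializing it all at once. In real XGBoost this role
# is played by a DataIter fed to QuantileDMatrix; data here is invented.
import csv
import io
import itertools

def iter_chunks(lines, chunk_size):
    """Yield lists of parsed rows, at most chunk_size rows each."""
    reader = csv.reader(lines)
    while True:
        chunk = list(itertools.islice(reader, chunk_size))
        if not chunk:
            return
        yield [[float(x) for x in row] for row in chunk]

data = io.StringIO("1,2\n3,4\n5,6\n7,8\n9,10\n")
chunks = list(iter_chunks(data, chunk_size=2))
```

The point of the pattern is that peak memory is bounded by the chunk size, not the dataset size, which is what makes billions of rows tractable at all.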
- 1318 Views
- 1 replies
- 0 kudos
Resolved! Problem with ray train and Databricks Notebook (Strange dbutils error)
Hi everyone, I'm running some code to train a multimodal Hugging Face model with SFTTrainer and TorchTrainer to use all GPU workers. When trying to execute trainer.fit() it gives me a dbutils serialization error, even though I am not using dbutils directly in...
JavierS - The dbutils serialization error occurs in your code because dbutils is only available on the Databricks driver node and cannot be pickled or transferred to Spark or Ray worker nodes. This error can appear even if your code doesn't directly...
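The fix the reply points at, resolving everything on the driver and shipping only plain values to workers, can be demonstrated with nothing but the stdlib. Here a class holding a thread lock stands in for dbutils (both are unpicklable); the class name and secret value are invented for the illustration.

```python
# dbutils exists only on the driver; anything shipped to Ray/Spark workers
# is pickled first, so it must not reference that handle. DriverUtils is a
# stand-in for dbutils: the thread lock inside it cannot be pickled.
import pickle
import threading

class DriverUtils:
    def __init__(self):
        self._handle = threading.Lock()  # unpicklable, like a live connection
    def get_secret(self, key):
        return f"value-for-{key}"        # invented return value

def can_pickle(obj):
    try:
        pickle.dumps(obj)
        return True
    except TypeError:
        return False

dbu = DriverUtils()
# The fix: resolve values on the driver first, then ship only plain data.
secret = dbu.get_secret("storage-key")   # a str pickles fine
```

The same rule explains why the error appears even when dbutils is never called directly: any object graph that transitively reaches the handle fails to serialize.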
- 760 Views
- 2 replies
- 1 kudos
Resolved! Using merge Schema with spark.read.csv for inconsistent schemas
The Problem: A common data engineering challenge is reading a directory of CSV files where the schemas are inconsistent. For example, some files might have columns in a different order, or be missing certain columns altogether. The standard behavior o...
Hey @JaydeepKhatri here are some helpful points to consider: Is this an officially supported, enhanced feature of the Databricks CSV reader? Based on internal research, this appears to be an undocumented “feature” of Spark running on Databricks. ...
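For readers outside Spark, the effect that the mergeSchema option approximates for CSV can be sketched in plain Python with csv.DictReader: take the union of all headers and fill missing columns with null. The file contents below are invented; this illustrates the merge semantics, not the Databricks reader itself.

```python
# Plain-Python sketch of merging inconsistent CSV schemas: read each file by
# header, union the column names, fill gaps with None. Data is invented.
import csv
import io

def read_merged(files):
    """files: iterable of text streams. Returns (columns, rows as dicts)."""
    tables = [list(csv.DictReader(f)) for f in files]
    columns = []
    for table in tables:
        for name in (table[0].keys() if table else []):
            if name not in columns:
                columns.append(name)
    rows = [{c: row.get(c) for c in columns}
            for table in tables for row in table]
    return columns, rows

f1 = io.StringIO("id,name\n1,alice\n")
f2 = io.StringIO("name,id,age\nbob,2,30\n")  # reordered, extra column
cols, rows = read_merged([f1, f2])
```

Reading by header rather than by position is what makes column reordering harmless here, which is the same property an undocumented reader feature would need to guarantee before you rely on it.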
- 4240 Views
- 10 replies
- 0 kudos
Who's hiring? Latest Job Postings from the Databricks Community!
More than 10,000 organizations worldwide — including Block, Comcast, Condé Nast, Rivian, Shell and over 60% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to take control of their data and put it to work with AI. Use this thre...
Job Title - Engineering Data Analyst Lead
Company name - Wartsila
Location - Mumbai
Required years of experience - 8 years
Link to apply - https://careers.wartsila.com/job-invite/46999/
Job description / Responsibilities: Transform raw data into meaningful i...
- 3422 Views
- 5 replies
- 0 kudos
Talend ETL code to Databricks
Hi Team, What is the best way to transfer Talend ETL code to Databricks, and what are the best methods/practices for migrating Talend ETLs to Databricks (notebooks, code conversion/migration strategy, workflows, etc.)? Regards, Janga
Hi @Phani1, Were you able to complete the migration from Talend to Databricks? If yes, please do let me know the process/tools that you used for the migration and the challenges that you encountered. Thanks, Vamsi.
- 1006 Views
- 6 replies
- 5 kudos
Resolved! Databricks metastore
I have configured a new Databricks workspace through my personal Azure account. I cannot see the metastore. Since this is my personal account, I am the admin. Can you please guide me on how to enable the metastore? Thank you.
Hello @anipar, good day!! Can you please confirm again if you can see the metastore here?
1. Log in to your Databricks workspace.
2. Click your username in the top bar and select "Manage Account" to open the account console AZURE (https://accounts.azuredatabri...
- 291 Views
- 1 replies
- 1 kudos
Resolved! Dashboard embed: dashboard id is missing in token claim
We're trying to embed a dashboard following the example at https://learn.microsoft.com/en-us/azure/databricks/dashboards/embedding/external-embed#step-4-use-the-example-app-to-authenticate-and-generate-tokens The embed is displayed but with an error ...
Nevermind. I accidentally removed the external_viewer_id and external_value parameters. After adding them back it works as expected.
- 329 Views
- 0 replies
- 3 kudos
Understanding Databricks Clean Rooms: Design, Access, and Cost Model
Hi everyone, I wanted to share some insights and clarifications around Databricks Clean Rooms, especially regarding how they are designed and how costs are calculated — something that can be a bit confusing when onboarding collaborators. What Are Clean...
- 247 Views
- 1 replies
- 0 kudos
Resolved! Signed up for Customer Academy instead of Partner
Hello. Yesterday, I signed up for an account under my company email as a customer account. However, my intent was to sign up for a partner account and now it won't let me switch back. I sent a ticket to the academy team but have not received a reply ...
Hello @ericmedina! For issues with Academy access, the correct step is to raise a ticket with the Databricks support team. Since you’ve already submitted one, please allow some time for them to respond.
- 2947 Views
- 6 replies
- 1 kudos
Resolved! Spark connect client and server versions should be same for executing UDFs
I am trying to execute a pandas UDF in Databricks. It gives me the following error on serverless compute: File /local_disk0/.ephemeral_nfs/envs/pythonEnv-b11ff17c-9b25-4ccb-927d-06a7d1ca7221/lib/python3.11/site-packages/pyspark/sql/connect/client/core.p...
@chinmay0924 - You can change the serverless client image by selecting the environment panel in a Databricks Notebook (on the right pane) or in the "Environment and Libraries" section while configuring a Databricks Job Task. The set of available serv...
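The check the error message implies can be run by hand: compare `pyspark.__version__` (the client wheel pinned in the serverless environment) against `spark.version` (the server). Below is a stdlib sketch of that comparison, under the assumption that agreement at the major.minor level is what matters:

```python
# In a notebook you would compare the two sides directly:
#   import pyspark; pyspark.__version__   # Spark Connect client (the wheel)
#   spark.version                          # the Spark Connect server
# Stdlib sketch of the compatibility rule the error message implies:

def connect_compatible(client_version, server_version):
    major_minor = lambda v: tuple(int(p) for p in v.split(".")[:2])
    return major_minor(client_version) == major_minor(server_version)
```

If the versions disagree, the resolution described in the reply is to pick a different serverless environment version rather than pin pyspark yourself.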
- 523 Views
- 1 replies
- 0 kudos
Exam Suspended After Restarting the Exam — Ticket ID: 00744386
Hi Team, I am writing to raise a concern regarding my Databricks Certified Data Engineer Professional exam (Ticket ID: 00744386). During my exam session, the support representative initially requested me to show the entire room, including my bed, whi...
Hello @rajkishore028, Thank you for filing a ticket with our support team; the support team will respond shortly. Please note that we cannot provide support or handle exam suspensions via the community. Thanks & Regards, @cert-ops
- 332 Views
- 1 replies
- 2 kudos
Resolved! How to update comments and constraints on Streaming Tables created by DLT outside the pipeline?
I have a Delta Live Tables (DLT) pipeline that creates a streaming table (e.g., dbrtdflite_catalog.demotest_silver.customer_profile). I want to update table-level and column-level comments as well as constraints on this table. When I try commands like:...
Hi @Nexusss7, You can add a comment to Delta Live Tables objects, either the MV or the streaming table, via the decorator: @dlt.table(comment="Delta live tables comment"). Here is the syntax for SQL: https://docs.databricks.com/aws/en/d...
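As a sketch of the decorator route the reply describes: comments and expectations are declared on the table definition itself, so the pipeline owns them and an external ALTER is not needed. This fragment runs only inside a DLT pipeline (the `dlt` module is injected there), and the column names are illustrative.

```python
# Runs only inside a DLT pipeline; table name is from the question above,
# column names and comments are invented for illustration.
import dlt

@dlt.table(
    comment="Customer profiles cleaned in the silver layer",  # table comment
    schema="""
        customer_id STRING COMMENT 'Primary key',
        email       STRING COMMENT 'Contact address'
    """,
)
@dlt.expect_or_drop("valid_customer_id", "customer_id IS NOT NULL")  # constraint
def customer_profile():
    return dlt.read_stream("customer_profile_bronze")
```

On the next pipeline update the comments and the expectation are applied to the streaming table, which is why editing them outside the pipeline is rejected.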
- 768 Views
- 1 replies
- 1 kudos
Blackduck scanning on Databricks Workflow
Does anyone know whether Black Duck can scan the JSON-based files that define Workflows? Notebooks are compatible at least, since Black Duck detects the Python-based files, but I am wondering whether Workflows can be scanned as well.
Hi @fjrodriguez, Have you found your answer already? I hope I understand your question correctly. I think Databricks Asset Bundles (or Terraform) can be a useful tool to add here, if you haven't looked at them already. Yes, you can and should s...
- 7105 Views
- 5 replies
- 1 kudos
Databricks Job Failure + Service now Integration
Hi Team, Could you please suggest how to raise a ServiceNow ticket in case of a Databricks job failure? Regards, Phanindra
The JSON response from Databricks to ServiceNow doesn't give enough details. Is there a way we can send customized JSON with additional details?
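One way to send richer detail than the default notification (an assumption about the setup, not an official integration) is a small failure-handler task that POSTs a customized incident to ServiceNow's Table API. The instance URL, token, and extra fields below are illustrative; only the payload assembly is exercised here, and the actual send is left to `urlopen(req)`.

```python
# Assemble a customized ServiceNow incident for a failed job. Instance URL
# and field values are invented; the endpoint is the standard Table API
# path /api/now/table/incident.
import json
import urllib.request

def build_incident(job_name, run_id, error, extra=None):
    """Build the customized JSON body, merging any extra fields."""
    payload = {
        "short_description": f"Databricks job failed: {job_name}",
        "description": f"Run {run_id} failed with: {error}",
        "urgency": "2",
    }
    payload.update(extra or {})
    return payload

def make_request(instance, payload, token):
    return urllib.request.Request(
        f"https://{instance}/api/now/table/incident",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json",
                 "Authorization": f"Bearer {token}"},
        method="POST",
    )

payload = build_incident("nightly_etl", 12345, "OOM on stage 7",
                         extra={"assignment_group": "data-platform"})
req = make_request("example.service-now.com", payload, "TOKEN")
```

Because you build the payload yourself, any run metadata the job task can see (task name, error text, run page URL) can be added to the incident, which addresses the "not enough details" complaint above.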
- 3711 Views
- 8 replies
- 0 kudos
Unable to install the Databricks in AWS
I have followed the instructions given in the following link: https://docs.databricks.com/en/admin/workspace/quick-start.html#use-aws-quickstart-to-create-a-databricks-workspace There was an error in CFT while deploying the workspace. Please help. [ER...
Hi @rseshadri, please check the following two points:
- STS Endpoint Activation: Confirm whether the STS endpoint for the us-west-2 region is active. You can verify this from the IAM account settings.
- Resource Limits: Ensure that the limits for resources ...