Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.

Forum Posts

fiverrpromotion
by New Contributor
  • 1376 Views
  • 1 reply
  • 1 kudos

Resolved! Addressing Memory Constraints in Scaling XGBoost and LGBM: A Comprehensive Approach for High-Volume

Scaling XGBoost and LightGBM models to handle exceptionally large datasets—those comprising billions to tens of billions of rows—presents a formidable computational challenge, particularly when constrained by the limitations of in-memory processing o...

Latest Reply
jamesl
Databricks Employee
  • 1 kudos

Hi @fiverrpromotion, As you mention, scaling XGBoost and LightGBM for massive datasets has its challenges, especially when trying to preserve critical training capabilities such as early stopping and handling of sparse features / high-cardinality cat...

JavierS
by New Contributor
  • 1318 Views
  • 1 reply
  • 0 kudos

Resolved! Problem with ray train and Databricks Notebook (Strange dbutils error)

Hi everyone, I'm running some code to train a multimodal Hugging Face model with SFTTrainer and TorchTrainer to use all GPU workers. When I try to execute trainer.fit(), it gives me a dbutils serialization error, even though I am not using dbutils directly in...

Get Started Discussions
AIR
Databricks
DeepLearning
Distributed
ray
Latest Reply
sarahbhord
Databricks Employee
  • 0 kudos

JavierS -  The dbutils serialization error occurs in your code because dbutils is only available on the Databricks driver node and cannot be pickled or transferred to Spark or Ray worker nodes. This error can appear even if your code doesn't directly...
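
The failure mode described in this reply can be reproduced without Databricks. The sketch below is a stand-in, not the real dbutils: `DriverOnlyHandle`, `TrainConfig`, and `can_pickle` are hypothetical names, and the standard `pickle` module is used here in place of whatever serializer Ray or Spark applies. It shows why an object that only indirectly references a driver-bound handle fails to serialize, and how passing plain values instead avoids that.

```python
import pickle


class DriverOnlyHandle:
    """Hypothetical stand-in for a driver-bound object such as dbutils."""

    def __reduce__(self):
        # Refuse serialization, mimicking an object that only exists on the driver.
        raise pickle.PicklingError("DriverOnlyHandle cannot be pickled")


class TrainConfig:
    """Config that accidentally keeps a reference to the driver-only handle."""

    def __init__(self, handle, learning_rate):
        self.handle = handle
        self.learning_rate = learning_rate


def can_pickle(obj):
    """Return True if obj survives pickling, which shipping it to a worker requires."""
    try:
        pickle.dumps(obj)
        return True
    except Exception:
        return False


handle = DriverOnlyHandle()

# Indirect reference: the config drags the handle along, so serialization fails.
bad_config = TrainConfig(handle, learning_rate=1e-4)

# Plain values only: resolve what is needed on the driver, then pass simple data.
good_config = {"learning_rate": bad_config.learning_rate}

print(can_pickle(bad_config))
print(can_pickle(good_config))
```

The same idea applies to anything captured by a training function: if a closure or object graph reaches a driver-only handle, the whole graph becomes unserializable, so extracting strings, numbers, and paths up front is usually the simpler route.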

JaydeepKhatri
by New Contributor II
  • 760 Views
  • 2 replies
  • 1 kudos

Resolved! Using merge Schema with spark.read.csv for inconsistent schemas

The Problem: A common data engineering challenge is reading a directory of CSV files where the schemas are inconsistent. For example, some files might have columns in a different order, or be missing certain columns altogether. The standard behavior o...
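
To make the problem concrete, here is a small plain-Python sketch of what merging by column name means. It is not Spark and not the Databricks CSV reader; `file_a`, `file_b`, `read_rows`, and `merge_by_name` are hypothetical names invented for illustration. Rows are matched to columns by header name rather than by position, and columns missing from a file are filled with `None`.

```python
import csv
import io

# Two hypothetical CSV files: different column order, and one lacks "city".
file_a = "id,name,city\n1,Ann,Oslo\n"
file_b = "name,id\nBob,2\n"


def read_rows(text):
    """Parse CSV text into dicts keyed by the header names of that file."""
    return list(csv.DictReader(io.StringIO(text)))


def merge_by_name(files):
    """Union the columns across files and fill missing values with None."""
    parsed = [read_rows(text) for text in files]
    columns = []
    for rows in parsed:
        for row in rows:
            for col in row:
                if col not in columns:
                    columns.append(col)
    merged = [
        {col: row.get(col) for col in columns}
        for rows in parsed
        for row in rows
    ]
    return columns, merged


columns, merged = merge_by_name([file_a, file_b])
print(columns)
for row in merged:
    print(row)
```

Reading by position instead would put "Bob" into the `id` column for the second file, which is the kind of silent misalignment the post describes.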

Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

Hey @JaydeepKhatri, here are some helpful points to consider: Is this an officially supported, enhanced feature of the Databricks CSV reader? Based on internal research, this appears to be an undocumented “feature” of Spark running on Databricks. ...

1 More Replies
MadelynM
by Databricks Employee
  • 4240 Views
  • 10 replies
  • 0 kudos

Who's hiring? Latest Job Postings from the Databricks Community!

More than 10,000 organizations worldwide — including Block, Comcast, Condé Nast, Rivian, Shell and over 60% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to take control of their data and put it to work with AI. Use this thre...

Latest Reply
Ashwini23
New Contributor II
  • 0 kudos

Job Title: Engineering Data Analyst Lead. Company name: Wartsila. Location: Mumbai. Required years of experience: 8 years. Link to apply: https://careers.wartsila.com/job-invite/46999/ Job description: Responsibilities: Transform raw data into meaningful i...

9 More Replies
Phani1
by Valued Contributor II
  • 3422 Views
  • 5 replies
  • 0 kudos

Talend ETL code to Databricks

Hi Team, What is the best way to transfer Talend ETL code to Databricks, and what are the best methods and practices for migrating Talend ETL jobs to Databricks (notebooks, code conversion/migration strategy, workflows, etc.)? Regards, Janga

Latest Reply
vamsi_simbus
New Contributor III
  • 0 kudos

Hi @Phani1, Were you able to complete the migration from Talend to Databricks? If yes, please let me know the process and tools you used for the migration and the challenges you encountered. Thanks, Vamsi.

4 More Replies
anipar
by New Contributor III
  • 1006 Views
  • 6 replies
  • 5 kudos

Resolved! Databricks metastore

I have configured a new Databricks workspace through my personal Azure account. I cannot see the metastore. Since this is my personal account, I am the admin. Can you please guide me on how to enable the metastore? Thank you.

Latest Reply
Khaja_Zaffer
Contributor III
  • 5 kudos

Hello @anipar, good day! Can you please confirm again if you can see the metastore here? Log in to your Databricks workspace. Click your username in the top bar and select "Manage Account" to open the account console AZURE (https://accounts.azuredatabri...

5 More Replies
Gaz
by New Contributor III
  • 291 Views
  • 1 reply
  • 1 kudos

Resolved! Dashboard embed: dashboard id is missing in token claim

We're trying to embed a dashboard following the example at https://learn.microsoft.com/en-us/azure/databricks/dashboards/embedding/external-embed#step-4-use-the-example-app-to-authenticate-and-generate-tokens The embed is displayed but with an error ...

Latest Reply
Gaz
New Contributor III
  • 1 kudos

Never mind. I accidentally removed the external_viewer_id and external_value parameters. After adding them back, it works as expected.

ericmedina
by New Contributor
  • 247 Views
  • 1 reply
  • 0 kudos

Resolved! Signed up for Customer Academy instead of Partner

Hello. Yesterday, I signed up for an account under my company email as a customer account. However, my intent was to sign up for a partner account and now it won't let me switch back. I sent a ticket to the academy team but have not received a reply ...

Latest Reply
Advika
Databricks Employee
  • 0 kudos

Hello @ericmedina! For issues with Academy access, the correct step is to raise a ticket with the Databricks support team. Since you’ve already submitted one, please allow some time for them to respond.

chinmay0924
by New Contributor III
  • 2947 Views
  • 6 replies
  • 1 kudos

Resolved! Spark connect client and server versions should be same for executing UDFs

I am trying to execute a pandas UDF in Databricks. It gives me the following error on serverless compute: File /local_disk0/.ephemeral_nfs/envs/pythonEnv-b11ff17c-9b25-4ccb-927d-06a7d1ca7221/lib/python3.11/site-packages/pyspark/sql/connect/client/core.p...

Latest Reply
nija
Databricks Employee
  • 1 kudos

@chinmay0924 - You can change the serverless client image by selecting the environment panel in a Databricks Notebook (on the right pane) or in the "Environment and Libraries" section while configuring a Databricks Job Task. The set of available serv...

5 More Replies
rajkishore028
by New Contributor II
  • 523 Views
  • 1 reply
  • 0 kudos

Exam Suspended After Restarting the Exam — Ticket ID: 00744386

Hi Team, I am writing to raise a concern regarding my Databricks Certified Data Engineer Professional exam (Ticket ID: 00744386). During my exam session, the support representative initially asked me to show the entire room, including my bed, whi...

Latest Reply
cert-ops
Databricks Employee
  • 0 kudos

Hello @rajkishore028, Thank you for filing a ticket with our support team. The support team will respond shortly. Please note that we cannot provide support or handle exam suspensions via the community. Thanks & Regards, @cert-ops

Nexusss7
by New Contributor II
  • 332 Views
  • 1 reply
  • 2 kudos

Resolved! How to update comments and constraints on Streaming Tables created by DLT outside the pipeline?

I have a Delta Live Tables (DLT) pipeline that creates a streaming table (e.g., dbrtdflite_catalog.demotest_silver.customer_profile). I want to update table-level and column-level comments as well as constraints on this table. When I try commands like:...

Latest Reply
Saritha_S
Databricks Employee
  • 2 kudos

Hi @Nexusss7, You can add a comment to a Delta Live Tables dataset, either the MV or the streaming table, in the @dlt.table() decorator: @dlt.table(comment = "Delta live tables comment"). Here is the syntax for SQL: https://docs.databricks.com/aws/en/d...

fjrodriguez
by New Contributor III
  • 768 Views
  • 1 reply
  • 1 kudos

Blackduck scanning on Databricks Workflow

Does anyone know whether Black Duck can scan the JSON-based files from Workflows? Notebooks at least are compatible, since Black Duck detects Python-based files, but I am wondering whether workflows can be scanned as well.

Latest Reply
koji_kawamura
Databricks Employee
  • 1 kudos

Hi @fjrodriguez , Have you found your answer already? I hope I understand your question correctly. I think Databricks Asset Bundles (or Terraform) can be a useful tool to be added here, if you haven't looked at it already. Yes, you can and should s...

Phani1
by Valued Contributor II
  • 7105 Views
  • 5 replies
  • 1 kudos

Databricks Job Failure + Service now Integration

Hi Team, Could you please suggest how to raise a ServiceNow ticket in case of a Databricks job failure? Regards, Phanindra

Latest Reply
PRV
New Contributor II
  • 1 kudos

The JSON response from Databricks to ServiceNow doesn't give enough details. Is there a way we can send customized JSON with additional details?

4 More Replies
rseshadri
by New Contributor III
  • 3711 Views
  • 8 replies
  • 0 kudos

Unable to install the Databricks in AWS

I have followed the instructions given in the following link: https://docs.databricks.com/en/admin/workspace/quick-start.html#use-aws-quickstart-to-create-a-databricks-workspace There was an error in the CFT while deploying the workspace. Please help. [ER...

Latest Reply
S_Reo1
New Contributor II
  • 0 kudos

Hi @rseshadri, Please check the following two points: STS Endpoint Activation: Confirm whether the STS endpoint for the us-west-2 region is active. You can verify this from the IAM account settings. Resource Limits: Ensure that the limits for resources ...

7 More Replies
