Get Started Discussions

by phguk • New Contributor III

04-16-2025 3:20:57 PM

1643 Views
2 replies
0 kudos

Python coding in notebook with a (long) token

I have written a python program (called by a trigger) that uses a token issued by a third party app (it's circa 400 bytes long including '.' and '-'). When I copy/paste this token into a Databricks notebook - curious formatting takes place and a coup...

Get Started Discussions

Reply

1643 Views
2 replies
0 kudos

04-16-2025 3:20:57 PM

View Replies

Latest Reply

ashraf1395
Honored Contributor

04-16-2025 11:05:22 PM

0 kudos

Hey Paul, You can use databricks secrets for preserving the integrity of the token.Here's the databricks doc for refernece : https://docs.databricks.com/aws/en/security/secrets

0 kudos

04-16-2025 11:05:22 PM

1 More Replies

by dplaut • New Contributor II

12-04-2023 12:57:09 PM

4454 Views
3 replies
0 kudos

Save output of show table extended to table?

I want to save the output of show table extended in catalogName like 'mysearchtext*';to a table.How do I do that?

Get Started Discussions

Reply

4454 Views
3 replies
0 kudos

12-04-2023 12:57:09 PM

View Replies

Latest Reply

njoyb
New Contributor II

04-18-2025 11:51:04 PM

0 kudos

Use DESCRIBE EXTENDED customer AS JSON this returns as a json data . This you can load Applicable to databricks 16.2 and abovehttps://docs.databricks.com/aws/en/sql/language-manual/sql-ref-syntax-aux-describe-table

0 kudos

04-18-2025 11:51:04 PM

2 More Replies

by BhavyaSreeBanga • New Contributor

04-08-2025 2:34:50 AM

5335 Views
2 replies
1 kudos

Missing Genie - Upload File Feature in Preview Section

Despite having admin privileges for both the workspace and Genie Workspace, we are unable to see the "Genie - Upload File" feature under the Preview section, even though the documentation indicates it should be available.We also attempted switching r...

Get Started Discussions

Reply

5335 Views
2 replies
1 kudos

04-08-2025 2:34:50 AM

View Replies

Latest Reply

sridharplv
Valued Contributor II

04-18-2025 12:34:13 PM

1 kudos

For more information around upload a file option please refer https://docs.databricks.com/aws/en/genie/file-uploadit supports csv and excel datasets as of now with condition that files must be smaller than 200 MB and contain fewer than 100 columns du...

1 kudos

04-18-2025 12:34:13 PM

1 More Replies

by abin-bcgov • New Contributor III

04-16-2025 6:20:31 PM

1976 Views
4 replies
4 kudos

Resolved! using Azure Databricks vs using Databricks directly

Hi friends,A quick question regarding how data, workspace controls works while using "Azure Databricks". I am planning to use Azure Databricks that comes as part of my employer's Azure Subscriptions. I work for a Public sector organization, which is ...

Get Started Discussions

Reply

1976 Views
4 replies
4 kudos

04-16-2025 6:20:31 PM

View Replies

Latest Reply

abin-bcgov
New Contributor III

04-17-2025 12:30:48 PM

4 kudos

Thanks a ton, @SP_6721

4 kudos

04-17-2025 12:30:48 PM

3 More Replies

by Kruthika • New Contributor

04-17-2025 2:47:14 AM

4769 Views
0 replies
0 kudos

Support for managed identity based authentication in python kafka client

We followed this document https://docs.databricks.com/aws/en/connect/streaming/kafka?language=Python#msk-aad to use Kafka client to read events from our event hub for a feature.As part of the SFI, the guidance is to move away from client secret and u...

Get Started Discussions

Reply

4769 Views
0 replies
0 kudos

04-17-2025 2:47:14 AM

by MLEngineer • New Contributor

09-17-2024 11:17:36 AM

563 Views
1 replies
0 kudos

Right course for ML engineer

Hi I would like to learn databricks so that I could look for job opportunities as a ML engineer. I have background with python programming, computer vision (OpenCV) .not having much of experience with azure , aws so on.which course here is good with ...

Get Started Discussions

Reply

563 Views
1 replies
0 kudos

09-17-2024 11:17:36 AM

View Replies

Latest Reply

pedrotramos97
New Contributor II

04-16-2025 1:42:08 PM

0 kudos

Given your background in Python programming and computer vision but limited experience with cloud platforms, the best pathway to enter the job market as MLE using Databricks is to pursue the Databricks Certified Machine Learning Associate certificati...

0 kudos

04-16-2025 1:42:08 PM

by VaderK • New Contributor

04-08-2025 12:36:21 AM

3363 Views
1 replies
1 kudos

Resolved! Why does .collect() cause a shuffle while .show() does not?

I’m learning Spark using the book Spark: The Definitive Guide and came across some behavior I’m trying to understand.I am reading a csv_file which has 3 columns: DEST_COUNTRY_NAME, ORIGIN_COUNTRY_NAME, count. The dataset has a total of 256 rows.Here’...

Get Started Discussions

collect

pyspark

shuffle

Reply

3363 Views
1 replies
1 kudos

04-08-2025 12:36:21 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

04-16-2025 2:16:32 AM

1 kudos

Q1: collect() moves all data to the driver, hence a shufle. show() just shows x records from the df, from a partition (or more partitions if x > partition size). No shuffling needed.For display purposes the results are of course gathered on the driv...

1 kudos

04-16-2025 2:16:32 AM

by aniket07 • New Contributor II

04-14-2025 9:29:36 PM

1827 Views
2 replies
2 kudos

Lazy evaluation in serverless vs all purpose compute ?

As you can see right now I am connected to serverless compute and when I give wrong path, spark does lazy evaluation and gives error on display. However, when I switch from serverless to my all purpose cluster I get the error when I create the df its...

Get Started Discussions

Reply

1827 Views
2 replies
2 kudos

04-14-2025 9:29:36 PM

View Replies

Latest Reply

sridharplv
Valued Contributor II

04-15-2025 11:01:10 AM

2 kudos

Based on the scenario, what https://community.databricks.com/t5/user/viewprofilepage/user-id/156441 saying is correct though the eager evaluation property is false in both cases and for All-Purpose clusters, Spark is checking the path immediately whe...

2 kudos

04-15-2025 11:01:10 AM

1 More Replies

by tommyhmt • New Contributor II

04-12-2025 8:06:02 AM

756 Views
1 replies
0 kudos

Unable to access external table created by DLT

I originally set the Storage location in my DLT as abfss://{container}@{storageaccount}.dfs.core.windows.net/...But when running the DLT I got the following error:So I decided to leave the above Storage location blank and define the path parameter in...

Get Started Discussions

Reply

756 Views
1 replies
0 kudos

04-12-2025 8:06:02 AM

View Replies

Latest Reply

brockb
Databricks Employee

04-15-2025 10:03:57 AM

0 kudos

Hi @Tommy , Thanks for your question. I would encourage you to verify once using a Pro SQL Warehouse temporarily instead of a Serverless SQL Warehouse given the compute differences between the two - Pro compute resides in your data plane, Serverless ...

0 kudos

04-15-2025 10:03:57 AM

by kro • New Contributor II

10-21-2024 1:34:13 AM

2202 Views
2 replies
2 kudos

OCRmyPDF in Databricks

Hello,Do any of you have experience with using OCRmyPDF in Databricks? I have tried to install it in various was with different versions, but my notebook keep crashing with the error:The Python process exited with exit code 139 (SIGSEGV: Segmentation...

Get Started Discussions

ocr

ocrmypdf

pdf

segmentation fault

tesseract

Reply

2202 Views
2 replies
2 kudos

10-21-2024 1:34:13 AM

View Replies

Latest Reply

sridharplv
Valued Contributor II

04-15-2025 9:44:51 AM

2 kudos

Refer to this link too https://community.databricks.com/t5/data-engineering/pdf-parsing-in-notebook/td-p/14636

2 kudos

04-15-2025 9:44:51 AM

1 More Replies

by EllaClark • New Contributor II

04-05-2025 2:01:47 AM

3279 Views
2 replies
0 kudos

Can I automate notebook tagging based on workspace folder structure?

Hi all,I’m currently organizing a growing number of notebooks in our Databricks workspace and trying to keep things manageable with proper tagging and metadata. One idea I had was to automatically apply tags to notebooks based on their folder structu...

Get Started Discussions

Reply

3279 Views
2 replies
0 kudos

04-05-2025 2:01:47 AM

View Replies

Latest Reply

Renu_
Valued Contributor II

04-15-2025 6:58:18 AM

0 kudos

Hi @EllaClark, Yes, you can automate tagging of Databricks notebooks based on folder structure using the REST API and a script. Use the Workspace API to list notebook paths, extract folder names, and treat them as tags.If the API supports metadata up...

0 kudos

04-15-2025 6:58:18 AM

1 More Replies

by Kabi • New Contributor III

04-14-2025 2:29:53 AM

870 Views
1 replies
1 kudos

Resolved! Simple notebook sync

Hi, is there a simple way to sync a local notebook with a Databricks notebook? For example, is it possible to just connect to the Databricks kernel or something similar?I know there are IDE extensions for this, but unfortunately, they use the local d...

Get Started Discussions

Reply

870 Views
1 replies
1 kudos

04-14-2025 2:29:53 AM

View Replies

Latest Reply

Renu_
Valued Contributor II

04-14-2025 8:12:18 AM

1 kudos

Hi @Kabi, as of my knowledge databricks doesn’t support directly connecting to Databricks kernel. However, here are practical ways to sync your local notebook with Databricks:You can use Git to version control your notebooks. Clone your repo into Dat...

1 kudos

04-14-2025 8:12:18 AM

by Mani2105 • New Contributor II

04-11-2025 11:22:13 AM

537 Views
1 replies
1 kudos

Databricks Dashboard ,passing Prompt Values from one page to another

HI Guys,I have a dashboard with main page where I have a base query and added a date time range widget and linked it to filter the base query , Now I have a Page 2 where i use a a different sumamrized query as a source , base query2 . I need this qu...

Get Started Discussions

Reply

537 Views
1 replies
1 kudos

04-11-2025 11:22:13 AM

View Replies

Latest Reply

Renu_
Valued Contributor II

04-14-2025 6:30:30 AM

1 kudos

Hi @Mani2105, I guess currently, Databricks dashboards don’t support sharing widget parameters like date range filters across pages. Each page is isolated, so filters must be recreated manually per page. Manual configuration remains the only way to m...

1 kudos

04-14-2025 6:30:30 AM

by Rjdudley • Honored Contributor

04-10-2025 1:02:36 PM

1124 Views
2 replies
1 kudos

Asinine bad word detection

Are you kidding me here--I couldn't post this reply because (see arrows because I can't say the words)? I've run afoul of this several times before, bad word detection was a solved problem in the 1990s and there is even a term for errors like this--...

Get Started Discussions

Reply

1124 Views
2 replies
1 kudos

04-10-2025 1:02:36 PM

View Replies

Latest Reply

Advika
Databricks Employee

04-11-2025 3:01:27 AM

1 kudos

Hello @Rjdudley! Thank you for bringing this to our attention. We understand how frustrating it can be to have your message incorrectly flagged, especially when you're contributing meaningfully. While our filters are in place to maintain a safe space...

1 kudos

04-11-2025 3:01:27 AM

1 More Replies

by tw1 • New Contributor III

04-10-2025 12:24:57 AM

1020 Views
5 replies
1 kudos

AI/BI Dashboard - Hide Column in Table Visualization, but not in exported data

How can I hide specific colum from a table visualization, but not in the exported data.I have over 200 columns in my query result and the ui freeze while I want to show it in a table visualization. So I want to hide specific columns, but if I export ...

Get Started Discussions

Reply

1020 Views
5 replies
1 kudos

04-10-2025 12:24:57 AM

View Replies

Latest Reply

tw1
New Contributor III

04-10-2025 5:40:03 AM

1 kudos

.

1 kudos

04-10-2025 5:40:03 AM

4 More Replies

Databricks Community

Forum Posts

Python coding in notebook with a (long) token

Save output of show table extended to table?

Missing Genie - Upload File Feature in Preview Section

Resolved! using Azure Databricks vs using Databricks directly

Support for managed identity based authentication in python kafka client

Right course for ML engineer

Resolved! Why does .collect() cause a shuffle while .show() does not?

Lazy evaluation in serverless vs all purpose compute ?

Unable to access external table created by DLT

OCRmyPDF in Databricks

Can I automate notebook tagging based on workspace folder structure?

Resolved! Simple notebook sync

Databricks Dashboard ,passing Prompt Values from one page to another

Asinine bad word detection

AI/BI Dashboard - Hide Column in Table Visualization, but not in exported data

Join Us as a Local Community Builder!

Databricks partner Tech Summit FY26 access

Using merge Schema with spark.read.csv for inconsi...

Problem with ray train and Databricks Notebook (St...

Addressing Memory Constraints in Scaling XGBoost a...

Need help understanding Databricks