Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.

Forum Posts

Dulce42
by New Contributor II
  • 839 Views
  • 1 reply
  • 0 kudos

Trusted assets vs query examples

Hi community! In recent days I explored trusted assets in my Genie space and it's working very well, but I feel a little confused. In my Genie space I have many query examples. When I create a new function with the same query example to verify th...

Latest Reply
Advika
Databricks Employee
  • 0 kudos

Hello @Dulce42! It depends on your use case. If your function covers the scenario well, you don’t need a separate query example. Having both for the same purpose can create redundancy and make things more complex. Choose the option that best fits you...

HaripriyaP
by New Contributor II
  • 1273 Views
  • 2 replies
  • 0 kudos

Resolved! Need help to add a personal email to Databricks Partner account

I have been actively using the Databricks Partner Academy for the past three years through my current organization. As I am planning to transition to a new company, I would like to ensure continued access to my training records and certifications. Cur...

Latest Reply
HaripriyaP
New Contributor II
  • 0 kudos

Sure. Thank you!

1 More Replies
Terje
by New Contributor
  • 1140 Views
  • 1 reply
  • 0 kudos

Python versions - Notebooks and DBR

Hi, I have a problem with conflicting Python versions in a notebook running with the Databricks 14-day free trial. One example: spark.conf.get("spark.databricks.clusterUsageTags.clusterName") # Returns: "Python versions in the Spark Connect client and...

Latest Reply
Renu_
Valued Contributor II
  • 0 kudos

Hi @Terje, were you able to fix it? From what I know, during the free trial period we’re limited to the default setup, so version mismatches can’t be resolved unless we upgrade to a paid workspace.

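For anyone hitting the same mismatch, here is a minimal sketch to see both versions the error message compares: the interpreter the notebook (Spark Connect client) runs, and the Python the cluster runs server-side. It assumes a notebook with a live spark session; a plain UDF executes on the cluster, so it can report the server's version.

    import sys
    from pyspark.sql.functions import udf

    # Client side: the Python interpreter the notebook / Spark Connect client uses.
    print("Client Python:", sys.version.split()[0])

    # Server side: a trivial UDF runs in the cluster's Python,
    # so it reports the version the server is on.
    @udf("string")
    def server_python():
        import sys
        return sys.version.split()[0]

    spark.range(1).select(server_python().alias("server_python")).show()
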
phguk
by New Contributor III
  • 1734 Views
  • 2 replies
  • 0 kudos

Python coding in notebook with a (long) token

I have written a Python program (called by a trigger) that uses a token issued by a third-party app (it's circa 400 bytes long, including '.' and '-'). When I copy/paste this token into a Databricks notebook, curious formatting takes place and a coup...

Latest Reply
ashraf1395
Honored Contributor
  • 0 kudos

Hey Paul, you can use Databricks secrets to preserve the integrity of the token. Here's the Databricks doc for reference: https://docs.databricks.com/aws/en/security/secrets

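A minimal sketch of that approach; the scope and key names below are hypothetical, and the scope would be created beforehand (for example with the Databricks CLI or the Secrets API):

    # Read the third-party token from a secret scope instead of pasting it
    # into a cell; scope and key names here are hypothetical.
    token = dbutils.secrets.get(scope="third-party-app", key="api-token")

    # The token keeps its exact bytes ('.' and '-' included), and its value
    # is redacted if accidentally printed in notebook output.
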
1 More Replies
dplaut
by New Contributor II
  • 4550 Views
  • 3 replies
  • 0 kudos

Save output of show table extended to table?

I want to save the output of SHOW TABLE EXTENDED IN catalogName LIKE 'mysearchtext*'; to a table. How do I do that?

Latest Reply
njoyb
New Contributor II
  • 0 kudos

Use DESCRIBE EXTENDED customer AS JSON, which returns the metadata as JSON data that you can then load. Applicable to Databricks Runtime 16.2 and above: https://docs.databricks.com/aws/en/sql/language-manual/sql-ref-syntax-aux-describe-table

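For the original ask, note that spark.sql returns the output of auxiliary commands as a DataFrame, so it can be written straight to a table. A minimal sketch, with hypothetical schema and table names:

    # SHOW TABLE EXTENDED comes back as a DataFrame like any other query result.
    df = spark.sql("SHOW TABLE EXTENDED IN my_schema LIKE 'mysearchtext*'")

    # Persist the metadata snapshot to a table (target name is hypothetical).
    df.write.mode("overwrite").saveAsTable("my_schema.table_metadata_snapshot")
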
2 More Replies
BhavyaSreeBanga
by New Contributor
  • 5492 Views
  • 2 replies
  • 1 kudos

Missing Genie - Upload File Feature in Preview Section

Despite having admin privileges for both the workspace and Genie Workspace, we are unable to see the "Genie - Upload File" feature under the Preview section, even though the documentation indicates it should be available. We also attempted switching r...

Latest Reply
sridharplv
Valued Contributor II
  • 1 kudos

For more information on the upload-a-file option, please refer to https://docs.databricks.com/aws/en/genie/file-upload. It supports CSV and Excel datasets as of now, with the condition that files must be smaller than 200 MB and contain fewer than 100 columns du...

1 More Replies
abin-bcgov
by New Contributor III
  • 2169 Views
  • 4 replies
  • 4 kudos

Resolved! using Azure Databricks vs using Databricks directly

Hi friends, a quick question regarding how data and workspace controls work while using "Azure Databricks". I am planning to use the Azure Databricks that comes as part of my employer's Azure subscriptions. I work for a public-sector organization, which is ...

Latest Reply
abin-bcgov
New Contributor III
  • 4 kudos

Thanks a ton, @SP_6721 

3 More Replies
MLEngineer
by New Contributor
  • 597 Views
  • 1 reply
  • 0 kudos

Right course for ML engineer

Hi, I would like to learn Databricks so that I could look for job opportunities as an ML engineer. I have a background in Python programming and computer vision (OpenCV), but not much experience with Azure, AWS, and so on. Which course here is good with ...

Latest Reply
pedrotramos97
Databricks Employee
  • 0 kudos

Given your background in Python programming and computer vision but limited experience with cloud platforms, the best pathway to enter the job market as an MLE using Databricks is to pursue the Databricks Certified Machine Learning Associate certificati...

VaderK
by New Contributor
  • 3519 Views
  • 1 reply
  • 1 kudos

Resolved! Why does .collect() cause a shuffle while .show() does not?

I’m learning Spark using the book Spark: The Definitive Guide and came across some behavior I’m trying to understand. I am reading a CSV file which has 3 columns: DEST_COUNTRY_NAME, ORIGIN_COUNTRY_NAME, count. The dataset has a total of 256 rows. Here’...

Labels: collect, pyspark, shuffle
Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

Q1: collect() moves all data to the driver, hence a shuffle. show() just shows x records from the df, from a partition (or more partitions if x > partition size). No shuffling needed. For display purposes the results are of course gathered on the driv...

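A minimal sketch of the difference described above; the file path is hypothetical:

    # Small illustration of the behavior described in the reply.
    df = spark.read.csv("/tmp/flight-data.csv", header=True)

    # show() only needs enough rows to print, typically read from the first
    # partition(s), so nothing beyond those rows is gathered.
    df.show(5)

    # collect() materializes every partition and ships all rows to the
    # driver as a list of Row objects.
    rows = df.collect()
    print(len(rows))
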
aniket07
by New Contributor II
  • 1931 Views
  • 2 replies
  • 2 kudos

Lazy evaluation in serverless vs all-purpose compute?

As you can see, right now I am connected to serverless compute, and when I give a wrong path, Spark does lazy evaluation and gives an error on display. However, when I switch from serverless to my all-purpose cluster, I get the error when I create the df its...

Latest Reply
sridharplv
Valued Contributor II
  • 2 kudos

Based on the scenario, what https://community.databricks.com/t5/user/viewprofilepage/user-id/156441 is saying is correct: though the eager evaluation property is false in both cases, for all-purpose clusters Spark is checking the path immediately whe...

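A minimal sketch of the behavior being compared; the path is hypothetical and intentionally wrong. On serverless (Spark Connect) the client defers analysis, so the bad path tends to surface only at the action; on a classic all-purpose cluster, schema inference touches storage as soon as the read is declared.

    # Hypothetical, intentionally missing path.
    df = spark.read.parquet("/Volumes/main/default/no_such_path")
    # ^ on an all-purpose cluster this line itself can raise a
    #   path-not-found error, because schema inference reads the path eagerly.

    df.show()
    # ^ on serverless / Spark Connect the failure typically appears only
    #   here, when the plan is finally executed.
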
1 More Replies
tommyhmt
by New Contributor II
  • 866 Views
  • 1 reply
  • 0 kudos

Unable to access external table created by DLT

I originally set the Storage location in my DLT as abfss://{container}@{storageaccount}.dfs.core.windows.net/... But when running the DLT I got an error. So I decided to leave the above Storage location blank and define the path parameter in...

Latest Reply
brockb
Databricks Employee
  • 0 kudos

Hi @Tommy, thanks for your question. I would encourage you to verify once using a Pro SQL Warehouse temporarily instead of a Serverless SQL Warehouse, given the compute differences between the two: Pro compute resides in your data plane, Serverless ...

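If the goal is an external table from the pipeline, a minimal sketch of declaring an explicit path on a single table, rather than a pipeline-wide Storage location; the container, account, and source below are hypothetical, and the pipeline needs access to that location:

    import dlt

    @dlt.table(
        name="my_external_table",
        # Hypothetical ADLS location for this one table.
        path="abfss://container@storageaccount.dfs.core.windows.net/tables/my_external_table",
    )
    def my_external_table():
        # Hypothetical source; replace with the pipeline's real input.
        return spark.read.format("json").load("/Volumes/main/default/raw")
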
kro
by New Contributor II
  • 2549 Views
  • 2 replies
  • 2 kudos

OCRmyPDF in Databricks

Hello, do any of you have experience with using OCRmyPDF in Databricks? I have tried to install it in various ways with different versions, but my notebook keeps crashing with the error: The Python process exited with exit code 139 (SIGSEGV: Segmentation...

Labels: ocr, ocrmypdf, pdf, segmentation fault, tesseract
Latest Reply
sridharplv
Valued Contributor II
  • 2 kudos

Refer to this link too: https://community.databricks.com/t5/data-engineering/pdf-parsing-in-notebook/td-p/14636

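Worth noting that exit code 139 (SIGSEGV) with OCRmyPDF is often a native-dependency problem rather than the Python package itself: it shells out to tesseract and ghostscript. A minimal sketch, assuming a classic (non-serverless) cluster where you have permission to install system packages; file names are hypothetical:

    import subprocess

    # OCRmyPDF needs the native tesseract and ghostscript binaries; a missing
    # or mismatched native library is a common cause of SIGSEGV crashes.
    subprocess.run(["apt-get", "update"], check=True)
    subprocess.run(["apt-get", "install", "-y", "tesseract-ocr", "ghostscript"], check=True)
    subprocess.run(["pip", "install", "ocrmypdf"], check=True)

    import ocrmypdf
    ocrmypdf.ocr("input.pdf", "output.pdf", language="eng")  # hypothetical files
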
1 More Replies
EllaClark
by New Contributor II
  • 3443 Views
  • 2 replies
  • 0 kudos

Can I automate notebook tagging based on workspace folder structure?

Hi all, I’m currently organizing a growing number of notebooks in our Databricks workspace and trying to keep things manageable with proper tagging and metadata. One idea I had was to automatically apply tags to notebooks based on their folder structu...

Latest Reply
Renu_
Valued Contributor II
  • 0 kudos

Hi @EllaClark, yes, you can automate tagging of Databricks notebooks based on folder structure using the REST API and a script. Use the Workspace API to list notebook paths, extract folder names, and treat them as tags. If the API supports metadata up...

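A minimal sketch of the listing step, assuming the requests package, a personal access token in DATABRICKS_TOKEN, the workspace URL in DATABRICKS_HOST, and a hypothetical root folder:

    import os
    import requests

    host = os.environ["DATABRICKS_HOST"]
    headers = {"Authorization": f"Bearer {os.environ['DATABRICKS_TOKEN']}"}

    def list_notebooks(path="/"):
        # Workspace API: list objects under a path, recursing into folders.
        resp = requests.get(f"{host}/api/2.0/workspace/list",
                            headers=headers, params={"path": path})
        resp.raise_for_status()
        for obj in resp.json().get("objects", []):
            if obj["object_type"] == "DIRECTORY":
                yield from list_notebooks(obj["path"])
            elif obj["object_type"] == "NOTEBOOK":
                yield obj["path"]

    for nb in list_notebooks("/Workspace/Projects"):  # hypothetical root folder
        tags = nb.strip("/").split("/")[:-1]  # folder names become the tags
        print(nb, tags)
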
1 More Replies
Kabi
by New Contributor III
  • 969 Views
  • 1 reply
  • 1 kudos

Resolved! Simple notebook sync

Hi, is there a simple way to sync a local notebook with a Databricks notebook? For example, is it possible to just connect to the Databricks kernel or something similar? I know there are IDE extensions for this, but unfortunately, they use the local d...

Latest Reply
Renu_
Valued Contributor II
  • 1 kudos

Hi @Kabi, to my knowledge Databricks doesn’t support directly connecting to the Databricks kernel. However, here are practical ways to sync your local notebook with Databricks: you can use Git to version-control your notebooks. Clone your repo into Dat...

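As a lightweight alternative to the IDE extensions, a minimal sketch using the databricks-sdk package to push a local notebook into the workspace; the local file name and workspace path are hypothetical, and authentication comes from the usual env vars or ~/.databrickscfg:

    import base64
    from databricks.sdk import WorkspaceClient
    from databricks.sdk.service.workspace import ImportFormat, Language

    w = WorkspaceClient()  # reads host/token from env vars or ~/.databrickscfg

    with open("analysis.py", "rb") as f:  # hypothetical local notebook source
        w.workspace.import_(
            path="/Workspace/Users/me@example.com/analysis",  # hypothetical target
            format=ImportFormat.SOURCE,
            language=Language.PYTHON,
            content=base64.b64encode(f.read()).decode(),
            overwrite=True,
        )
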
