cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

chari
by Contributor
  • 1840 Views
  • 0 replies
  • 0 kudos

how to run a group of cells in databricks ?

Hello,I was experimenting with a ML model with different parameters and check the results. However, the important part of this code is contained in a couple of cells (say cell # 12, 13 & 14). I like to proceed to the next cell only when the results a...

  • 1840 Views
  • 0 replies
  • 0 kudos
AravindNani
by New Contributor
  • 3039 Views
  • 1 replies
  • 0 kudos

Unable to read data from API due to Private IP Restriction

I have data in my API Endpoint but am unable to read it using Databricks. My data is limited to my private IP address and can only be accessed over a VPN connection. I can't read data into Databricks as a result of this. I can obtain the data in VS C...

  • 3039 Views
  • 1 replies
  • 0 kudos
Latest Reply
Wojciech_BUK
Valued Contributor III
  • 0 kudos

Hi AravindNaniThis is more of infrastructure questions, you have to make sure that:1) Your databricks Workspace is provisioned in VNET Injection mode2) Your VNET is either peered to "HUB" network where you have S2S VPN Connection to API or you have t...

  • 0 kudos
hpicatto
by New Contributor III
  • 3126 Views
  • 1 replies
  • 0 kudos

using the api for getting cost in usd

I'm trying to use the API of billable usage and I do get a report but I have not been able to get the usd cost report, only the dbuHours. I guess I've to change the meter_name but I cannot find the key for that parameter anywhere

  • 3126 Views
  • 1 replies
  • 0 kudos
arkiboys
by Contributor
  • 4544 Views
  • 1 replies
  • 0 kudos

databricks email notification

In databricks, if a job fails, then an email is sent off as notification.The recipient, receives the email with the link to the databricks workspace.Question:How is it possible the email is sent without any link, just the plain text in the email is w...

  • 4544 Views
  • 1 replies
  • 0 kudos
RozaZaharieva
by New Contributor
  • 1841 Views
  • 0 replies
  • 0 kudos

set up Azure Databricks workspace and Unity catalog - how to automate not using Terraform

Hi everyone, I am looking for a way to automate initial setup of Azure Databricks workspace and Unity Catalog but can't find anything on this topic other than Terraform. Can you share if this is possible with powershell, for example. Thank you un adv...

  • 1841 Views
  • 0 replies
  • 0 kudos
mohaimen_syed
by New Contributor III
  • 9241 Views
  • 3 replies
  • 1 kudos

Fuzzy Match on PySpark using UDF/Pandas UDF

I'm trying to do fuzzy matching on two dataframes by cross joining them and then using a udf for my fuzzy matching. But using both python udf and pandas udf its either very slow or I get an error. @pandas_udf("int")def core_match_processor(s1: pd.Ser...

  • 9241 Views
  • 3 replies
  • 1 kudos
Latest Reply
mohaimen_syed
New Contributor III
  • 1 kudos

I'm now getting the error: (SQL_GROUPED_AGG_PANDAS_UDF) is not supported on clusters in Shared access mode.Even though this article clearly states that pandas udf is supported for shared cluster in databrickshttps://www.databricks.com/blog/shared-clu...

  • 1 kudos
2 More Replies
ntvdatabricks
by New Contributor II
  • 5898 Views
  • 2 replies
  • 1 kudos

Resolved! Okta and Unified login

Hey Folks anyone put Databricks behind Okta and enabled Unified Login with workspaces that have a Unity Catalog metastore applied and some that don't?There are some workspaces we can't move over yet and it isn't clear in documentation if Unity Catalo...

  • 5898 Views
  • 2 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Yes, users should be able to use a single Okta application for all workspaces, regardless of whether the Unity Catalog metastore has been applied or not. The Unity Catalog is a feature that allows you to manage and secure access to your data across a...

  • 1 kudos
1 More Replies
Shravanshibu
by New Contributor III
  • 1154 Views
  • 0 replies
  • 0 kudos

Public preview API not working - artifact-allowlists

 I am trying to hit /api/2.1/unity-catalog/artifact-allowlists/as a part of INIT migration script. Its is in public preview, do we need to enable anything else to use a API which is in Public preview. I am getting 404 error. But using same token for ...

  • 1154 Views
  • 0 replies
  • 0 kudos
SaiNeelakantam
by New Contributor
  • 2507 Views
  • 1 replies
  • 0 kudos

How to enable "Create Vector Search Index" button in DB workspace?

How to enable "Create Vector Search Index" button in DB workspace?Following is the screenshot from the Microsoft Ignite 2023 Databricks presentation:

  • 2507 Views
  • 1 replies
  • 0 kudos
Latest Reply
PL_db
Databricks Employee
  • 0 kudos

The feature is in public preview only in some regions, you can check the available regions in the documentation here.  In addition there are certain requirements, such as a UC enabled workspace and Serverless Compute enabled, you can check all requir...

  • 0 kudos
SamGreene
by Contributor II
  • 3828 Views
  • 5 replies
  • 0 kudos

CONVERT_TIMEZONE issue in DLT

I can run a query that uses the CONVERT_TIMEZONE function in a SQL notebook.  When I move the code to my DLT notebook the pipeline produces this error:Cannot resolve function `CONVERT_TIMEZONE`Here is the line:  CONVERT_TIMEZONE('UTC', 'America/Phoen...

  • 3828 Views
  • 5 replies
  • 0 kudos
Latest Reply
annn
New Contributor II
  • 0 kudos

Yes, the notebook is set to SQL and the convert_timezone function is within a select statement.

  • 0 kudos
4 More Replies
Ak_0926
by New Contributor
  • 5731 Views
  • 2 replies
  • 1 kudos

Can we get the actual query execution plan programmatically after a query is executed? Apart from UI

Let's say i have run a query and it showed me results. we can find the respective query execution plan on the UI. Is there any way we can get that execution plan through programmatically or through API?

  • 5731 Views
  • 2 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

You can obtain the query execution plan programmatically using the EXPLAIN statement in SQL. The EXPLAIN statement displays the execution plan that the database planner generates for the supplied statement. The execution plan shows how the table(s) r...

  • 1 kudos
1 More Replies
Danny_Lee
by Valued Contributor
  • 2742 Views
  • 2 replies
  • 4 kudos

Top Kudoed Author 🌟🤩🧑‍🎤

I recently saw a link to the Kudos Leaderboard for the Community Discussions.  It has always been my hope and fantasy , ever since I was a little child that I would someday be the #1 Kudoed Author on Community Discusions on community.Databricks.com....

KudosOprahGIF.gif
  • 2742 Views
  • 2 replies
  • 4 kudos
Latest Reply
Danny_Lee
Valued Contributor
  • 4 kudos

Thanks @DB_Paul - I'm on my way!   

  • 4 kudos
1 More Replies
Khalil
by Contributor
  • 7719 Views
  • 5 replies
  • 7 kudos

Incremental ingestion of Snowflake data with Delta Live Table (CDC)

Hello,I have some data which are lying into Snowflake, so I want to apply CDC on them using delta live table but I am having some issues.Here is what I am trying to do:  @dlt.view() def table1(): return spark.read.format("snowflake").options(**opt...

  • 7719 Views
  • 5 replies
  • 7 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 7 kudos

The CDC for delta live works fine for delta tables, as you have noticed.  However it is not a full blown CDC implementation/software.If you want to capture changes in Snowflake, you will have to implement some CDC method on Snowflake itself, and read...

  • 7 kudos
4 More Replies
Anku_
by New Contributor II
  • 2066 Views
  • 2 replies
  • 0 kudos

New to PySpark

Hi all,I am trying to get the domain from an email field using below expression; but getting an error.Kindly help. df.select(df.email, substring(df.email,instr(df.email,'@'),length(df.email).alias('domain')))

  • 2066 Views
  • 2 replies
  • 0 kudos
Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

In your case, you want to extract the domain from the email, which starts from the position just after '@'. So, you should add 1 to the position of '@'. Also, the length of the substring should be the difference between the total length of the email ...

  • 0 kudos
1 More Replies
kickbuttowski
by New Contributor II
  • 1432 Views
  • 1 replies
  • 0 kudos

Issue in inferring schema for streaming dataframe using json files

Below is the pileine design in databricks and it's not working out , kindly look on this and let me know whether it will work or not , I'm getting json files of different schemas from directory under the root directory and it read all the files using...

  • 1432 Views
  • 1 replies
  • 0 kudos
Latest Reply
AmanSehgal
Honored Contributor III
  • 0 kudos

Could you please share some sample of your dataset and code snippet of what you're trying to implement?

  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels