Get Started Discussions

Forum Posts

Еmil
by New Contributor III
  • 667 Views
  • 3 replies
  • 1 kudos

Resolved! source set to GIT for Databricks Asset Bundle notebook_task - git authentication fails on run

My post was marked as spam when I tried to post the description of my issue, so I have posted the question on Stack Overflow instead.

Latest Reply
Kaniz
Community Manager
  • 1 kudos

Hi @Еmil, I've read through your question and believe I have a solution for you. Here's a response to your question: Since your job runs as a service principal, consider using OAuth M2M authentication for accessing your Azure DevOps Git repository. En...
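
A minimal sketch of one way that advice can be wired up with the Databricks SDK for Python: authenticate as the Azure service principal (OAuth M2M) and register a Git credential for it, so a notebook_task with source set to GIT can clone the repo. The host, IDs, and Azure DevOps token below are hypothetical placeholders, not the confirmed fix from the thread.

from databricks.sdk import WorkspaceClient

# Authenticate as the Azure service principal via OAuth M2M.
# All values below are placeholders.
w = WorkspaceClient(
    host="https://adb-1234567890123456.7.azuredatabricks.net",
    azure_client_id="<sp-application-id>",
    azure_client_secret="<sp-client-secret>",
    azure_tenant_id="<tenant-id>",
)

# Register an Azure DevOps Git credential for that principal so
# bundle jobs running as the SP can fetch the notebook source.
w.git_credentials.create(
    git_provider="azureDevOpsServices",
    git_username="<devops-user-or-sp>",
    personal_access_token="<azure-devops-token>",
)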

2 More Replies
Frustrated_DE
by New Contributor II
  • 153 Views
  • 1 reply
  • 0 kudos

DLT SQL demo pipeline issue

Hi, first foray into DLT, following code excerpts from the sample DLT notebook. I'm creating a notebook with the SQL below: CREATE STREAMING LIVE TABLE sales_orders_raw COMMENT "The raw sales orders, ingested from /databricks-datasets." TBLPROPERTIES ...

Latest Reply
Frustrated_DE
New Contributor II
  • 0 kudos

It works if you change the notebook's default language rather than using a magic command. I normally have it set to Python and wrongly assumed DLT would transpose; you can't use a magic command, but have to change the default language in order for it to work.
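
For reference, a rough Python equivalent of that first SQL table, for the case where the notebook's default language is Python; the Auto Loader path and format below are assumptions based on the retail-org sample dataset, not taken from the post.

import dlt

@dlt.table(
    comment="The raw sales orders, ingested from /databricks-datasets."
)
def sales_orders_raw():
    # Auto Loader ingest; the path and format are assumed here.
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .load("/databricks-datasets/retail-org/sales_orders/")
    )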

hpicatto
by New Contributor III
  • 108 Views
  • 3 replies
  • 0 kudos

Using the API for getting cost in USD

I'm trying to use the billable usage API, and I do get a report, but I have not been able to get the USD cost report, only the dbuHours. I guess I have to change the meter_name, but I cannot find the key for that parameter anywhere.

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @hpicatto, AWS Usage Reports: AWS provides detailed usage and cost reports through the AWS Cost and Usage Report. You can access this report via the AWS Management Console. Here are the steps: Log in to the AWS Management Console. Navigate to the B...
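
On the Databricks side, the billable usage download endpoint returns DBU quantities rather than dollars, so a USD figure is usually derived by applying your contracted rate. A rough sketch, with a hypothetical account ID, credentials, and $/DBU rate (newer accounts authenticate with OAuth tokens instead of basic auth):

import csv
import io
import requests

ACCOUNT_ID = "<databricks-account-id>"  # placeholder

resp = requests.get(
    f"https://accounts.cloud.databricks.com/api/2.0/accounts/{ACCOUNT_ID}/usage/download",
    params={"start_month": "2024-01", "end_month": "2024-01", "personal_data": "false"},
    auth=("<account-admin-user>", "<password>"),  # placeholder credentials
    timeout=60,
)
resp.raise_for_status()

DOLLARS_PER_DBU = 0.55  # placeholder; use your contracted rate per SKU
total = sum(
    float(row["dbus"]) * DOLLARS_PER_DBU  # "dbus" per the usage CSV schema
    for row in csv.DictReader(io.StringIO(resp.text))
)
print(f"Estimated cost: ${total:,.2f}")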

2 More Replies
Floody
by New Contributor II
  • 252 Views
  • 1 reply
  • 0 kudos

New draft for every post I visit

When I visit my profile page, under the drafts section I see an entry for every post I visit in the discussions. Is this normal?

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Floody, Yes, it is normal to see an entry for every post you visit in the discussions under the drafts section of your profile page. This feature allows you to easily access and continue working on drafts of posts that you have started or viewed ...

NoviKamayana
by New Contributor
  • 70 Views
  • 1 reply
  • 1 kudos

Database: Delta Lake or PostgreSQL

Hey all, I am searching for a non-political answer to my database questions. Please know that I am a data newbie and literally do not know anything about this topic, but I want to learn, so please be gentle. Some context: I am working for an OEM that...

Latest Reply
Kaniz
Community Manager
  • 1 kudos

Hi @NoviKamayana, Let’s dive into the world of data storage and explore the differences between Delta Lake and PostgreSQL. As a data newbie, you’re on the right track to learn more about these concepts. Data Lake vs. Delta Lake Benefits and Limit...

mohaimen_syed
by New Contributor III
  • 1557 Views
  • 5 replies
  • 1 kudos

Fuzzy Match on PySpark using UDF/Pandas UDF

I'm trying to do fuzzy matching on two DataFrames by cross joining them and then using a UDF for the fuzzy matching. But with both a Python UDF and a pandas UDF it's either very slow or I get an error. @pandas_udf("int") def core_match_processor(s1: pd.Ser...

Latest Reply
Kaniz
Community Manager
  • 1 kudos

Hi @mohaimen_syed, could you please help me with these details: your cluster details, and whether Apache Arrow optimization is enabled on your cluster.
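
For anyone hitting the same wall, a minimal sketch of the pattern with Arrow enabled. The rapidfuzz package and the df1/df2 DataFrames with name_1/name_2 columns are assumptions for illustration, not taken from the thread.

import pandas as pd
from pyspark.sql.functions import col, pandas_udf
from rapidfuzz import fuzz

# Arrow is what makes pandas UDFs fast; make sure it is on.
spark.conf.set("spark.sql.execution.arrow.pyspark.enabled", "true")

@pandas_udf("int")
def fuzzy_score(s1: pd.Series, s2: pd.Series) -> pd.Series:
    # Token-sort similarity in [0, 100] for each row pair.
    return pd.Series([int(fuzz.token_sort_ratio(a, b)) for a, b in zip(s1, s2)])

matches = (
    df1.crossJoin(df2)
    .withColumn("score", fuzzy_score(col("name_1"), col("name_2")))
    .filter(col("score") >= 90)
)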

4 More Replies
kp12
by New Contributor II
  • 2140 Views
  • 3 replies
  • 0 kudos

column "id" is of type uuid but expression is of type character varying.

Hello, I'm trying to write to an Azure PostgreSQL Flexible Server database from Azure Databricks, using the PostgreSQL connector in Databricks Runtime 12.2 LTS. I'm using df.write.format("postgresql").save() to write to the PostgreSQL database, but getting the follow...

Latest Reply
noname123
New Contributor II
  • 0 kudos

Had the same problem. You need to add the option "stringtype": "unspecified". Example: df.write.format("postgresql").mode("overwrite").option("truncate", "true").option("stringtype", "unspecified").option("dbtable", table).option("host", host).option("database...
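
Spelled out in full, with hypothetical connection variables, the write looks roughly like this; "stringtype": "unspecified" lets the server infer the column type, so string values can land in a uuid column.

(
    df.write.format("postgresql")
    .mode("overwrite")
    .option("truncate", "true")
    # Let PostgreSQL infer the type instead of binding strings as varchar.
    .option("stringtype", "unspecified")
    .option("dbtable", table)
    .option("host", host)
    .option("port", "5432")
    .option("database", database)
    .option("user", user)
    .option("password", password)
    .save()
)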

2 More Replies
ntvdatabricks
by New Contributor
  • 161 Views
  • 2 replies
  • 1 kudos

Resolved! Okta and Unified login

Hey folks, has anyone put Databricks behind Okta and enabled Unified Login with some workspaces that have a Unity Catalog metastore applied and some that don't? There are some workspaces we can't move over yet, and it isn't clear in the documentation if Unity Catalo...

Latest Reply
Walter_C
Valued Contributor
  • 1 kudos

Yes, users should be able to use a single Okta application for all workspaces, regardless of whether the Unity Catalog metastore has been applied or not. The Unity Catalog is a feature that allows you to manage and secure access to your data across a...

1 More Replies
Khalil
by Contributor
  • 1724 Views
  • 5 replies
  • 7 kudos

Incremental ingestion of Snowflake data with Delta Live Table (CDC)

Hello, I have some data lying in Snowflake, and I want to apply CDC to it using Delta Live Tables, but I am having some issues. Here is what I am trying to do: @dlt.view() def table1(): return spark.read.format("snowflake").options(**opt...

Latest Reply
-werners-
Esteemed Contributor III
  • 7 kudos

The CDC for Delta Live Tables works fine for Delta tables, as you have noticed. However, it is not a full-blown CDC implementation. If you want to capture changes in Snowflake, you will have to implement some CDC method on Snowflake itself, and read...
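
Concretely, one shape that can take: a separate job drains the Snowflake change feed (e.g. a Snowflake STREAM) into a Delta staging table, and the pipeline applies those records with apply_changes. A sketch with hypothetical table and column names:

import dlt
from pyspark.sql.functions import col

@dlt.view
def staged_changes():
    # Delta staging table that another job keeps loading with change
    # records read from Snowflake; the name is a placeholder.
    return spark.readStream.table("staging.snowflake_orders_changes")

# Older DLT runtimes call this dlt.create_target_table.
dlt.create_streaming_table("orders")

dlt.apply_changes(
    target="orders",
    source="staged_changes",
    keys=["order_id"],              # placeholder key column
    sequence_by=col("updated_at"),  # placeholder ordering column
)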

4 More Replies
thethirtyfour
by New Contributor III
  • 440 Views
  • 1 reply
  • 0 kudos

Resolved! Configure Databricks in VSCode through WSL

Hi, I am having a hard time configuring my Databricks workspace when working in VSCode via WSL. When following the steps to set up Databricks authentication, I am receiving the following error on step 5 of "Step 4: Set up Databricks authentication"....

Latest Reply
thethirtyfour
New Contributor III
  • 0 kudos

Scratch that, I found an alternative means of authenticating via this link: Authentication setup for the Databricks extension for Visual Studio Code

Bhavishya
by New Contributor II
  • 710 Views
  • 3 replies
  • 0 kudos

Resolved! Databricks JDBC driver connection issue with Apache Solr

Hi, Databricks JDBC version 2.6.34. I am facing the below issue when connecting to Databricks SQL from Apache Solr: Caused by: java.sql.SQLFeatureNotSupportedException: [Databricks][JDBC](10220) Driver does not support this optional feature. at com.databri...

Latest Reply
Bhavishya
New Contributor II
  • 0 kudos

The Databricks team recommended setting IgnoreTransactions=1 and autocommit=false in the connection string, but that didn't resolve the issue. Ultimately I had to use the Solr update API for uploading documents.
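
For reference, that workaround amounts to POSTing documents straight to Solr's JSON update handler; a sketch with a hypothetical base URL, collection, and document shape:

import requests

SOLR = "http://localhost:8983/solr"         # placeholder base URL
docs = [{"id": "1", "title_s": "example"}]  # placeholder documents

# commit=true makes the documents searchable immediately.
resp = requests.post(f"{SOLR}/my_collection/update?commit=true", json=docs, timeout=30)
resp.raise_for_status()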

2 More Replies
ChristianRRL
by Contributor
  • 155 Views
  • 1 reply
  • 0 kudos

Auto-Update API Data

Not sure if this has come up before, but I'm wondering if Databricks has any kind of functionality to "watch" an API call for changes? E.g., currently I have a frequently running job that pulls data via an API call and overwrites the old data. This see...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @ChristianRRL, Databricks provides a REST API that allows you to interact with various aspects of your Databricks workspace programmatically. While there isn’t a direct built-in feature to “watch” an API call for changes, you can design a solut...
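
One common shape for such a design, sketched with a hypothetical endpoint and target table: poll on a job schedule, hash the payload, and only rewrite when the hash changes (in practice, persist the hash between runs, e.g. in a small table).

import hashlib
import json
from typing import Optional

import requests

def fetch_if_changed(url: str, last_hash: Optional[str]):
    # Returns (payload, new_hash); payload is None when nothing changed.
    payload = requests.get(url, timeout=30).json()
    digest = hashlib.sha256(json.dumps(payload, sort_keys=True).encode()).hexdigest()
    if digest == last_hash:
        return None, last_hash
    return payload, digest

payload, h = fetch_if_changed("https://example.com/api/data", None)  # placeholder URL
if payload is not None:
    # Assumes the endpoint returns a JSON array of records.
    spark.createDataFrame(payload).write.mode("overwrite").saveAsTable("api_data")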

Prashanthkumar
by New Contributor II
  • 1644 Views
  • 3 replies
  • 0 kudos

Is it possible to view Databricks cluster metrics using REST API

I am looking for some help getting Databricks cluster metrics such as memory utilization, CPU utilization, memory swap utilization, and free file system via the REST API. I am trying it in Postman using a Databricks token and with my Service Principal bear...

Latest Reply
Walter_C
Valued Contributor
  • 0 kudos

There is currently no option available to get these metrics through the API, but it is coming soon.

2 More Replies