Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Anonymous
by Not applicable
  • 1897 Views
  • 3 replies
  • 19 kudos

Resolved! Welcome back! Please introduce yourself to the community. :)

Hello everyone! My name is Piper and I'm one of the community moderators for Databricks. I'd like to take this opportunity to welcome you to the new Databricks community! I'd also like to ask you to introduce yourself in this thread. We are here to h...

Latest Reply
cconnell
Contributor II
  • 19 kudos

I work mostly with health and medical data, on a contract or project basis. I am located in Bedford MA and Ogunquit Maine. I formerly worked at Blue Metal / Insight, which is where I got my start on Databricks. Languages -- Python, PySpark, Koalas. http...

manugarri
by New Contributor II
  • 11267 Views
  • 10 replies
  • 1 kudos

Fuzzy text matching in Spark

I have a list of client-provided data: a list of company names. I have to match those names against an internal database of company names. The client list can fit in memory (it's about 10k elements) but the internal dataset is on HDFS and we use Spark ...

Latest Reply
Sonal
New Contributor II
  • 1 kudos

You can use Zingg, a Spark-based open source tool, for this: https://github.com/zinggAI/zingg

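For readers who want a concrete starting point, here is a minimal sketch of the broadcast-plus-edit-distance approach often used for this kind of matching, built on Spark's built-in levenshtein function. The DataFrame names, column names, path, and threshold below are all placeholder assumptions; for serious entity resolution, a dedicated tool such as Zingg (linked above) is the more complete option.

import org.apache.spark.sql.functions.{broadcast, col, levenshtein, lower, trim}
import spark.implicits._ // `spark` is the notebook's SparkSession

// Placeholder inputs: the ~10k client names fit in memory; the internal
// company list is the large dataset read from HDFS.
val clientDF = Seq("Acme Corp", "Globex LLC").toDF("client_name")
val internal = spark.read.parquet("/path/to/internal/company_names")

// Broadcast the small side so the cross join stays cheap, then keep only
// pairs whose edit distance falls under a threshold (3 is arbitrary; tune it).
val matches = internal
  .crossJoin(broadcast(clientDF))
  .withColumn("dist", levenshtein(lower(trim(col("company_name"))),
                                  lower(trim(col("client_name")))))
  .filter(col("dist") <= 3)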
Sam
by New Contributor III
  • 1389 Views
  • 0 replies
  • 0 kudos

Can Admins enable Table Download on Sample but not on Full Dataset?

Is it possible to allow table download on a sampled dataset but not the full dataset? In the configuration settings it seems like you have to allow both? Notwithstanding the fact people could loop through the sample download, it seems like a prud...

saniafatimi
by New Contributor II
  • 1311 Views
  • 1 reply
  • 1 kudos

Need guidance on migrating Power BI reports to Databricks

Hi All, I want to import an existing database/tables (say AdventureWorks) to Databricks. After importing the tables, I want to develop reports on top. I need guidance on this. Can someone give me resources that could help me in doing things end to en...

Latest Reply
Chris_Shehu
Valued Contributor III
  • 1 kudos

@sania fatimi There are several different ways to do this, and it's really going to depend on what your current need is. You could, for example, load the data into Databricks Delta Lake and use the Databricks Power BI connector to query the data fr...

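To make that concrete, here is a hedged sketch of the path Chris describes: pull the source tables over JDBC, land them in Delta, and point Power BI at Databricks through its connector. The host, credentials, and target names are placeholders; SalesLT.Customer is one of the AdventureWorksLT sample tables.

// Placeholder connection details for a SQL Server hosting AdventureWorksLT.
val jdbcUrl = "jdbc:sqlserver://<host>:1433;databaseName=AdventureWorksLT"

val customers = spark.read
  .format("jdbc")
  .option("url", jdbcUrl)
  .option("dbtable", "SalesLT.Customer")
  .option("user", "<user>")
  .option("password", "<password>")
  .load()

// Land the data in Delta; Power BI then queries this table through the
// Databricks connector instead of hitting the source system directly.
customers.write.format("delta").mode("overwrite").saveAsTable("adventureworks.customer")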
User16830818524
by New Contributor II
  • 1732 Views
  • 3 replies
  • 0 kudos

Resolved! Libraries in Databricks Runtimes

Is it possible to easily determine which libraries and versions are included in a specific DBR version?

Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hello. My name is Piper and I'm one of the community moderators. One of the team members sent this information to me. This should be the correct path to check libraries installed with DBRs. https://docs.databricks.com/release-notes/runtime/8.3ml.html?_...

Rodrigo_Brandet
by New Contributor
  • 3548 Views
  • 3 replies
  • 4 kudos

Resolved! Upload CSV files on Databricks by code (not UI)

Hello everyone. I have a process on Databricks where I need to upload a CSV file manually every day. I would like to know if there is a way to import this data (as pandas in Python, for example) without needing to upload this file manually every day util...

Latest Reply
-werners-
Esteemed Contributor III
  • 4 kudos

Autoloader is indeed a valid option, or use some kind of ETL tool that fetches the file and puts it somewhere on your cloud provider, like Azure Data Factory or AWS Glue, etc.

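As a concrete illustration of the Auto Loader suggestion, here is a minimal sketch; the landing path, schema and checkpoint locations, and target table are placeholder assumptions. Auto Loader picks up whatever new files have arrived, so the daily manual upload becomes a scheduled job.

import org.apache.spark.sql.streaming.Trigger

// Placeholder paths: the daily CSV is assumed to land in cloud storage.
val df = spark.readStream
  .format("cloudFiles")
  .option("cloudFiles.format", "csv")
  .option("cloudFiles.schemaLocation", "/mnt/landing/_schemas/daily_csv")
  .option("header", "true")
  .load("/mnt/landing/daily_csv/")

// AvailableNow (needs a recent runtime) processes everything that has
// arrived and then stops, which suits a once-a-day scheduled job.
df.writeStream
  .format("delta")
  .option("checkpointLocation", "/mnt/landing/_checkpoints/daily_csv")
  .trigger(Trigger.AvailableNow())
  .toTable("staging.daily_upload")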
Zen
by New Contributor III
  • 3736 Views
  • 2 replies
  • 3 kudos

Resolved! ssh onto Cluster as root

Hello, I'm following the instructions here: https://docs.databricks.com/clusters/configure.html?_ga=2.17611385.1712747127.1631209439-1615211488.1629573963#ssh-access-to-clusters to ssh onto the Driver node, and it's working perfectly when I ssh on as `...

Latest Reply
cconnell
Contributor II
  • 3 kudos

I am 99% sure that logging into a Databricks node as root will not be allowed.

Nyarish
by Contributor
  • 11180 Views
  • 17 replies
  • 18 kudos

Resolved! How to connect Neo4j aura to a cluster

Please help resolve this error: org.neo4j.driver.exceptions.SecurityException: Failed to establish secured connection with the server. This occurs when I try to establish a connection from my cluster to Neo4j Aura. Thank you.

Latest Reply
Anonymous
Not applicable
  • 18 kudos

@Werner Stinckens and @Nyaribo Maseru - You two are awesome! Thank you for working so hard together.

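For anyone landing on this thread with the same SecurityException: a frequent cause is connecting to Aura without the TLS-carrying neo4j+s:// URI scheme. Below is a hedged sketch using the Neo4j Spark connector (assumed installed on the cluster); the URI, credentials, and node label are placeholders.

// Aura only accepts TLS connections, so the URI must use the
// `neo4j+s://` scheme rather than plain `neo4j://` or `bolt://`.
val people = spark.read
  .format("org.neo4j.spark.DataSource")
  .option("url", "neo4j+s://<dbid>.databases.neo4j.io")
  .option("authentication.basic.username", "neo4j")
  .option("authentication.basic.password", "<password>")
  .option("labels", "Person") // placeholder node label
  .load()

people.show()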
Anonymous
by Not applicable
  • 1486 Views
  • 2 replies
  • 0 kudos

Resolved! What are the advantages of using Delta if I am using MLflow? How is Delta useful for DS/ML use cases?

I am already using MLflow. What benefit would Delta provide me, since I am not really working on data engineering workloads?

Latest Reply
Sebastian
Contributor
  • 0 kudos

The most important aspect is that your experiment can track the version of the data table, so during audits you will be able to trace back why a specific prediction was made.

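Sebastian's point rests on Delta time travel: every write produces a numbered table version that can be re-read later. A minimal sketch, with path and version as placeholders; recording the version alongside an MLflow run (for example as a tag) is what makes the audit trail possible.

// Re-read the table exactly as it was at version 12 (placeholder values).
val snapshot = spark.read
  .format("delta")
  .option("versionAsOf", 12)
  .load("/mnt/features/training_set")

// Timestamp-based time travel works the same way:
val asOfDate = spark.read
  .format("delta")
  .option("timestampAsOf", "2021-09-01")
  .load("/mnt/features/training_set")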
brickster_2018
by Databricks Employee
  • 2718 Views
  • 2 replies
  • 3 kudos

Resolved! What is the best file format for a temporary table?

As part of my ETL process, I create intermediate/staging temporary tables. These tables are read at a later point in the ETL and finally cleaned up. Should I use Delta? Using Delta creates the overhead of running optimize jobs, which would de...

Latest Reply
Sebastian
Contributor
  • 3 kudos

Agreed. Intermediate Delta tables help, since they bring reliability to the pipeline.

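A small sketch of the staging pattern under discussion, with placeholder paths and a placeholder DataFrame name; Delta's ACID writes mean a failed stage can simply be rerun, and tables this short-lived rarely need OPTIMIZE before they are cleaned up.

// `stagingDF` stands in for the intermediate result of an ETL step.
stagingDF.write.format("delta").mode("overwrite").save("/tmp/etl/stage_orders")

// Later steps read the staged data back...
val staged = spark.read.format("delta").load("/tmp/etl/stage_orders")

// ...and the table is removed once the pipeline finishes.
dbutils.fs.rm("/tmp/etl/stage_orders", recurse = true)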
Nyarish
by Contributor
  • 797 Views
  • 0 replies
  • 0 kudos

How to connect Neo4j aura to Databricks connection Error

I get this error: org.neo4j.driver.exceptions.SecurityException: Failed to establish secured connection with the server. I have tried to read through the documentation and tried the suggested solution, but I can't seem to crack this problem. Kindly help. ...

Zircoz
by New Contributor II
  • 12743 Views
  • 2 replies
  • 6 kudos

Resolved! Can we access the variables created in Python in Scala's code or notebook ?

If I have a dict created in Python in a Scala notebook (using the magic word, of course): %python d1 = {1: "a", 2: "b", 3: "c"}. Can I access this d1 in Scala? I tried the following and it returns "d1 not found": %scala println(d1)

Latest Reply
cpm1
New Contributor II
  • 6 kudos

Martin is correct. We could only access the external files and objects. In most of our cases, we just use temporary views to pass data between R & Python. https://docs.databricks.com/notebooks/notebooks-use.html#mix-languages

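As a concrete version of the temp-view handoff cpm1 links to: variables themselves never cross the language boundary, but a registered view does, because all cells share one SparkSession. The view and column names below are placeholders.

// In a %python cell, register the dict as a temp view:
//   d1 = {1: "a", 2: "b", 3: "c"}
//   spark.createDataFrame(list(d1.items()), ["k", "v"]) \
//        .createOrReplaceTempView("d1_view")

// In a %scala cell, read it back through the shared SparkSession:
val d1 = spark.table("d1_view")
d1.show()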
Anonymous
by Not applicable
  • 2576 Views
  • 1 reply
  • 2 kudos

Are there any costs or quotas associated with the Databricks managed Hive metastore?

When using the default Hive metastore that is managed within the Databricks control plane, are there any associated costs? I.e., if I switched to an external metastore, would I expect to see any reduction in my Databricks cost (ignoring total costs)? Do ...

Latest Reply
Ryan_Chynoweth
Esteemed Contributor
  • 2 kudos

There are no costs associated with using the Databricks-managed Hive metastore directly. Databricks pricing is based on compute consumption, not on data storage or access. The only real cost would be the compute used to access the data. I would not expe...

Techmate
by New Contributor
  • 1185 Views
  • 1 reply
  • 0 kudos

Populating an array of date tuples in Scala

Hi Friends, I am trying to pass a list of date ranges that needs to be in the below format: val predicates = Array("2021-05-16" -> "2021-05-17", "2021-05-18" -> "2021-05-19", "2021-05-20" -> "2021-05-21"). I am then using map to create a range of conditions that...

Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

So basically this can be done by generating 2 lists which are then zipped. One list contains the first dates of the tuples, so these are in your case 2 days apart. The other list contains the 2nd dates of the tuples, also 2 days apart. Now we need a function ...

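Here is one way to write out the zip approach -werners- sketches; the start date and range count are placeholders matching the example in the question.

import java.time.LocalDate

// First dates of each tuple, two days apart (placeholder start and count).
val start = LocalDate.parse("2021-05-16")
val firstDates = (0 until 3).map(i => start.plusDays(2L * i))
// Second dates: each one day after its partner.
val secondDates = firstDates.map(_.plusDays(1))

// Zip the two lists into the Array of (from, to) pairs the JDBC predicates expect.
val predicates = firstDates.zip(secondDates)
  .map { case (a, b) => a.toString -> b.toString }
  .toArray
// Array(("2021-05-16","2021-05-17"), ("2021-05-18","2021-05-19"), ("2021-05-20","2021-05-21"))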
