I am having an issue where, when I do a shallow clone using: create or replace table `catalog_a_test`.`schema_a`.`table_a` shallow clone `catalog_a`.`schema_a`.`table_a`, I get: [TABLE_OR_VIEW_NOT_FOUND] The table or view catalog_a_test.schema_a.table_a...
Hi Steven, this is a really strange issue. First, let's rule out some possible causes. We need to check the following: the permissions on table A and Catalog B. Take a look at the following link to see which permissions are needed: https://docs.d...
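For the permission check, a minimal sketch along these lines can be run from a notebook, assuming Unity Catalog; nothing here is specific to the poster's setup beyond the names quoted in the question:

# Confirm the source table resolves from this context
spark.sql("DESCRIBE TABLE catalog_a.schema_a.table_a").show()

# Inspect grants on the source table and on the target catalog
spark.sql("SHOW GRANTS ON TABLE catalog_a.schema_a.table_a").show()
spark.sql("SHOW GRANTS ON CATALOG catalog_a_test").show()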
I want to cast the data type of a column "X" in a table "A" where column "ID" is defined as GENERATED ALWAYS AS IDENTITY. Databricks refers to an overwrite to achieve this: https://docs.databricks.com/delta/update-schema.html The following operation: (spar...
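For reference, the overwrite pattern that page describes looks roughly like the sketch below; the table and column names follow the question, and the target type "double" is an assumption. Note that a column defined as GENERATED ALWAYS AS IDENTITY rejects explicitly supplied values, which is often what trips up this kind of full-table rewrite:

from pyspark.sql import functions as F

# Read, cast the target column, and overwrite both data and schema
df = spark.read.table("A")
df = df.withColumn("X", F.col("X").cast("double"))  # illustrative target type

(df.write
   .format("delta")
   .mode("overwrite")
   .option("overwriteSchema", "true")
   .saveAsTable("A"))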
Hi Team, we recently created a new Databricks project/solution (based on the Medallion architecture) with Bronze, Silver, and Gold layer tables. We have created a Delta Live Tables pipeline for the Bronze layer implementation. The source files are Parqu...
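For context, a minimal Medallion-style DLT sketch over Parquet sources might look like this; the paths, table, and column names are illustrative, not taken from the post:

import dlt

@dlt.table(comment="Bronze: raw Parquet as landed")
def bronze_orders():
    # Illustrative source path
    return spark.read.format("parquet").load("s3://bucket/raw/orders/")

@dlt.table(comment="Silver: de-duplicated Bronze rows with a key present")
def silver_orders():
    return (
        dlt.read("bronze_orders")
        .dropDuplicates()
        .where("order_id IS NOT NULL")  # hypothetical key column
    )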
Hello everyone, I'm trying to use the ODBC DirectQuery option in Power BI, but I keep getting an error about another command. The SQL query works when run in the SQL editor. Do I need to change the setup of my ODBC connector? DECLARE dateFrom DATE = DA...
Maybe it's because I am new to Databricks that I have this confusion. Suppose I have 64 GB of worker memory in a Databricks job with a maximum of 12 nodes, and my job is failing with Executor Lost due to exit code 137 (OOM, as found on the internet). So, to fix this, I need to increase execut...
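Exit code 137 means the operating system killed the container for exceeding its memory, so before scaling the cluster it is often worth shrinking the work each task holds in memory. A hedged sketch, with illustrative names and numbers:

# More, smaller partitions mean less data per task in memory at once
df = spark.read.table("some_large_table")  # hypothetical input
df = df.repartition(2048)                  # illustrative count; tune to the data

# Fewer concurrent tasks per executor also lowers peak memory; this is a
# cluster-level Spark config, not something to set at runtime:
#   spark.task.cpus 2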
Hi All, please help me understand how billing is calculated for the job cluster. The documentation says it is charged on an hourly basis, so if my job ran for 1 hr 30 mins, will I be charged for the 30 mins based on the hourly rate, or will it be charged f...
Hi, I am migrating from dbx to Databricks Asset Bundles. Previously, with dbx, I could work on different features in separate branches and launch jobs without one job overwriting the other. Now, with Databricks Asset Bundles, it seems like I can...
Any updates here?My team is migrating from dbx to DABs and we are running into the same issue. Ideally, we would like to deploy multiple, parametrized jobs from a single bundle. If this is not possible, we have to keep dbx.Thank you!
Good morning, I'm trying to run:

databricks bundle run --debug -t dev integration_tests_job

My bundle looks like this:

bundle:
  name: x
include:
  - ./resources/*.yml
targets:
  dev:
    mode: development
    default: true
    workspace:
      host: x
    r...
Getting the following error while saving a dataframe partitioned by two columns. Job aborted due to stage failure: Task 5774 in stage 33.0 failed 4 times, most recent failure: Lost task 5774.3 in stage 33.0 (TID 7736) (13.2.96.110 executor 7): ExecutorLos...
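One common cause of ExecutorLost during a partitioned write is that each task holds an open writer for every output partition it touches; repartitioning by the same columns first is a frequent mitigation. A sketch with illustrative names:

# Route all rows of one output partition to a single task before writing
(df.repartition("col_a", "col_b")   # hypothetical partition columns
   .write
   .format("delta")
   .partitionBy("col_a", "col_b")
   .mode("overwrite")
   .save("/mnt/output/my_table"))   # hypothetical path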
I am currently exploring testing methodologies for Databricks notebooks and would like to ask whether it is possible to write pytest tests for notebooks that contain code not encapsulated within functions or classes.

a = 4
b ...
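The usual workaround is to move the top-level statements into functions in a plain .py module, import that module from both the notebook and the tests, and assert on the results. A sketch, with names and values that are illustrative rather than taken from the truncated snippet:

# notebook_logic.py (hypothetical shared module)
def compute(a: int, b: int) -> int:
    return a + b

# test_notebook_logic.py
from notebook_logic import compute

def test_compute():
    assert compute(4, 2) == 6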
Hi Team, please provide guidance on enabling parallel execution of SQL cells in a notebook containing multiple SQL cells. Currently, when we execute the notebook, all the SQL cells run sequentially. I would appreciate assistance on how to execute th...
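SQL cells in a notebook always run one after another, but independent statements can be driven concurrently from a single Python cell, and Spark will schedule the resulting jobs in parallel. A sketch with illustrative queries:

from concurrent.futures import ThreadPoolExecutor

queries = [
    "SELECT COUNT(*) FROM schema_a.table_1",  # illustrative statements
    "SELECT COUNT(*) FROM schema_a.table_2",
]

# Each thread submits one statement; `spark` is the session already in
# scope in a Databricks notebook.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(lambda q: spark.sql(q).collect(), queries))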
I need help with migrating from DBFS on Databricks to the workspace. I am new to Databricks and am struggling with what is in the links provided. My workspace.yml also has DBFS hard-coded. Included is a full deployment with Great Expectations. This was don...
There are multiple tables in the config/metadata table. These tables need to be validated against DQ rules:
1. Natural Key / Business Key / Primary Key cannot be null or blank.
2. Natural Key / Primary Key cannot be duplicated.
3. Join columns missing values.
4. Busine...
Hi @subha2, to dynamically validate the data quality (DQ) rules for tables configured in a metadata-driven system using PySpark, you can follow these steps:
Define Metadata for Tables:
First, create a metadata configuration that describes the rules ...
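A minimal sketch of that idea for the first two rules, assuming the metadata reduces to (table, key columns) pairs; all names are illustrative:

from pyspark.sql import functions as F

meta = [("schema_a.orders", ["order_id"])]  # hypothetical metadata rows

for table_name, keys in meta:
    df = spark.read.table(table_name)
    # Rule 1: key columns must not be null or blank
    null_or_blank = df.filter(
        " OR ".join(f"({k} IS NULL OR trim({k}) = '')" for k in keys)
    ).count()
    # Rule 2: key columns must be unique
    duplicate_keys = (
        df.groupBy(*keys).count().filter(F.col("count") > 1).count()
    )
    print(table_name, "null/blank:", null_or_blank, "duplicates:", duplicate_keys)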
# PySpark helpers and Delta Live Tables
from pyspark.sql import functions as F
from pyspark.sql import types as T
from pyspark.sql import DataFrame, Column
from pyspark.sql.types import Row
import dlt

# Root S3 locations for the raw data and for schema tracking
S3_PATH = 's3://datalake-lab/XXXXX/'
S3_SCHEMA = 's3://datalake-lab/XXXXX/schemas/'
...
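The snippet is cut off, but with those imports and constants a Bronze table along the lines below would be a natural continuation; this is an assumption, not the poster's actual code:

# Hypothetical continuation: stream raw files from S3_PATH with Auto Loader,
# tracking inferred schemas under S3_SCHEMA
@dlt.table(comment="Bronze ingest from the S3 landing zone")
def bronze_raw():
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "parquet")  # source format assumed
        .option("cloudFiles.schemaLocation", S3_SCHEMA)
        .load(S3_PATH)
    )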