Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

explorer
by New Contributor III
  • 3233 Views
  • 4 replies
  • 1 kudos

Resolved! Deleting records manually in a Databricks streaming table

Hi Team, let me know if there is any way I can delete records manually from a Databricks streaming table without corrupting the table and data. Can we delete a few records (based on some condition) manually in a Databricks streaming table (having checkpoi...

Latest Reply
JunYang
New Contributor III
  • 1 kudos

  If you use the applyChanges method in DLT for Change Data Capture (CDC), you can delete records manually without affecting the consistency of the table, as applyChanges respects manual deletions. You must configure your DLT pipeline to respect manu...
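For reference, a minimal sketch of the apply_changes pattern the reply refers to, where deletes flow through the change feed into the streaming target. The source table and column names (raw.customer_cdc, id, sequence_num, operation) are placeholders, not from the thread:

```python
import dlt
from pyspark.sql.functions import col, expr

# Hypothetical CDC feed; table and column names are placeholders.
@dlt.view
def cdc_source():
    return spark.readStream.table("raw.customer_cdc")

# Target streaming table maintained by apply_changes.
dlt.create_streaming_table("customers")

dlt.apply_changes(
    target="customers",
    source="cdc_source",
    keys=["id"],                                   # key used to match rows
    sequence_by=col("sequence_num"),               # ordering column for out-of-order events
    apply_as_deletes=expr("operation = 'DELETE'")  # rows flagged as deletes remove the record
)
```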

3 More Replies
nadia
by New Contributor II
  • 18832 Views
  • 3 replies
  • 2 kudos

Resolved! Executor heartbeat timed out

Hello, I'm trying to read a table located in PostgreSQL that contains 28 million rows. I get the following error: "SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in sta...

Latest Reply
JunYang
New Contributor III
  • 2 kudos

Please also review the Spark UI for the failed Spark job and stage: check the GC time, the data spill to memory and disk, and any error on the failed task in the stage view. This will confirm data skew or GC/memory...
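Since the failing job reads 28 million rows over JDBC, a single-task read is a common cause of executor heartbeat timeouts. A rough sketch of a partitioned JDBC read; the connection details, table name, and id bounds are placeholders, not from the thread:

```python
# Split the PostgreSQL read across several tasks instead of one oversized task.
df = (spark.read.format("jdbc")
      .option("url", "jdbc:postgresql://<host>:5432/<db>")  # placeholder connection
      .option("dbtable", "public.big_table")                # hypothetical table name
      .option("user", "<user>")
      .option("password", "<password>")
      .option("partitionColumn", "id")      # assumes a roughly uniform numeric key
      .option("lowerBound", "1")
      .option("upperBound", "28000000")
      .option("numPartitions", "16")        # 16 parallel, smaller queries
      .load())
```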

2 More Replies
Chris_Konsur
by New Contributor III
  • 15104 Views
  • 4 replies
  • 6 kudos

Resolved! Error: The associated location ... is not empty but it's not a Delta table

I'm trying to create a table but I get this error: AnalysisException: Cannot create table ('`spark_catalog`.`default`.`citation_all_tenants`'). The associated location ('dbfs:/user/hive/warehouse/citation_all_tenants') is not empty but it's not a Delta t...

Latest Reply
sachin_tirth
New Contributor II
  • 6 kudos

Hi Team, I am facing the same issue. When we try to load data into a table in the production batch, we get an error saying the table is not in Delta format. There is no recent change to the table, and we are not running any CREATE OR REPLACE TABLE; this is an existing table in pr...
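When the original error appears because stale non-Delta files are left at the managed location, a cautious sketch is to inspect the path first and only then clear it. The path comes from the error message; deleting is irreversible, so this is purely an illustration:

```python
location = "dbfs:/user/hive/warehouse/citation_all_tenants"  # path from the error message

# Inspect what is actually sitting at the location before doing anything destructive.
display(dbutils.fs.ls(location))

# Only after confirming the leftover files are disposable:
# dbutils.fs.rm(location, True)   # remove the non-Delta leftovers, then recreate the table
```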

3 More Replies
wyzer
by Contributor II
  • 6357 Views
  • 8 replies
  • 4 kudos

Resolved! How to pass parameters in SSRS/Power BI (report builder) ?

Hello, in SSRS/Power BI (Report Builder), how do you query a table in Databricks with parameters? This code doesn't work: SELECT * FROM TempBase.Customers WHERE Name = {{ @P_Name }}. Thanks.

Latest Reply
Nj11
New Contributor II
  • 4 kudos

Hi, I am not able to see the data in SSRS when I use date parameters, but with hard-coded dates the data populates fine. The data source points to Databricks. I am not sure what I am missing here; please help. Thanks. I am trying with que...

7 More Replies
Abbe
by New Contributor II
  • 1730 Views
  • 2 replies
  • 0 kudos

Update the data type of a column within a table that has a GENERATED ALWAYS AS IDENTITY column

I want to cast the data type of a column "X" in a table "A" where column "ID" is defined as GENERATED ALWAYS AS IDENTITY. The Databricks docs refer to an overwrite to achieve this: https://docs.databricks.com/delta/update-schema.html. The following operation: (spar...

Latest Reply
RajuBolla
New Contributor II
  • 0 kudos

UPDATE is not working, but DELETE is when I changed to the DEFAULT property. AnalysisException: UPDATE on IDENTITY column "XXXX_ID" is not supported.
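For the overwrite route the question links to, a rough sketch is below, assuming column "X" is being cast to DOUBLE. Note that rewriting the table this way does not preserve the GENERATED ALWAYS AS IDENTITY definition on "ID"; it would have to be redefined if it must be kept:

```python
from pyspark.sql.functions import col

# Rewrite table A with X cast to a new type (DOUBLE is an assumed target type).
# Caveat: the identity property on ID is lost by a schema overwrite.
(spark.read.table("A")
    .withColumn("X", col("X").cast("double"))
    .write
    .format("delta")
    .mode("overwrite")
    .option("overwriteSchema", "true")
    .saveAsTable("A"))
```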

1 More Replies
MBV3
by New Contributor III
  • 9513 Views
  • 6 replies
  • 7 kudos

Resolved! External table from parquet partition

Hi, I have data in Parquet format in GCS buckets, partitioned by name, e.g. gs://mybucket/name=ABCD/. I am trying to create a table in Databricks as follows: DROP TABLE IF EXISTS name_test; CREATE TABLE name_test USING parquet LOCATION "gs://mybucket/name=*/...

Latest Reply
Pat
Honored Contributor III
  • 7 kudos

Hi @M Baig, the error doesn't tell me much, but you could try:
CREATE TABLE name_test USING parquet PARTITIONED BY (name STRING) LOCATION "gs://mybucket/";
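Building on the reply above, a rough end-to-end sketch. The data columns (id, value) are hypothetical since the real schema isn't shown in the thread, and existing name=.../ directories still have to be registered after the table is created:

```python
spark.sql("DROP TABLE IF EXISTS name_test")

spark.sql("""
    CREATE TABLE name_test (id STRING, value DOUBLE, name STRING)  -- hypothetical schema
    USING parquet
    PARTITIONED BY (name)
    LOCATION 'gs://mybucket/'
""")

# Discover the existing name=ABCD/... partition directories.
spark.sql("MSCK REPAIR TABLE name_test")
```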

5 More Replies
AkifCakir
by New Contributor II
  • 18090 Views
  • 4 replies
  • 3 kudos

Resolved! Why does Spark save mode "overwrite" always drop the table even though "truncate" is true?

Hi Dear Team, I am trying to load data from Databricks into Exasol DB. I am using the following code with Spark version 3.0.1: dfw.write \ .format("jdbc") \ .option("driver", exa_driver) \ .option("url", exa_url) \ .option("db...

Latest Reply
Gembo
New Contributor II
  • 3 kudos

@AkifCakir, were you able to find a way to truncate without dropping the table using the .write function? I am facing the same issue as well.
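For anyone landing here, a rough sketch of the write pattern being discussed. Whether "truncate" is actually honored depends on Spark's JDBC dialect for the target database; if the dialect cannot guarantee a safe truncate, Spark falls back to drop-and-recreate. dfw, exa_driver, and exa_url are from the original post; the table name is a placeholder:

```python
(dfw.write
    .format("jdbc")
    .option("driver", exa_driver)
    .option("url", exa_url)
    .option("dbtable", "MY_SCHEMA.MY_TABLE")  # placeholder target table
    .option("truncate", "true")               # ask Spark to TRUNCATE instead of DROP/CREATE
    .mode("overwrite")
    .save())
```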

3 More Replies
Graham
by New Contributor III
  • 5517 Views
  • 5 replies
  • 2 kudos

"MERGE" always slower than "CREATE OR REPLACE"

Overview: To update our Data Warehouse tables, we have tried two methods: "CREATE OR REPLACE" and "MERGE". With every query we've tried, "MERGE" is slower. My question is this: has anyone successfully gotten a "MERGE" to perform faster than a "CREATE OR...

Latest Reply
Manisha_Jena
New Contributor III
  • 2 kudos

Hi @Graham, can you please try Low Shuffle Merge (LSM) and see if it helps? LSM is a new MERGE algorithm that aims to maintain the existing data organization (including z-order clustering) for unmodified data, while simultaneously improving performan...
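A minimal sketch of trying Low Shuffle Merge, assuming placeholder table names. On recent Databricks Runtime versions LSM is already on by default, so the config line only matters on older runtimes:

```python
# Enable Low Shuffle Merge explicitly (default on newer DBR versions).
spark.conf.set("spark.databricks.delta.merge.enableLowShuffle", "true")

spark.sql("""
    MERGE INTO warehouse.dim_customer AS t        -- placeholder target
    USING staging.dim_customer_updates AS s       -- placeholder source
    ON t.customer_id = s.customer_id
    WHEN MATCHED THEN UPDATE SET *
    WHEN NOT MATCHED THEN INSERT *
""")
```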

4 More Replies
my_community2
by New Contributor III
  • 10410 Views
  • 8 replies
  • 6 kudos

Resolved! dropping a managed table does not remove the underlying files

The documentation states that DROP TABLE "Deletes the table and removes the directory associated with the table from the file system if the table is not an EXTERNAL table. An exception is thrown if the table does not exist." In case of an external table...

Latest Reply
MajdSAAD_7953
New Contributor II
  • 6 kudos

Hi, is there a way to force-delete files after dropping the table, rather than waiting 30 days to see the size in S3 decrease? The tables I dropped relate to dev and staging; I don't want to keep their files for 30 days.
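For hive_metastore tables where you control the underlying storage, a rough sketch is to capture the location before dropping and then remove the files yourself. Unity Catalog managed tables handle cleanup on their own schedule, and deleting paths in shared storage is irreversible, so treat this purely as an illustration; the table name is a placeholder:

```python
# Grab the physical location before the metadata disappears.
location = (spark.sql("DESCRIBE DETAIL dev_schema.my_old_table")  # placeholder table
                 .collect()[0]["location"])

spark.sql("DROP TABLE dev_schema.my_old_table")

# Remove the data files immediately instead of waiting for retention cleanup.
dbutils.fs.rm(location, True)   # verify the path first; this cannot be undone
```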

7 More Replies
Juha
by New Contributor II
  • 1790 Views
  • 3 replies
  • 2 kudos
Latest Reply
lawrence009
Contributor
  • 2 kudos

Have you figured out what the problem was? Could the issue be permission related?

2 More Replies
HariharaSam
by Contributor
  • 74024 Views
  • 6 replies
  • 3 kudos

Resolved! Alter Delta table column datatype

Hi, I have a Delta table that contains data, and I need to alter the data type of a particular column. For example: consider table A with a column Amount of data type DECIMAL(9,4). I need to alter the Amount column data type from...

Latest Reply
saipujari_spark
Valued Contributor
  • 3 kudos

Hi @HariharaSam, the following page documents how to alter a Delta table schema: https://docs.databricks.com/delta/update-schema.html
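Per that page, Delta does not change a column's decimal precision in place; the usual route is to rewrite the table with the new type. A rough sketch, writing to a new table name so the original isn't read and replaced in the same statement; DECIMAL(18,4) and A_new are assumptions, not from the thread:

```python
# Rewrite with Amount cast to the wider type; the remaining columns carry over unchanged.
spark.sql("""
    CREATE OR REPLACE TABLE A_new AS
    SELECT CAST(Amount AS DECIMAL(18,4)) AS Amount, * EXCEPT (Amount)
    FROM A
""")
# After validating A_new, swap it in for A (e.g. rename it or run a final CTAS back to A).
```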

5 More Replies
Eelke
by New Contributor II
  • 6234 Views
  • 3 replies
  • 0 kudos

I want to perform interpolation on a streaming table in delta live tables.

I have the following code:

%pip install dbl-tempo
from tempo import TSDF
from pyspark.sql.functions import *

# interpolate target_cols columns linearly for a TSDF dataframe
def interpolate_tsdf(tsdf_data, target_c...

Latest Reply
Eelke
New Contributor II
  • 0 kudos

The issue was not resolved because we were trying to use a streaming table within TSDF which does not work.
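A rough sketch of the batch-side workaround implied here: interpolate against a static read of the table rather than a streaming one. Column names (event_ts, id, value) and the table name are assumptions, and the interpolate arguments follow the dbl-tempo documentation, so they may need adjusting for the installed version:

```python
# %pip install dbl-tempo   (run in its own notebook cell first)
from tempo import TSDF

# Static (batch) read instead of a streaming table, which TSDF does not support.
df = spark.read.table("my_schema.sensor_readings")   # hypothetical table

tsdf = TSDF(df, ts_col="event_ts", partition_cols=["id"])

interpolated_df = tsdf.interpolate(
    freq="1 minute",        # resample grid
    func="mean",            # aggregation within each interval
    target_cols=["value"],  # columns to fill
    method="linear",        # linear interpolation between known points
).df
```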

2 More Replies
HariharaSam
by Contributor
  • 20819 Views
  • 10 replies
  • 4 kudos

Resolved! How to get the number of rows inserted after performing an INSERT operation into a table

Consider we have two tables, A and B.

qry = """INSERT INTO Table A
Select * from Table B where Id is null"""
spark.sql(qry)

I need to get the number of records inserted after running this in Databricks.

Latest Reply
GRCL
New Contributor III
  • 4 kudos

Almost the same advice as Hubert: I use the history of the Delta table: df_history.select(F.col('operationMetrics')).collect()[0].operationMetrics['numOutputRows']. You can also find other 'operationMetrics' values, like 'numTargetRowsDeleted'.
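A slightly fuller sketch of the same idea using the DeltaTable API; the table name A comes from the question, and the metric applies to the most recent operation on the table:

```python
from delta.tables import DeltaTable

# Read the most recent history entry for table A and pull the row-count metric.
last_op = DeltaTable.forName(spark, "A").history(1).collect()[0]
num_inserted = last_op["operationMetrics"].get("numOutputRows")
print(last_op["operation"], num_inserted)
```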

9 More Replies
tototox
by New Contributor III
  • 11492 Views
  • 3 replies
  • 2 kudos

how to check table size by partition?

I want to check the size of a Delta table by partition. As you can see, only the size of the whole table can be checked, not the size of each partition.

Latest Reply
Anonymous
Not applicable
  • 2 kudos

@jin park: You can use the Databricks Delta Lake SHOW TABLE EXTENDED command to get the size of each partition of the table. Here's an example: %sql SHOW TABLE EXTENDED LIKE '<table_name>' PARTITION (<partition_column> = '<partition_value>') SELECT...
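Another rough way to get per-partition sizes, for a Delta table laid out with Hive-style partition directories, is to sum the file sizes under each partition folder. The path is a placeholder, and the totals include files not yet removed by VACUUM:

```python
table_path = "dbfs:/mnt/datalake/my_table"   # placeholder table location

def dir_size(path):
    """Recursively sum file sizes under a directory."""
    total = 0
    for f in dbutils.fs.ls(path):
        total += dir_size(f.path) if f.isDir() else f.size
    return total

# Partition directories look like <column>=<value>/ ; this skips _delta_log and loose files.
for entry in dbutils.fs.ls(table_path):
    if entry.isDir() and "=" in entry.name:
        print(entry.name, dir_size(entry.path), "bytes")
```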

2 More Replies