Machine Learning

Forum Posts

MadelynM
by New Contributor III
  • 1083 Views
  • 0 replies
  • 2 kudos


Thanks to everyone who joined the Best Practices for Your Data Architecture session on Optimizing Data Performance. You can access the on-demand session recording here and the pre-run performance benchmarks using the Spark UI Simulator. Proper cluste...

User16826994223
by Honored Contributor III
  • 315 Views
  • 0 replies
  • 1 kudos

docs.databricks.com

Advantage of using Photon Engine

The following summarizes the advantages of Photon:
Supports SQL and equivalent DataFrame operations against Delta and Parquet tables.
Expected to accelerate queries that process a significant amount of data (100 GB+) and ...

MoJaMa
by Valued Contributor II
  • 593 Views
  • 1 replies
  • 0 kudos
Latest Reply
MoJaMa
Valued Contributor II
  • 0 kudos

Data is stored in the data plane. Metadata (e.g., feature table descriptions, column types, etc.) is stored in the control plane. The location where the Delta table is stored is determined by the database location. The customer could call CREATE DATA...

User16826994223
by Honored Contributor III
  • 1493 Views
  • 1 replies
  • 0 kudos

Delta Lake MERGE INTO statement error

I'm trying to run a Delta Lake merge:

MERGE INTO source USING updates ON source.d = updates.sessionId WHEN MATCHED THEN UPDATE * WHEN NOT MATCHED THEN INSERT *

I'm getting a SQL error:

ParseException: mismatched input 'MERGE' expecting {'(', 'SELECT', 'FR...

Latest Reply
User16826994223
Honored Contributor III
  • 0 kudos

SQL support for MERGE was added in Delta Lake 0.7.0. You also need to upgrade Apache Spark to 3.0.0 and enable the integration with Apache Spark DataSourceV2 and C...
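
As a rough sketch of what this reply describes, the snippet below shows the two session settings that the Delta Lake documentation lists for SQL support on Spark 3.0. The helper name `build_delta_session` and the app name are hypothetical, and it assumes `pyspark` and Delta Lake 0.7.0 or later are installed. The statement is written with `UPDATE SET *`, following the Delta Lake documentation rather than the question as posted.

```python
# Settings the Delta Lake docs list for enabling SQL commands such as MERGE
# on Apache Spark 3.0 with Delta Lake 0.7.0 or later.
DELTA_SQL_CONFS = {
    "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
    "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
}

# The statement from the question, written with UPDATE SET * (assumed fix).
MERGE_SQL = """
MERGE INTO source USING updates
ON source.d = updates.sessionId
WHEN MATCHED THEN UPDATE SET *
WHEN NOT MATCHED THEN INSERT *
"""


def build_delta_session(app_name="delta-merge-example"):
    """Hypothetical helper: create a SparkSession with the Delta settings applied."""
    from pyspark.sql import SparkSession  # imported lazily; requires pyspark

    builder = SparkSession.builder.appName(app_name)
    for key, value in DELTA_SQL_CONFS.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

With a session built this way, `build_delta_session().sql(MERGE_SQL)` should reach the Delta parser instead of failing on the `MERGE` keyword, provided the `source` and `updates` tables exist.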

User16826994223
by Honored Contributor III
  • 1085 Views
  • 1 replies
  • 0 kudos

Using vacuum with a dry run in Python for a Delta Lake table

I can see an example of how to call the vacuum function for a Delta Lake table in Python here. How can I do the equivalent of the following from Python?

%sql VACUUM delta.`dbfs:/mnt/<myfolder>` DRY RUN

Latest Reply
User16826994223
Honored Contributor III
  • 0 kudos

A dry run from non-SQL code is not yet available in Delta Lake 0.8. There is an open bug for this in the Delta Lake open source repository on GitHub. Hopefully it gets resolved soon.
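
One possible workaround, sketched here under assumptions, is to send the same SQL statement from Python through `spark.sql` rather than the Python `DeltaTable` API. The helper names are hypothetical, `spark` is assumed to be an existing SparkSession with Delta SQL support enabled, and the path is the placeholder from the question.

```python
def vacuum_dry_run_sql(path):
    """Build the same VACUUM ... DRY RUN statement used in the %sql cell."""
    return f"VACUUM delta.`{path}` DRY RUN"


def vacuum_dry_run(spark, path):
    """Hypothetical helper: run the dry run through an existing SparkSession."""
    return spark.sql(vacuum_dry_run_sql(path))
```

For example, `vacuum_dry_run(spark, "dbfs:/mnt/<myfolder>")` issues the statement from a Python cell without the `%sql` magic.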

User16869510359
by Esteemed Contributor
  • 1081 Views
  • 1 replies
  • 0 kudos
Latest Reply
User16869510359
Esteemed Contributor
  • 0 kudos

The impact will be only on the files touched by the MERGE operation. The newly created files will not be optimized and data co-locality is not ensured. However, the files which are not touched by the MERGE operation will continue to show the improvem...

User16826994223
by Honored Contributor III
  • 590 Views
  • 1 replies
  • 0 kudos

Difference between Optimize and auto optimize in Delta

Which is better for me: should I run Optimize every time, or should I use auto optimize?

Latest Reply
User16869510359
Esteemed Contributor
  • 0 kudos

Optimize: Bin-packing/compaction. Idempotent and incremental.
Optimize + Z-Order: Helps with data skipping; uses range partitioning.
Optimize write: Improves the write operation to the Delta table. Optimization is performed before the write/during the writ...
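
To make the options above concrete, here is a minimal sketch that builds the corresponding Databricks SQL statements. The helper names, the table name `events`, and the column `eventType` are hypothetical. The two table properties are the auto optimize settings documented for Databricks, and the statements would be run with `spark.sql(...)` on a Databricks cluster.

```python
def optimize_sql(table, zorder_cols=None):
    """Build an OPTIMIZE statement, optionally with ZORDER BY."""
    stmt = f"OPTIMIZE {table}"
    if zorder_cols:
        stmt += " ZORDER BY (" + ", ".join(zorder_cols) + ")"
    return stmt


def auto_optimize_sql(table):
    """Build an ALTER TABLE statement that turns on the auto optimize properties."""
    return (
        f"ALTER TABLE {table} SET TBLPROPERTIES ("
        "delta.autoOptimize.optimizeWrite = true, "
        "delta.autoOptimize.autoCompact = true)"
    )


# Hypothetical table and column names, for illustration only.
print(optimize_sql("events"))
print(optimize_sql("events", ["eventType"]))
print(auto_optimize_sql("events"))
```

The first two statements are on-demand maintenance that you schedule yourself, while the third changes how later writes to the table behave.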

User16826994223
by Honored Contributor III
  • 551 Views
  • 0 replies
  • 0 kudos

Delta Sharing features

Share live data directly: Easily share existing, live data in your Delta Lake without copying it to another system.
Support diverse clients: Data recipients can directly connect to Delta Shares from Pandas, Apache Spark™, Rus...
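
As an illustration of the pandas client mentioned above, the following sketch uses the open source `delta-sharing` Python package. The helper names and every argument value are hypothetical; the package addresses a table as `<profile-file>#<share>.<schema>.<table>` and exposes `load_as_pandas` for reading it.

```python
def table_url(profile_path, share, schema, table):
    """Build the '<profile>#<share>.<schema>.<table>' address used by the client."""
    return f"{profile_path}#{share}.{schema}.{table}"


def load_shared_table(profile_path, share, schema, table):
    """Load a shared table into a pandas DataFrame (requires delta-sharing)."""
    import delta_sharing  # imported lazily; pip package name: delta-sharing

    return delta_sharing.load_as_pandas(table_url(profile_path, share, schema, table))
```

The profile file is the credentials file that a data provider hands to the recipient.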

User16826994223
by Honored Contributor III
  • 3532 Views
  • 2 replies
  • 0 kudos

Can I access Delta tables outside of Databricks Runtime?

Is it possible to write to the same table from both Databricks and OSS? Also, what if I want to read the data from MapReduce or Hive?

Latest Reply
sean_owen
Honored Contributor II
  • 0 kudos

Yes. The Delta client is open source, and lets you read/write Delta tables if you add it to your external application. See https://docs.delta.io/latest/index.html
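
A minimal sketch of what this looks like from an external PySpark application, assuming the open source Delta Lake package is on the classpath and `spark` is a SparkSession configured for it. The helper names are hypothetical.

```python
def read_delta(spark, path):
    """Read a Delta table by path using the open source Delta connector."""
    return spark.read.format("delta").load(path)


def append_delta(df, path):
    """Append a DataFrame to the Delta table at the given path."""
    df.write.format("delta").mode("append").save(path)
```

Both helpers use only the standard DataFrame reader and writer with the `delta` format, so the same calls apply on and off Databricks Runtime.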
