- 752 Views
- 0 replies
- 0 kudos
Can I create a Delta Lake table on Databricks and query it with open-source Spark?
Yes. To do this, install open-source Spark and Delta Lake; both are open source. Delta Engine, which is only available on Databricks, will make delta...
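A minimal sketch of the open-source side, assuming the `pyspark` and `delta-spark` packages are installed and that the table's storage location is reachable from the open-source cluster. The two configuration keys are the ones given in the Delta Lake quickstart; the helper names and the path are hypothetical.

```python
# Settings that enable Delta Lake in an open-source Spark session.
DELTA_CONF = {
    "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
    "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
}


def build_delta_session(app_name="read-delta"):
    """Create a SparkSession with the Delta Lake extensions enabled."""
    # Imported lazily so this module loads even where Spark is not installed.
    from pyspark.sql import SparkSession
    from delta import configure_spark_with_delta_pip

    builder = SparkSession.builder.appName(app_name)
    for key, value in DELTA_CONF.items():
        builder = builder.config(key, value)
    # configure_spark_with_delta_pip adds the matching Delta Lake JARs.
    return configure_spark_with_delta_pip(builder).getOrCreate()


def read_delta_table(spark, path):
    """Load a Delta table that was written elsewhere, e.g. on Databricks."""
    return spark.read.format("delta").load(path)
```

Usage would look like `read_delta_table(build_delta_session(), "/path/to/table")`, where the path is a placeholder. Reading from cloud storage may additionally need the relevant storage connector and credentials.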
- 747 Views
- 0 replies
- 0 kudos
Will the data scientist job profile be relevant in the future?
Seeing current Databricks features like AutoML, I assume that the data scientist job will be mostly automated and that soon the data scientists in the company will start decl...
- 904 Views
- 0 replies
- 0 kudos
Spark 3.0 Pandas UDF: Old vs New Pandas UDF interface
This slide shows the difference between the old and the new interface. The same here. The new interface can also be used for the existing Grouped Aggregate Pandas UDFs. In addition, the old Pandas U...
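Since the slide itself is not reproduced here, the following is a hedged sketch of what the two interfaces look like in PySpark 3.0, assuming `pyspark`, `pandas`, and `pyarrow` are installed. The function names are illustrative and not taken from the slide.

```python
def make_example_udfs():
    """Return the same kind of Pandas UDFs written in the old and new styles."""
    # Imported lazily so this module loads even where Spark is not installed.
    import pandas as pd
    from pyspark.sql.functions import pandas_udf, PandasUDFType

    # Old interface: the UDF kind is passed explicitly via PandasUDFType.
    @pandas_udf("long", PandasUDFType.SCALAR)
    def plus_one_old(s):
        return s + 1

    # New interface: the UDF kind is inferred from the Python type hints.
    @pandas_udf("long")
    def plus_one_new(s: pd.Series) -> pd.Series:
        return s + 1

    # New interface applied to a grouped aggregate UDF (Series in, scalar out).
    @pandas_udf("double")
    def mean_new(v: pd.Series) -> float:
        return v.mean()

    return plus_one_old, plus_one_new, mean_new
```

With a DataFrame `df` that has a numeric column `x`, the scalar variants would be applied as `df.select(plus_one_new("x"))` and the aggregate as `df.groupBy("key").agg(mean_new("x"))`, where `df`, `x`, and `key` are placeholders.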
- 1087 Views
- 0 replies
- 0 kudos
Cluster Sizes on DB SQL

| Cluster size | Driver size | Worker count |
|---|---|---|
| 2X-Small | i3.2xlarge | 1 |
| X-Small | i3.2xlarge | 2 |
| Small | i3.4xlarge | 4 |
| Medium | i3.8xlarge | 8 |
| Large | i3.8xlarge | 16 |
| X-Large | i3.16xlarge | 32 |
| 2X-Large | i3.16xlarge | ... |
- 1715 Views
- 0 replies
- 0 kudos
Multi-Cluster Load Balancing
Multi-cluster Load Balancing: the minimum and maximum number of clusters over which queries sent to the endpoint are distributed. The default is Off with a maximum of 1 cluster. When set to On, the default is minimum 1 clus...
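As an illustration of the minimum/maximum setting, here is a small sketch that validates a cluster range before it is sent anywhere. The field names `min_num_clusters` and `max_num_clusters` are assumed to match the Databricks SQL endpoint API; the helper itself is hypothetical and only builds a dictionary.

```python
def scaling_settings(min_clusters=1, max_clusters=1):
    """Build the cluster-range part of an endpoint configuration.

    The defaults (1 and 1) model load balancing being Off,
    i.e. a maximum of one cluster.
    """
    if min_clusters < 1:
        raise ValueError("min_clusters must be at least 1")
    if max_clusters < min_clusters:
        raise ValueError("max_clusters must be >= min_clusters")
    return {
        # Assumed API field names; check them against the API reference.
        "min_num_clusters": min_clusters,
        "max_num_clusters": max_clusters,
    }


def load_balancing_enabled(settings):
    """Queries can only be spread out when more than one cluster is allowed."""
    return settings["max_num_clusters"] > 1
```

For example, `scaling_settings(1, 4)` describes an endpoint that may grow from one to four clusters, while `scaling_settings()` keeps it at a single cluster.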
- 1027 Views
- 0 replies
- 0 kudos
I read this article in the docs about isolation levels, but I am not sure if I should be specifying this for my Delta tables. What situation would I want to change the isolation level from the default?