Data Governance
Join discussions on data governance practices, compliance, and security within the Databricks Community. Exchange strategies and insights to ensure data integrity and regulatory compliance.


Hubert-Dudek
Esteemed Contributor III

Databricks Runtime 10.5 (Beta)

👉 Auto Loader: new SQL function CLOUD_FILES_STATE

You can use the new CLOUD_FILES_STATE function to query the internal state of an Auto Loader stream.
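
As a rough sketch of how that query could be issued from a Python notebook cell: the helper names `cloud_files_state_query` and `query_auto_loader_state` are made up for this example, and the checkpoint path in the usage comment is a placeholder. It assumes a `spark` session on a runtime where the function is available, and that the argument is the stream's checkpoint location.

```python
def cloud_files_state_query(checkpoint_path: str) -> str:
    """Build the SQL text that reads Auto Loader state for one checkpoint.

    Hypothetical helper, not part of any Databricks API.
    """
    # Refuse quotes rather than guess at escaping rules for the SQL literal.
    if "'" in checkpoint_path:
        raise ValueError("checkpoint path must not contain single quotes")
    return f"SELECT * FROM cloud_files_state('{checkpoint_path}')"


def query_auto_loader_state(spark, checkpoint_path: str):
    """Run the query on an existing SparkSession and return the DataFrame."""
    return spark.sql(cloud_files_state_query(checkpoint_path))


# Usage in a notebook (placeholder path, assumed to be the stream's checkpoint):
# state_df = query_auto_loader_state(spark, "/path/to/checkpoint")
# state_df.show(truncate=False)
```

Keeping the query text in its own function makes it easy to print and paste into a `%sql` cell when debugging.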

👉 Delta Lake: new maxRecordsPerFile option for maximum records written to a single file

When you use the DataFrame APIs to write to a Delta table, you can use the "maxRecordsPerFile" option to specify the maximum number of records to write to a single file. A value of zero or a negative value means no limit.

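A minimal sketch of setting this option through the DataFrame writer, assuming `df` is an existing Spark DataFrame. The helper name `write_delta_with_record_cap`, the target path, and the cap of 100000 are illustrative only.

```python
def write_delta_with_record_cap(df, target_path, max_records_per_file, mode="append"):
    """Write df to a Delta location, capping the records written per file.

    Hypothetical helper: it only chains standard DataFrameWriter calls.
    A cap of zero or a negative number means no limit.
    """
    (
        df.write
        .format("delta")
        .mode(mode)
        .option("maxRecordsPerFile", max_records_per_file)
        .save(target_path)
    )


# Usage in a notebook (placeholder path and cap):
# write_delta_with_record_cap(df, "/path/to/delta/table", 100000)
```
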
👉 Deprecation of Koalas

👉 Unity Catalog: SQL LIST output removes "is_directory" column and adds trailing / for directory paths...

👉 more info https://docs.databricks.com/release-notes/runtime/10.5.html

1 REPLY

SørenRavn
New Contributor II

Has anyone gotten CLOUD_FILES_STATE to work on 10.5 Beta on Azure?

Syntax:

%sql

SELECT * FROM cloud_files_state('abfss://test@test.dfs.core.windows.net/test/checkpoint/');

I get this error:

com.databricks.backend.common.rpc.DatabricksExceptions$SQLExecutionException: org.apache.spark.sql.AnalysisException: could not resolve `cloud_files_state` to a table-valued function
