<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Error while inserting data to unity catalog from AWS EMR (spark) for uniform enable table in Data Governance</title>
    <link>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129310#M2586</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180263"&gt;@adityapa&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Can you first confirm that EMR can actually see your catalog and schema? Try running:&lt;/P&gt;&lt;P&gt;spark.sql("SHOW CATALOGS").show(false)&lt;BR /&gt;spark.sql("SHOW SCHEMAS IN &amp;lt;catalog_name&amp;gt;").show(false)&lt;/P&gt;</description>
    <pubDate>Fri, 22 Aug 2025 14:45:09 GMT</pubDate>
    <dc:creator>SP_6721</dc:creator>
    <dc:date>2025-08-22T14:45:09Z</dc:date>
    <item>
      <title>Error while inserting data to unity catalog from AWS EMR (spark) for uniform enable table</title>
      <link>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129087#M2583</link>
      <description>&lt;P&gt;Hi Everyone,&lt;/P&gt;&lt;P&gt;I am trying to write data to a delta table created on Unity Catalog with external location. I am using AWS EMR and below are my table and spark properties.&lt;/P&gt;&lt;P&gt;#### Spark Shell&lt;/P&gt;&lt;P&gt;```&lt;BR /&gt;spark-shell \&lt;BR /&gt;--conf "spark.sql.defaultCatalog=&amp;lt;catalog_name&amp;gt;" \&lt;BR /&gt;--conf "spark.sql.catalog.&amp;lt;catalog_name&amp;gt;.warehouse=&amp;lt;catalog_name&amp;gt;" \&lt;BR /&gt;--conf spark.databricks.unityCatalog.enabled=true \&lt;BR /&gt;--conf spark.hadoop.fs.s3.impl=org.apache.hadoop.fs.s3a.S3AFileSystem \&lt;BR /&gt;--conf spark.hadoop.fs.s3a.impl=org.apache.hadoop.fs.s3a.S3AFileSystem \&lt;BR /&gt;--conf "spark.sql.catalog.&amp;lt;catalog_name&amp;gt;=io.unitycatalog.spark.UCSingleCatalog" \&lt;BR /&gt;--conf "spark.sql.catalog.&amp;lt;catalog_name&amp;gt;.type=rest" \&lt;BR /&gt;--conf "spark.sql.catalog.spark_catalog=org.apache.spark.sql.delta.catalog.DeltaCatalog" \&lt;BR /&gt;--conf "spark.sql.catalog.&amp;lt;catalog_name&amp;gt;.uri=https://${URI}/api/2.1/unity-catalog" \&lt;BR /&gt;--packages "org.apache.hadoop:hadoop-aws:3.4.1,org.apache.hadoop:hadoop-common:3.4.1,io.delta:delta-spark_2.12:3.2.1,io.unitycatalog:unitycatalog-spark_2.12:0.2.1,org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.9.1,io.delta:delta-iceberg_2.12:3.3.2" \&lt;BR /&gt;--conf "spark.sql.catalog.&amp;lt;catalog_name&amp;gt;.credential=token" \&lt;BR /&gt;--conf "spark.sql.catalog.&amp;lt;catalog_name&amp;gt;.token=${DATABRICKS_TOKEN}" \&lt;BR /&gt;--conf "spark.hadoop.fs.s3a.endpoint=s3.us-west-1.amazonaws.com" \&lt;BR /&gt;--conf "spark.hadoop.fs.s3a.endpoint.region=us-west-1" \&lt;BR /&gt;--conf "spark.hadoop.fs.s3a.region=us-west-1" \&lt;BR /&gt;--conf "spark.databricks.delta.uniform.iceberg.sync.convert.enabled=true" \&lt;BR /&gt;--conf "spark.sql.extensions=io.delta.sql.DeltaSparkSessionExtension,org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions" \&lt;BR /&gt;--conf "spark.hadoop.fs.s3a.aws.credentials.provider=org.apache.hadoop.fs.s3a.TemporaryAWSCredentialsProvider"&lt;BR /&gt;```&lt;/P&gt;&lt;P&gt;#### Table Configs :&lt;/P&gt;&lt;P&gt;```&lt;BR /&gt;CREATE EXTERNAL TABLE poc_prod_adm.v1.table8 (a STRING, b STRING, c BIGINT, d BIGINT)&lt;BR /&gt;USING DELTA&lt;BR /&gt;PARTITIONED BY (a, b)&lt;BR /&gt;LOCATION 's3://&amp;lt;bucket&amp;gt;/&amp;lt;subfolder&amp;gt;/&amp;lt;catalog_name&amp;gt;/&amp;lt;schema_name&amp;gt;/table8'&lt;BR /&gt;TBLPROPERTIES (&lt;BR /&gt;'delta.columnMapping.mode' = 'name',&lt;BR /&gt;'delta.enableIcebergCompatV2' = 'true',&lt;BR /&gt;'delta.universalFormat.enabledFormats' = 'iceberg',&lt;BR /&gt;'delta.minReaderVersion' = 2,&lt;BR /&gt;'delta.minWriterVersion' = 5&lt;BR /&gt;);&lt;BR /&gt;```&lt;/P&gt;&lt;P&gt;-----&lt;/P&gt;&lt;P&gt;While inserting data from AWS EMR (Spark), I am getting following error :&lt;/P&gt;&lt;P&gt;```&lt;/P&gt;&lt;P&gt;scala&amp;gt; spark.sql("""INSERT INTO &amp;lt;catalog_name&amp;gt;.&amp;lt;schema_name&amp;gt;.table8 (&lt;BR /&gt;| a,&lt;BR /&gt;| b,&lt;BR /&gt;| c,&lt;BR /&gt;| d&lt;BR /&gt;| )&lt;BR /&gt;| VALUES (&lt;BR /&gt;| 'a',&lt;BR /&gt;| 'b',&lt;BR /&gt;| 20250820,&lt;BR /&gt;| 20250915,&lt;BR /&gt;| );""");&lt;BR /&gt;SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".&lt;BR /&gt;SLF4J: Defaulting to no-operation (NOP) logger implementation&lt;BR /&gt;SLF4J: See &lt;A href="http://www.slf4j.org/codes.html#StaticLoggerBinder" target="_blank"&gt;http://www.slf4j.org/codes.html#StaticLoggerBinder&lt;/A&gt; for further details.&lt;BR /&gt;25/08/20 13:13:02 WARN SparkStringUtils: Truncated the string representation of a plan since it was too large. This behavior can be adjusted by setting 'spark.sql.debug.maxToStringFields'.&lt;BR /&gt;25/08/20 13:13:17 WARN HiveConf: HiveConf of name hive.server2.thrift.url does not exist&lt;BR /&gt;25/08/20 13:13:18 WARN HiveConf: HiveConf of name hive.server2.thrift.url does not exist&lt;BR /&gt;25/08/20 13:13:18 ERROR IcebergConverter: Error when converting to Iceberg metadata&lt;BR /&gt;org.apache.spark.sql.catalyst.analysis.NoSuchDatabaseException: [SCHEMA_NOT_FOUND] The schema `&amp;lt;schema_name&amp;gt;` cannot be found. Verify the spelling and correctness of the schema and catalog.&lt;BR /&gt;If you did not qualify the name with a catalog, verify the current_schema() output, or qualify the name with the correct catalog.&lt;BR /&gt;To tolerate the error on drop use DROP SCHEMA IF EXISTS.&lt;BR /&gt;```&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;#### Notes :&lt;BR /&gt;1. Our requirement is that we should be able to write the data from Spark (delta interface) and read from both delta and iceberg interface tools like spark, duckdb, trino etc.&lt;BR /&gt;2. We are using uniform table for our requirement and hence those properties are crucial.&lt;BR /&gt;2.1 Only `'delta.universalFormat.enabledFormats' = 'iceberg',` is required and other properties are added to support it (as they need to be enabled or are defaults)&lt;BR /&gt;3. Spark config `spark.databricks.delta.uniform.iceberg.sync.convert.enabled=true` is set to true as per the details mentioned in : &lt;A href="https://github.com/delta-io/delta/blob/v3.3.2/spark/src/main/scala/org/apache/spark/sql/delta/sources/DeltaSQLConf.scala#L1508" target="_blank"&gt;https://github.com/delta-io/delta/blob/v3.3.2/spark/src/main/scala/org/apache/spark/sql/delta/sources/DeltaSQLConf.scala#L1508&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Any help is appreciated.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Aug 2025 08:32:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129087#M2583</guid>
      <dc:creator>adityapa</dc:creator>
      <dc:date>2025-08-21T08:32:06Z</dc:date>
    </item>
    <item>
      <title>Re: Error while inserting data to unity catalog from AWS EMR (spark) for uniform enable table</title>
      <link>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129310#M2586</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/180263"&gt;@adityapa&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Can you first confirm that EMR can actually see your catalog and schema? Try running:&lt;/P&gt;&lt;P&gt;spark.sql("SHOW CATALOGS").show(false)&lt;BR /&gt;spark.sql("SHOW SCHEMAS IN &amp;lt;catalog_name&amp;gt;").show(false)&lt;/P&gt;</description>
      <pubDate>Fri, 22 Aug 2025 14:45:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129310#M2586</guid>
      <dc:creator>SP_6721</dc:creator>
      <dc:date>2025-08-22T14:45:09Z</dc:date>
    </item>
    <item>
      <title>Re: Error while inserting data to unity catalog from AWS EMR (spark) for uniform enable table</title>
      <link>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129585#M2588</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/156441"&gt;@SP_6721&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;I am able to read data from Spark (as it is using delta logs) and able to view the schema/catalog details on Trino over EMR.&lt;/P&gt;&lt;P&gt;I am also able to write data to delta files in s3 using UC. However, the metadata/manifest file for iceberg is not getting updated causing the above mentioned issue.&lt;/P&gt;</description>
      <pubDate>Mon, 25 Aug 2025 09:59:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/error-while-inserting-data-to-unity-catalog-from-aws-emr-spark/m-p/129585#M2588</guid>
      <dc:creator>adityapa</dc:creator>
      <dc:date>2025-08-25T09:59:47Z</dc:date>
    </item>
  </channel>
</rss>

