<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Query AWS Redshift from Databricks SQL in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31000#M22531</link>
    <description>&lt;P&gt;@Cristian Constantinescu​&amp;nbsp;can you point me to the documentation page which says DBSQL supports JDBC connections?&lt;/P&gt;</description>
    <pubDate>Fri, 28 Jan 2022 13:53:03 GMT</pubDate>
    <dc:creator>BilalAslamDbrx</dc:creator>
    <dc:date>2022-01-28T13:53:03Z</dc:date>
    <item>
      <title>Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30995#M22526</link>
      <description>&lt;P&gt;Greetings,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In the documentation for Databricks SQL it states that it supports JDBC connections, however when connecting to AWS Redshift via the built in PostgreSQL driver ("CREATE TABLE sample USING JDBC" and "jdbc://postgresql:/..." URI) I'm getting a weird error as follows:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;"&lt;/P&gt;&lt;P&gt;org.apache.spark.sql.AnalysisException:&amp;nbsp;&lt;/P&gt;&lt;P&gt;The user-specified schema doesn't match the actual schema:&lt;/P&gt;&lt;P&gt;user-specified: `id` INT, (...other columns edited out), actual: `id` INT, (...other columns edited out). If you're using&lt;/P&gt;&lt;P&gt;DataFrameReader.schema API or creating a table, please do not specify the schema.&lt;/P&gt;&lt;P&gt;Or if you're scanning an existed table, please drop it and re-create it.&lt;/P&gt;&lt;P&gt;"&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The list of columns were edited out but it is exactly the same list in "user-specified" and in "actual", seems like rather some internal detail is different rather than the printed DDL.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Does this mean that currently there is no way to query AWS Redshift as external data store from Databricks SQL?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Cristian&lt;/P&gt;</description>
      <pubDate>Tue, 25 Jan 2022 11:04:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30995#M22526</guid>
      <dc:creator>cristianc</dc:creator>
      <dc:date>2022-01-25T11:04:27Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30996#M22527</link>
      <description>&lt;P&gt;@Cristian Constantinescu​&amp;nbsp; Thanks for posting your query/issue here.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;can you please try to create the table using the jdbc connections in the databricks notebooks using the 9.1 version of DBR and check if you are able to perform it?&lt;/P&gt;</description>
      <pubDate>Tue, 25 Jan 2022 11:47:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30996#M22527</guid>
      <dc:creator>pavan_kumar</dc:creator>
      <dc:date>2022-01-25T11:47:40Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30997#M22528</link>
      <description>&lt;P&gt;From notebooks the spark sql queries are working as expected.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The problem seems to appear when using the "Databricks SQL" product, for instance when asking for a data sample or when running queries on the external table (that does the JDBC to Redshift via PostgreSQL JDBC URI) from the metastore.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The strangest of the things is that this code seems to be throwing the error:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;spark/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;if&lt;/B&gt; (baseRelation.schema != schema) {&lt;/P&gt;&lt;P&gt; 362 &lt;B&gt;throw new&lt;/B&gt; AnalysisException(&lt;/P&gt;&lt;P&gt; 363 "The user-specified schema doesn't match the actual schema: " +&lt;/P&gt;&lt;P&gt; 364 s"user-specified: ${schema.toDDL}, actual: ${baseRelation.schema.toDDL}. If " +&lt;/P&gt;&lt;P&gt; 365 "you're using DataFrameReader.schema API or creating a table, please do not " +&lt;/P&gt;&lt;P&gt; 366 "specify the schema. Or if you're scanning an existed table, please drop " +&lt;/P&gt;&lt;P&gt; 367 "it and re-create it.")&lt;/P&gt;&lt;P&gt; 368 }&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Meaning baseRelation.schema != schema but schema.toDDL and baseRelation.schema.toDDL is the same string in my case.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hope this better describes the problem.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your interest in solving this issue!&lt;/P&gt;</description>
      <pubDate>Tue, 25 Jan 2022 11:59:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30997#M22528</guid>
      <dc:creator>cristianc</dc:creator>
      <dc:date>2022-01-25T11:59:31Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30998#M22529</link>
      <description>&lt;P&gt;@Cristian Constantinescu​&amp;nbsp; we see that issue is occurring on the 10.1 version of dbr with 3.2 version of spark and recently we have fixed this issue and the fix will be rolled out soon on all the workspaces.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;you can try to use the 10.1 dbr with notebooks and check if the issue is reproducible and for the time you can use the lower versions of dbr with notebooks.&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jan 2022 07:01:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30998#M22529</guid>
      <dc:creator>pavan_kumar</dc:creator>
      <dc:date>2022-01-26T07:01:19Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30999#M22530</link>
      <description>&lt;P&gt;@Pavan Kumar Chalamcharla​&amp;nbsp;thank you for your help in understanding better this situation.&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jan 2022 08:39:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/30999#M22530</guid>
      <dc:creator>cristianc</dc:creator>
      <dc:date>2022-01-26T08:39:03Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31000#M22531</link>
      <description>&lt;P&gt;@Cristian Constantinescu​&amp;nbsp;can you point me to the documentation page which says DBSQL supports JDBC connections?&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jan 2022 13:53:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31000#M22531</guid>
      <dc:creator>BilalAslamDbrx</dc:creator>
      <dc:date>2022-01-28T13:53:03Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31001#M22532</link>
      <description>&lt;P&gt;@Bilal Aslam​&amp;nbsp;there is a high possibility that my understand is not correct, but when I was reading the Databricks SQL guide I stumbled upon this page: &lt;A href="https://docs.databricks.com/sql/language-manual/sql-ref-syntax-ddl-create-table-using.html" target="test_blank"&gt;https://docs.databricks.com/sql/language-manual/sql-ref-syntax-ddl-create-table-using.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In the page it explains how the CREATE TABLE statement works for DBSQL, there is the following section:&lt;/P&gt;&lt;P&gt;"USING data_source The file format to use for the table. data_source must be one of TEXT, AVRO, CSV, JSON, JDBC, PARQUET, ORC, or DELTA. If USING is omitted, the default is DELTA."&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In the meantime we discussed with our account rep and we were briefed that JDBC is not supported for DBSQL which is compatible with our experience.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I hope that this information was helpful in any way!&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jan 2022 14:04:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31001#M22532</guid>
      <dc:creator>cristianc</dc:creator>
      <dc:date>2022-01-28T14:04:00Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31002#M22533</link>
      <description>&lt;P&gt;@Cristian Constantinescu​&amp;nbsp;you're right, this is a documentation bug since DBSQL does not [yet] support querying JDBC. I'll work with the team to fix it. Thanks for sharing the doc link!&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jan 2022 14:05:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31002#M22533</guid>
      <dc:creator>BilalAslamDbrx</dc:creator>
      <dc:date>2022-01-28T14:05:21Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31003#M22534</link>
      <description>&lt;P&gt;@Bilal Aslam​&amp;nbsp;anytime! Is there a place where customers could follow the timeline when such features are introduced?&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jan 2022 14:10:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31003#M22534</guid>
      <dc:creator>cristianc</dc:creator>
      <dc:date>2022-01-28T14:10:12Z</dc:date>
    </item>
    <item>
      <title>Re: Query AWS Redshift from Databricks SQL</title>
      <link>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31004#M22535</link>
      <description>&lt;P&gt;@Cristian Constantinescu​&amp;nbsp;I believe that's at ideas.databricks.com&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jan 2022 14:26:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/query-aws-redshift-from-databricks-sql/m-p/31004#M22535</guid>
      <dc:creator>BilalAslamDbrx</dc:creator>
      <dc:date>2022-01-28T14:26:22Z</dc:date>
    </item>
  </channel>
</rss>

