<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: German Umlauts wrong via JDBC in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123302#M46982</link>
    <description>&lt;P&gt;Hm, now it's getting even weirder. I usually use the NetBeans IDE. I have now tried the same code in Eclipse, and there it worked without any special options. In NetBeans, even with&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;-Dfile.encoding=UTF-8&lt;/PRE&gt;&lt;P&gt;there is no change. Does anyone know what could cause this behaviour in NetBeans?&lt;/P&gt;</description>
    <pubDate>Mon, 30 Jun 2025 13:52:07 GMT</pubDate>
    <dc:creator>jordan72</dc:creator>
    <dc:date>2025-06-30T13:52:07Z</dc:date>
    <item>
      <title>German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123271#M46971</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have an issue where German umlauts are not retrieved correctly via the JDBC driver.&lt;/P&gt;&lt;P&gt;It shows&amp;nbsp;M�nchen instead of&amp;nbsp;München.&lt;/P&gt;&lt;P&gt;I load the driver in my Java app via:&lt;/P&gt;&lt;P&gt;&amp;lt;groupId&amp;gt;com.databricks&amp;lt;/groupId&amp;gt;&lt;BR /&gt;&amp;lt;artifactId&amp;gt;databricks-jdbc&amp;lt;/artifactId&amp;gt;&lt;BR /&gt;&amp;lt;version&amp;gt;2.7.3&amp;lt;/version&amp;gt;&lt;/P&gt;&lt;P&gt;and set the character encodings via:&lt;/P&gt;&lt;P&gt;System.setProperty("file.encoding", "UTF-8");&lt;BR /&gt;System.setProperty("sun.jnu.encoding", "UTF-8");&lt;/P&gt;&lt;P&gt;In the Databricks UI everything looks correct. The column type is STRING.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Volker Jordan&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 11:17:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123271#M46971</guid>
      <dc:creator>jordan72</dc:creator>
      <dc:date>2025-06-30T11:17:04Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123280#M46974</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/172844"&gt;@jordan72&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Maybe try adding the following parameters to your JDBC connection URL:&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;CharacterEncoding=UTF-8;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;UseUnicode=true;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;-&amp;nbsp;&lt;STRONG&gt;CharSet=UTF-8;&lt;/STRONG&gt;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;String url = "jdbc:databricks://&amp;lt;your-host&amp;gt;:443/default;transportMode=http;ssl=1;httpPath=&amp;lt;http-path&amp;gt;;AuthMech=3;UID=token;PWD=&amp;lt;token&amp;gt;;CharSet=UTF-8;characterEncoding=UTF-8;UseUnicode=true;";&lt;/LI-CODE&gt;</description>
      <pubDate>Mon, 30 Jun 2025 12:03:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123280#M46974</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-06-30T12:03:05Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123281#M46975</link>
      <description>&lt;P&gt;I have already tried all of those parameters, but nothing changed.&lt;/P&gt;&lt;P&gt;Surprisingly, in DataGrip (which also uses the JDBC driver) the results are correct. I copied the URL from DataGrip into a plain Java project in my IDE, and there it does not work.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 12:19:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123281#M46975</guid>
      <dc:creator>jordan72</dc:creator>
      <dc:date>2025-06-30T12:19:37Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123283#M46976</link>
      <description>&lt;P&gt;OK, thanks for the additional information. So maybe the issue is related to the JVM environment.&lt;BR /&gt;I noticed that you are setting the following property:&amp;nbsp;&lt;SPAN&gt;System.setProperty("file.encoding", "UTF-8");&lt;/SPAN&gt;&lt;BR /&gt;Java reads file.encoding once at JVM startup; setting it with System.setProperty at runtime has no effect on string decoding in most libraries, including JDBC drivers.&lt;/P&gt;&lt;P&gt;Try launching your application with the following VM option:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;java -Dfile.encoding=UTF-8&lt;/LI-CODE&gt;</description>
      <pubDate>Mon, 30 Jun 2025 12:59:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123283#M46976</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-06-30T12:59:27Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123286#M46978</link>
      <description>&lt;P&gt;Another thought: check whether this is a problem with your IDE configuration. Assuming you are using IntelliJ IDEA, check your file encoding settings: Settings -&amp;gt; Editor -&amp;gt; File Encodings.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://blog.jetbrains.com/idea/2013/03/use-the-utf-8-luke-file-encodings-in-intellij-idea/#" target="_blank" rel="noopener"&gt;Use the UTF-8, Luke! File Encodings in IntelliJ IDEA | The IntelliJ IDEA Blog&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 13:03:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123286#M46978</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-06-30T13:03:58Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123302#M46982</link>
      <description>&lt;P&gt;Hm, now it's getting even weirder. I usually use the NetBeans IDE. I have now tried the same code in Eclipse, and there it worked without any special options. In NetBeans, even with&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;-Dfile.encoding=UTF-8&lt;/PRE&gt;&lt;P&gt;there is no change. Does anyone know what could cause this behaviour in NetBeans?&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 13:52:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123302#M46982</guid>
      <dc:creator>jordan72</dc:creator>
      <dc:date>2025-06-30T13:52:07Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123304#M46983</link>
      <description>&lt;P&gt;OK, so that confirms the problem is not related to the driver. Rather, it is a quirk of NetBeans.&lt;BR /&gt;In NetBeans, the option -Dfile.encoding=UTF-8 alone is not sufficient.&amp;nbsp;&lt;BR /&gt;Please follow the approach suggested in the following Stack Overflow thread, depending on the Java version you are using:&lt;/P&gt;&lt;P&gt;&lt;A href="https://stackoverflow.com/questions/4896311/how-to-use-utf-8-character-in-netbeans/27848261#27848261" target="_blank" rel="noopener"&gt;java - How to use UTF-8 character in Netbeans - Stack Overflow&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 14:13:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123304#M46983</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-06-30T14:13:10Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123315#M46984</link>
      <description>&lt;P&gt;OK, so it seems to be related to the newly introduced&amp;nbsp;native.encoding system property.&lt;/P&gt;&lt;P&gt;So in NetBeans you have to pass&amp;nbsp;-Dstdout.encoding=utf-8 to the VM if you are using JDK 21.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 15:40:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123315#M46984</guid>
      <dc:creator>jordan72</dc:creator>
      <dc:date>2025-06-30T15:40:48Z</dc:date>
    </item>
    <item>
      <title>Re: German Umlauts wrong via JDBC</title>
      <link>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123317#M46985</link>
      <description>&lt;P&gt;Yes, this is exactly what the link I provided above suggested:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="szymon_dybczak_0-1751298256641.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/17861iA4E84DC31856BACF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="szymon_dybczak_0-1751298256641.png" alt="szymon_dybczak_0-1751298256641.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 15:44:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/german-umlauts-wrong-via-jdbc/m-p/123317#M46985</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-06-30T15:44:19Z</dc:date>
    </item>
  </channel>
</rss>

