Is there a known issue regarding Databricks JDBC driver character values such as Japanese etc?

151640
Databricks Partner

A Parquet file contains character data for various languages and is shown by the Data Explorer UX. A simple "select *" query using the Databricks JDBC driver (version 2.6.29) with a tool such as SQLSquirrel displays invalid characters.

image

Debayan
Databricks Employee
Databricks Employee

Hi @nigel campbell​ , Could you please look into https://community.databricks.com/s/question/0D53f00001LKG9ECAX/how-to-properly-load-unicode-utf8-cha... and let us know if this helps.

151640
Databricks Partner

That thread doesn't address the issue. The driver should return valid values by default as do many other products. Appears to be a fairly significant defect.

Anonymous
Not applicable

Hi @nigel campbell​ 

Hope everything is going great.

Does @Kaniz Fatma​  and @Debayan Mukherjee​ response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?

We'd love to hear from you.

Thanks!

151640
Databricks Partner

The issue encountered has been confirmed to be a defect in the Databricks JDBC driver.

View solution in original post