cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

RajibRajib_Mand
by New Contributor III
  • 3774 Views
  • 7 replies
  • 6 kudos

Unicode field separator to create unamanged table in databricks for csv file

We are getting \u318a (ㆊ) separated csv file. We want to create unmanaged table in databricks, Here is the table creation script.create table IF NOT EXISTS db_test_raw.t_data_otc_poc (`caseidt` String,`worktype` String,`doctyp` String,`brand` String,...

  • 3774 Views
  • 7 replies
  • 6 kudos
Latest Reply
Hubert-Dudek
Esteemed Contributor III
  • 6 kudos

sep "\u318a"delimeter " \x318a"sep " \x318a"Try to use sep instead or/and x instead.

  • 6 kudos
6 More Replies
Quan
by New Contributor III
  • 15775 Views
  • 9 replies
  • 6 kudos

Resolved! How to properly load Unicode (UTF-8) characters from table over JDBC connection using Simba Spark Driver

Hello all, I'm trying to pull table data from databricks tables that contain foreign language characters in UTF-8 into an ETL tool using a JDBC connection. I'm using the latest Simba Spark JDBC driver available from the Databricks website.The issue i...

  • 15775 Views
  • 9 replies
  • 6 kudos
Latest Reply
Anonymous
Not applicable
  • 6 kudos

Can you try setting UseUnicodeSqlCharacterTypes=1 in the driver, and also make sure 'file.encoding' is set to UTF-8 in jvm and see if the issue still persists?

  • 6 kudos
8 More Replies
Labels