Lakebridge Transpiler Fails with UnicodeDecodeError while Analyzer Works Successfully

shashankB
Databricks Partner

Hello Team,

I am facing an issue with the Lakebridge transpiler.
The Analyzer step runs successfully and produces the expected analysis files. However, when I run the Transpiler, it fails with the following error:

ERROR [src/databricks/labs/Lakebridge.transpile] UnicodeDecodeError: 'charmap' codec can't decode byte 0x8f in position 71: character maps to <undefined>
Error: unexpected end of JSON input
Lakebridge Transpile failed with exit code 1

Command I executed:

databricks labs lakebridge transpile --input-source "C:\Users\user_name\Downloads\segment_pioneer" --source-dialect synapse --output-folder "C:\Users\user_name\Downloads\segment_pioneer\output\Converted_Code"
 
What confuses me is that:

The Analyzer works fine and completes successfully.

The Transpiler fails immediately with an encoding-related error.

If there were a problem in the SQL code itself, I would expect the Analyzer to fail as well. So the failure seems related to how the transpiler reads files or paths (possibly an encoding issue on Windows).
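
To narrow this down, a minimal Python sketch like the one below could list which files in the input folder fail to decode. It assumes the 'charmap' codec in the traceback is a Windows code page such as cp1252 (the error text does not say which one), and the helper name find_undecodable_files is hypothetical, not part of Lakebridge.

```python
from pathlib import Path


def find_undecodable_files(root, encodings=("utf-8", "cp1252")):
    """Return {file path: {encoding: where decoding failed}}.

    Assumption: cp1252 stands in for the Windows default code page;
    swap in another code page if your system uses a different one.
    """
    problems = {}
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        data = path.read_bytes()
        failures = {}
        for enc in encodings:
            try:
                data.decode(enc)
            except UnicodeDecodeError as exc:
                # exc.start is the offset of the first undecodable byte
                failures[enc] = f"byte 0x{data[exc.start]:02x} at position {exc.start}"
        if failures:
            problems[str(path)] = failures
    return problems
```

A file that decodes as utf-8 but fails as cp1252 would suggest the source files are UTF-8 and are being read with the Windows default encoding. If nothing in the input folder is reported, the offending file may be somewhere else (for example a configuration file), which this sketch does not check.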

Could you please help clarify:

  1. Why does the Analyzer run while the Transpiler fails on the same input?
  2. Is there a known workaround for the UnicodeDecodeError on Windows (e.g., forcing UTF-8)?
  3. Should I try running this with a different CLI encoding setting?
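
For questions 2 and 3, one thing that could be tried is Python's UTF-8 mode, enabled through the PYTHONUTF8 environment variable. The sketch below is only an assumption about what might help: it presumes the transpiler runs as a Python process that inherits the environment and reads files with the default encoding, which is not confirmed here. The helper names utf8_env and child_utf8_mode are hypothetical.

```python
import os
import subprocess
import sys


def utf8_env(base=None):
    """Copy an environment and switch child Python processes to UTF-8 mode."""
    env = dict(os.environ if base is None else base)
    env["PYTHONUTF8"] = "1"            # Python UTF-8 mode (Python 3.7+)
    env["PYTHONIOENCODING"] = "utf-8"  # stdin/stdout/stderr encoding
    return env


def child_utf8_mode(env):
    """Start a child Python with the given environment and report sys.flags.utf8_mode."""
    result = subprocess.run(
        [sys.executable, "-c", "import sys; print(sys.flags.utf8_mode)"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


# Assumed usage: run the same transpile command as above with this environment,
# e.g. subprocess.run([...the databricks command and its arguments...], env=utf8_env())
```

Setting PYTHONUTF8=1 in the terminal session before running the databricks command would have the same effect as passing utf8_env() from a script. child_utf8_mode only verifies that a child Python picks up the setting; it does not prove that Lakebridge honors it.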

Thanks in advance.