How to remove characters longer than 4 bytes using PySpark in Databricks?

eimis_pacheco
Contributor

Hi community,

We need to remove characters longer than 4 bytes using PySpark in Databricks, since Amazon Redshift does not support them. Does anyone know how I can accomplish this?
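To make the question concrete, here is a rough sketch of the per-string filtering I have in mind. It is plain Python and has not been run on Databricks. The names `strip_wide_chars` and `max_bytes` are made up for illustration, and I am assuming that "size" means the length of each character's UTF-8 encoding, with the threshold left as a parameter.

```python
from typing import Optional


def strip_wide_chars(text: Optional[str], max_bytes: int) -> Optional[str]:
    """Drop every character whose UTF-8 encoding is longer than max_bytes.

    Hypothetical helper: the name and the max_bytes parameter are
    assumptions made for this sketch, not part of any library.
    """
    if text is None:
        # Keep nulls as nulls so the column's null semantics are unchanged.
        return None
    return "".join(
        ch
        for ch in text
        # surrogatepass avoids an exception on lone surrogate code points.
        if len(ch.encode("utf-8", errors="surrogatepass")) <= max_bytes
    )


# Possible use in PySpark (assumption, shown as comments so the sketch
# stays runnable without a Spark session; "my_col" is a made-up column):
#   from pyspark.sql.functions import udf, lit
#   from pyspark.sql.types import StringType
#   strip_udf = udf(strip_wide_chars, StringType())
#   df = df.withColumn("my_col", strip_udf("my_col", lit(3)))
```

A Python UDF like this is probably slower than a built-in column function, so a native alternative would be welcome if one exists.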

Thank you very much in advance.

Regards