Thursday
For all its positives, one of the first general issues we had with Databricks was case sensitivity.
We have a lot of data-specific filters in our code.
The problem is, we land and view data from lots of different case-insensitive source systems, e.g. SQL Server.
As such, we have to be very careful with our code and convert columns to UPPER when making a comparison.
Most of our code is written in SQL.
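To illustrate the workaround described above, here is a minimal sketch. The table `customers` and the column `country_code` are hypothetical names used only for this example.

```sql
-- Hypothetical table and column names, for illustration only.
-- Both sides are upper-cased so that 'gb', 'Gb' and 'GB' all match.
SELECT *
FROM customers
WHERE UPPER(country_code) = UPPER('gb');
```

Every filter and join on a string column needs the same treatment, which is why a setting at object level would be preferable.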
About 18 months ago I asked whether there was going to be a catalog, schema, or table setting for this, i.e. a way to make the object case-insensitive.
I was told it was on its way.
I've not heard anything since and can't find anything about it.
Does anybody know whether this is in place or expected?
Thanks
Thursday - last edited Thursday
Hi @dpc,
I think you can try to use a collation for that purpose. A collation is a set of rules that determines how string comparisons are performed. Collations are used to compare strings in a case-insensitive, accent-insensitive, or trailing-space-insensitive manner, or to sort strings in a specific language-aware order.
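As a rough sketch of what that could look like, assuming a runtime version that supports collations: `UTF8_LCASE` is one of the case-insensitive collations, and the table and column names below are hypothetical.

```sql
-- Hypothetical table: the name column is declared with a
-- case-insensitive collation.
CREATE TABLE customers_ci (
  id   INT,
  name STRING COLLATE UTF8_LCASE
);

-- Comparisons on name follow the column's collation,
-- so no UPPER() wrapper is needed here.
SELECT *
FROM customers_ci
WHERE name = 'acme';

-- A collation can also be applied to a single expression
-- without changing the table definition.
SELECT 'Acme' = 'ACME' COLLATE UTF8_LCASE AS is_equal;
```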
Friday
Thanks.
Collation is table-specific though, isn't it? And you have to apply it to each column.
Is there a way to just say that this schema, catalog, or table is case-insensitive, or can you only do it by column?
Friday
Hi, you can set the default collation at catalog level or schema level, and the tables in that catalog or schema will inherit the collation. This is supported in DBR 17.1 and above.
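A minimal sketch of that setup, assuming DBR 17.1 or above. The catalog, schema, and table names are hypothetical, and `UNICODE_CI` is used here as one example of a case-insensitive collation.

```sql
-- Hypothetical object names, for illustration only.
-- Set a case-insensitive default collation on a new catalog.
CREATE CATALOG sales_ci DEFAULT COLLATION UNICODE_CI;

-- A schema can also carry its own default collation.
CREATE SCHEMA sales_ci.landing DEFAULT COLLATION UNICODE_CI;

-- No COLLATE clause on the column: the STRING column is expected
-- to pick up the default collation from its schema.
CREATE TABLE sales_ci.landing.orders (
  order_id  INT,
  order_ref STRING
);

-- The filter should then match regardless of case.
SELECT *
FROM sales_ci.landing.orders
WHERE order_ref = 'abc123';
```

It is worth testing this against tables that already exist, since the default is described above as something tables inherit rather than something applied retroactively.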
15 hours ago
Thanks.
I'll test collation at catalog, schema, and table level using DBR 17.1.