Custom ENUM input as parameter for SQL UDF?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-20-2024 10:12 AM
Hello -
We're migrating from T-SQL to Spark SQL. We're migrating a significant number of queries.
"datediff(unit, start,end)" is different between these two implementations (in a good way). For the purpose of migration, we'd like to stay as consistent with possible, so I was evaluating writing a custom migration-specific UDF that emulated the t-sql behavior - so people could transition over time with minimal disruption.
I'm bumping into something I have no idea how to achieve - allowing ENUMs to be exposed to SQL from a Python function.
For example -
SELECT
DATEDIFF(
DAY, -- <---- the enum unit
TIMESTAMP'1999-12-31 23:59:59',
TIMESTAMP'2000-01-01 23:59:58'
) diff_day
Note the "DAY" there is NOT a string - it's some for of ENUM exposed to Spark SQL. I'd like to achieve that similar behavior if at all possible, but I'm not sure how to go about doing it. I'm writing the function in Python and then registering the UDF ala:
When I call this in SQL, though, I get an error (which I kind of expected):
%sql
- Labels:
-
Spark
data:image/s3,"s3://crabby-images/d6be0/d6be025e52e1a61c30ea16a2fda1ef9155483c43" alt=""
data:image/s3,"s3://crabby-images/d6be0/d6be025e52e1a61c30ea16a2fda1ef9155483c43" alt=""