I am in a situation where I have a notebook that runs in a Delta Live Tables (DLT) pipeline and creates a streaming live table, so I cannot use any language other than SQL in the pipeline. I would like to format a certain column using Scala code (the formatting is complicated and difficult to replicate in SQL).
Spark allows you to register Scala methods as UDFs and then call those registered methods from SQL.
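For context, here is how that would normally look outside DLT — a minimal sketch, where `formatCode` is a hypothetical stand-in for my actual formatting logic and `spark` is the SparkSession that a regular notebook provides:

```scala
// Hypothetical formatter standing in for the complicated formatting logic:
// trim, uppercase, and collapse non-alphanumeric characters to a dash.
val formatCode: String => String =
  raw => raw.trim.toUpperCase.replaceAll("[^A-Z0-9]+", "-")

// In a regular (non-DLT) Scala notebook cell, `spark` is predefined, so the
// UDF can be registered and then used from any SQL cell in the same session:
// spark.udf.register("format_code", formatCode)
// %sql SELECT format_code(code_column) FROM my_table

println(formatCode(" ab 12 "))  // prints "AB-12"
```

The problem is that the registration step above has no place to live in a SQL-only DLT pipeline.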
But given my current situation (a SQL-only DLT pipeline), I cannot include the Scala method, or the statement that registers it in the Spark context, in the notebook.
Is there any workaround here?