Databricks Community

gvvishnu · ‎01-22-2025

current project we are using murmur hash function in hadoop.we are planning for migration to databricks.can databricks support murmur hash function ?

brockb · ‎01-25-2025

Hi @gvvishnu ,

Thanks for your question.

My understanding is that the Apache Spark `hash()` function implements the `org.apache.spark.sql.catalyst.expressions.Murmur3Hash` expression.

You can see this in the Spark source code here:

https://github.com/apache/spark/blob/master/sql/core/src/test/resources/sql-functions/sql-expression...

The best way however to confirm that this is equivalent to the existing 'murmur hash function in hadoop' that you have previously used is to do some tests and comparisons. Please do that testing and verify that this function returns the expected results.

Thank you.

Databricks Community

can databricks support murmur hash function

Join Us as a Local Community Builder!

PSA: Community Edition retires on January 1, 2026. Move to the Free Edition today to keep your work.

🎤 Call for Presentations: Data + AI Summit 2026 is Open!

Last Chance: Help Shape the 2026 Data + AI Summit | Win a Full Conference Pass

🌟 Community Pulse: Your Weekly Roundup! December 05 – 11, 2025

Jaipur Usergroup First Virtual Meetup: AI/BI Genie + Data Science Careers — 19 Dec | 6 PM IST