Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Apache Hudi table creation using the Hudi Maven library

ros
New Contributor III

I installed the Hudi Maven library org.apache.hudi:hudi-spark3.3-bundle_2.12:0.13.0

on Databricks Runtime 12.2 LTS (includes Apache Spark 3.3.2, Scala 2.12)

with this Spark config:

spark.sql.catalog.spark_catalog org.apache.spark.sql.hudi.catalog.HoodieCatalog
spark.serializer org.apache.spark.serializer.KryoSerializer
spark.sql.extensions org.apache.spark.sql.hudi.HoodieSparkSessionExtension

I ran this Python command:

import org.apache.hudi.DataSourceReadOptions._
import org.apache.hudi.DataSourceWriteOptions._
import org.apache.hudi.config.HoodieWriteConfig._

which gave me this error:

ModuleNotFoundError: No module named 'org.apache.hudi'
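
For context, the three `import org.apache.hudi...` lines are Scala syntax. In a Python cell the import machinery looks for a Python package by that name, so this error does not by itself show that the Maven library is missing. From Python, Hudi write options are usually passed as plain string keys instead. A minimal sketch, assuming the table and column names from the SQL in this post; the helper `hudi_write_options` is hypothetical and not part of Hudi, and the key generator setting is an assumption made because two partition columns are used:

```python
def hudi_write_options(table_name, record_key, precombine_field, partition_fields,
                       table_type="COPY_ON_WRITE"):
    """Build Hudi datasource write options as plain string keys (hypothetical helper)."""
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.table.type": table_type,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        # Several partition columns are passed as one comma-separated string.
        "hoodie.datasource.write.partitionpath.field": ",".join(partition_fields),
        # Assumption: a multi-column partition path needs the complex key generator.
        "hoodie.datasource.write.keygenerator.class":
            "org.apache.hudi.keygen.ComplexKeyGenerator",
        "hoodie.datasource.write.operation": "upsert",
    }


# Names taken from the CREATE TABLE statement in this thread.
options = hudi_write_options("hudi_cow_pt_tbl", "id", "ts", ["dt", "hh"])

# In a notebook where `df` is a Spark DataFrame with these columns and
# `base_path` is the table location (both assumed here, not defined):
# df.write.format("hudi").options(**options).mode("append").save(base_path)
```

The actual write is left as a comment because it needs a running Spark session with the Hudi bundle attached.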

Then I ran this SQL command in the notebook:

create table hudi_cow_pt_tbl (
id bigint,
name string,
ts bigint,
dt string,
hh string
) using hudi
tblproperties (
type = 'cow',
primaryKey = 'id',
preCombineField = 'ts'
)
partitioned by (dt, hh)
location 's3://incred-databricks-data/hudi_dms_data/hudi_cow_pt_tbl';

which gives me this error:

java.io.FileNotFoundException: No such file or directory: s3://incred-databricks-data/hudi_dms_data/hudi_cow_pt_tbl

whereas this path exists: s3://incred-databricks-data/hudi_dms_data/


shan_chandra
Esteemed Contributor

@Roshan RC - Can you please try with a mount location instead and let us know?

ros
New Contributor III

@Shanmugavel Chandrakasu

%sql
create table hudi_cow_pt_tbl (
id bigint,
name string,
ts bigint,
dt string,
hh string
) using hudi
tblproperties (
type = 'cow',
primaryKey = 'id',
preCombineField = 'ts'
)
partitioned by (dt, hh)
location '/mnt/data/hudi_dms_data/hudi_cow_pt_tbl';

It still gave me an error:

org.apache.hudi.exception.TableNotFoundException: Hoodie table not found in path Unable to find a hudi table for the user provided paths.
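
One way to narrow this down: Hudi treats a directory as a table only if it contains a `.hoodie` metadata folder, so it may help to check whether that folder exists at the location. A minimal sketch, assuming the mount is visible through the local `/dbfs` path on the driver (which is not the case on every cluster type); `looks_like_hudi_table` is a hypothetical helper, not a Hudi API:

```python
import os


def looks_like_hudi_table(base_path):
    """Return True if base_path contains a .hoodie metadata directory."""
    return os.path.isdir(os.path.join(base_path, ".hoodie"))


# Assumed local view of the mount location used in the CREATE TABLE statement.
base_path = "/dbfs/mnt/data/hudi_dms_data/hudi_cow_pt_tbl"
print(base_path, "->", looks_like_hudi_table(base_path))
```

If this prints `False`, the location has not been initialized as a Hudi table, which would be consistent with the exception. It would not explain why the CREATE TABLE statement did not initialize it.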
