Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

Forum Posts

Jack_Watson
by Contributor
  • 10959 Views
  • 4 replies
  • 0 kudos

Resolved! I am saving a new feature table to the Databricks feature store, and it won't write the data sources of the tables used to create the feature table, because they are Hive tables that point to Azure Data Lake Storage Gen1 Delta tables

My notebook pulls in Hive tables from DBFS that point to ADLS Gen1 file locations for their data (Delta tables), creates the feature table as a DataFrame within the notebook, then calls the feature store client to save down the feature t...

Latest Reply
Atanu
Databricks Employee
  • 0 kudos

@Jack Watson Could you please confirm that the write is succeeding? If so, my understanding is that this is only a warning from a validation step that we will be removing shortly. We'll likely remove the validation that saves the data source. Thanks.

  • 0 kudos
3 More Replies
thib
by New Contributor III
  • 2461 Views
  • 3 replies
  • 4 kudos

Resolved! Feature store : Can create_training_set() be implemented to execute an inner join?

For timeseries feature tables, an inner join is made at the creation of the feature table. For the other type of feature tables, a left join is made, so NaN values can show up in the training set. Can the inner join in create_training_set() method be...

Latest Reply
thib
New Contributor III
  • 4 kudos

Thank you, Hubert, that's a good alternative. I just thought I'd stick to the API as much as possible, but this solves it.

  • 4 kudos
2 More Replies
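The alternative mentioned in the reply above is not shown in this listing, so the following is only a minimal sketch of one common workaround, not the approach from the thread: do the left join, then drop the rows whose looked-up features are missing, which leaves the same rows an inner join would. It uses plain Python dictionaries rather than the Feature Store API, and every name in it (`labels`, `features`, `left_join`, `drop_missing`, `customer_id`, `avg_spend`) is hypothetical.

```python
# Hypothetical toy data; none of these names come from the Feature Store API.
labels = [
    {"customer_id": 1, "label": 0},
    {"customer_id": 2, "label": 1},  # no features exist for this key
    {"customer_id": 3, "label": 0},
]
features = {
    1: {"avg_spend": 12.5},
    3: {"avg_spend": 40.0},
}


def left_join(rows, feature_lookup, key, feature_names):
    """Attach features to every row; keys without a match get None."""
    joined = []
    for row in rows:
        found = feature_lookup.get(row[key], {})
        looked_up = {name: found.get(name) for name in feature_names}
        joined.append({**row, **looked_up})
    return joined


def drop_missing(rows, feature_names):
    """Keep only rows where every feature is present (inner-join result)."""
    return [
        row for row in rows
        if all(row[name] is not None for name in feature_names)
    ]


# Left join keeps all three label rows, one of them with a missing feature.
training_rows = left_join(labels, features, "customer_id", ["avg_spend"])

# Dropping rows with missing features leaves only the matched keys.
inner_rows = drop_missing(training_rows, ["avg_spend"])
```

On a Spark DataFrame, such as the one returned by the training set's `load_df()`, the same filtering step could be written with `DataFrame.dropna(subset=[...])`. Note that this also drops rows whose feature value is genuinely null in the feature table, not only rows whose key has no match.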
Edmondo
by New Contributor III
  • 2661 Views
  • 0 replies
  • 0 kudos

MLflow and Feature Store: mlflow.spark.autolog, using feature store on Databricks, FeatureStoreClient.log_model()?

As I take my first steps in the Databricks Machine Learning Workspace, I am getting confused by some features that, according to the documentation, seem to overlap. Does autolog for Spark on MLflow provide different tracking than using a training set crea...

mhansinger
by New Contributor II
  • 2130 Views
  • 1 replies
  • 1 kudos

Resolved! Get FeatureStore write date

Hi, is there a way to get the timestamp of the last update of a feature store table with the feature store client API? The creation timestamp can be queried as: feature_store.FeatureStoreClient().get_feature_table(name="my.table").creation_timestam...

Latest Reply
sean_owen
Databricks Employee
  • 1 kudos

(The question is about querying table metadata, not creating one.) As far as I can see, there isn't a way to query this in the current API in 10.2.

  • 1 kudos
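Since the offline feature store is backed by Delta tables, one possible workaround outside the feature store client API is to read the Delta transaction history of the underlying table. The sketch below assumes the feature table can be addressed by name as a Delta table and that a Spark session is passed in (in a Databricks notebook, the predefined `spark`). The helper name `last_write_timestamp` is hypothetical; `DESCRIBE HISTORY ... LIMIT 1` is the Delta Lake SQL command that returns the most recent commit.

```python
def last_write_timestamp(spark, table_name):
    """Return the timestamp of the latest Delta commit on table_name.

    Assumption: table_name refers to a Delta table the session can read.
    Returns None if the history query yields no rows.
    """
    # DESCRIBE HISTORY lists commits newest first, so LIMIT 1 is the latest.
    latest = spark.sql(f"DESCRIBE HISTORY {table_name} LIMIT 1").first()
    return latest["timestamp"] if latest is not None else None
```

In a notebook this could be called as `last_write_timestamp(spark, "my.table")`. The latest commit is not necessarily a feature write: maintenance operations such as OPTIMIZE also appear in the history, so the `operation` column may need to be checked as well.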
MoJaMa
by Databricks Employee
  • 1245 Views
  • 1 replies
  • 0 kudos
Latest Reply
MoJaMa
Databricks Employee
  • 0 kudos

Data is stored in the data plane. Metadata (e.g. feature table descriptions, column types, etc.) is stored in the control plane. The location where the Delta table is stored is determined by the database location. The customer could call CREATE DATA...

  • 0 kudos
User16789201666
by Databricks Employee
  • 1772 Views
  • 1 replies
  • 0 kudos

When would you use the Feature Store?

For example, would you use a feature store on your raw data, and what is the granularity of the features in the store?

Latest Reply
Joseph_B
Databricks Employee
  • 0 kudos

I'll try to answer the broad question first, followed by the specific ones.

When would you use the Feature Store?

A Feature Store is primarily used to solve 2 challenges.

(1) Discoverability and governance of features

Challenge: In a large team or organi...

  • 0 kudos
Anonymous
by Not applicable
  • 2035 Views
  • 1 replies
  • 0 kudos
Latest Reply
sajith_appukutt
Honored Contributor II
  • 0 kudos

The feature store has both online and offline components. The offline feature store is used for feature discovery, model training, and batch inference, and is backed by Delta tables. You could read from and write to the offline store from Databricks clusters that...

  • 0 kudos
User16789201666
by Databricks Employee
  • 3902 Views
  • 1 replies
  • 0 kudos

When should we use offline store vs online store for Feature Store?

Looking at the docs, we see both options. Can we use both, for example?

Latest Reply
User16789201666
Databricks Employee
  • 0 kudos

The online store is for real-time inference; in most cases you will use the offline store.

  • 0 kudos