Community Platform Discussions

by Rishabh-Pandey • Esteemed Contributor

10-21-2024 4:59:40 AM

1485 Views
5 replies
0 kudos

Resolved! Issues with Content Writing on Databricks Community

Hi @Sujitha, @Rishabh_Tiwari, I wanted to bring to your attention that whenever I’m writing content on Databricks, I often encounter errors due to invalid HTML. Additionally, some terms seem to be prohibited by the Databricks community, which is puz...

Community Platform Discussions

Latest Reply

Rishabh_Tiwari
Databricks Employee

10-21-2024 5:27:08 AM

0 kudos

@Rishabh-Pandey I understand. Please be assured that I am actively working on this and tracking these posts to update our filter on a regular basis. If you come across something similar again, feel free to tag me, and I'll take care of that.

4 More Replies

by Rishabh-Pandey • Esteemed Contributor

10-21-2024 4:02:31 AM

537 Views
0 replies
1 kudos

Live, Virtual Workshop: How to build a Golden Data Warehouse in Financial Services with Databricks

Reasons to join: Most Financial Services organizations have major on-prem investments. You can use that as your starting point to activate your organization on gold-level insights in the cloud. Providing a path to easier and quicker migration to the c...

Community Platform Discussions

by kro • New Contributor

10-21-2024 1:34:13 AM

491 Views
0 replies
0 kudos

OCRmyPDF in Databricks

Hello, do any of you have experience with using OCRmyPDF in Databricks? I have tried to install it in various ways with different versions, but my notebook keeps crashing with the error: The Python process exited with exit code 139 (SIGSEGV: Segmentation...

Community Platform Discussions

ocr

ocrmypdf

pdf

segmentation fault

tesseract

by Sourav789027 • New Contributor II

10-19-2024 12:58:09 AM

362 Views
1 reply
1 kudos

Databricks Certifications

Hello everyone, my name is Sourav Das. I am from Kolkata, currently working as an Azure Data Engineer at Cognizant. I have cleared multiple Databricks certifications (Databricks Data Engineer Associate, Databricks Data Engineer Professional, databricks d...

Community Platform Discussions

Latest Reply

gchandra
Databricks Employee

10-20-2024 2:49:33 PM

1 kudos

Good luck. You can continue to improve your skills by helping other community members on this platform.

by DineshReddyN • New Contributor II

10-17-2024 1:19:58 AM

1096 Views
5 replies
0 kudos

Filestore endpoint not visible in Databricks community edition

In the Community Edition of Databricks, after multiple attempts to enable it and refresh, I am unable to navigate to the FileStore endpoint. It is not visible under Catalog.

Community Platform Discussions

Databricks Community Edition

filestore

GUI

Latest Reply

gchandra
Databricks Employee

10-18-2024 12:57:20 PM

0 kudos

Follow these alternative solutions: https://community.databricks.com/t5/data-engineering/databricks-community-edition-dbfs-alternative-solutions/td-p/94933

4 More Replies

by Sudheer2 • New Contributor III

10-18-2024 3:12:21 AM

322 Views
1 reply
0 kudos

Migrating ML Model Experiments Using Python REST APIs

Hi everyone, I’m looking to migrate ML model experiments from a source Databricks workspace to a target workspace. Specifically, I want to use Python and the available REST APIs for this process. Can anyone help me with this? Thanks in advance!

Community Platform Discussions

Latest Reply

gchandra
Databricks Employee

10-18-2024 12:56:22 PM

0 kudos

You can use the https://github.com/mlflow/mlflow-export-import utility. The example given below doesn't use Python but uses the CLI and a CI/CD pipeline to do the same. https://medium.com/@gchandra/databricks-copy-ml-models-across-unity-catalog-metastores-188...
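
For the Python-plus-REST route asked about in the question, the following is only a minimal sketch, not the method from the reply or from mlflow-export-import. It assumes the MLflow REST endpoints `experiments/search` and `experiments/create` under `/api/2.0/mlflow/` and personal access token authentication; the host and token values are placeholders. It recreates experiments by name only and does not copy runs, metrics, or artifacts.

```python
import json
import urllib.request


def build_mlflow_request(host: str, token: str, endpoint: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request to an MLflow REST endpoint."""
    url = f"{host.rstrip('/')}/api/2.0/mlflow/{endpoint}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def copy_experiment_names(src_host: str, src_token: str, dst_host: str, dst_token: str) -> None:
    """Recreate every source experiment, by name only, in the target workspace."""
    page_token = None
    while True:
        payload = {"max_results": 100}
        if page_token:
            payload["page_token"] = page_token
        page = send(build_mlflow_request(src_host, src_token, "experiments/search", payload))
        for experiment in page.get("experiments", []):
            # Raises urllib.error.HTTPError if the name already exists in the target.
            send(build_mlflow_request(dst_host, dst_token, "experiments/create",
                                      {"name": experiment["name"]}))
        page_token = page.get("next_page_token")
        if not page_token:
            break
```

Copying runs would need further calls per experiment, which is the part the export-import utility mentioned in the reply already handles.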

by abubakar-saddiq • New Contributor

10-17-2024 7:33:31 AM

2008 Views
2 replies
1 kudos

How to Pass Dynamic Parameters (e.g., Current Date) in Databricks Workflow UI?

I'm setting up a job in the Databricks Workflow UI and I want to pass a dynamic parameter, like the current date (run_date), each time the job runs. In Azure Data Factory, I can use expressions like @utcnow() to calculate this at runtime. However, I w...

Community Platform Discussions

Latest Reply

-werners-
Esteemed Contributor III

10-18-2024 12:16:47 AM

1 kudos

As szymon mentioned, dynamic parameter values exist, but the functionality is still far from what Data Factory has to offer. I am pretty sure, though, that this will be extended. So for the moment I suggest you do the value derivation in Data Factory, an...
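
As a complement to deriving the value upstream, here is a minimal sketch of resolving run_date inside the task itself. It assumes the job passes a parameter named run_date as an ISO date string, possibly empty, and falls back to the current UTC date. Databricks documents dynamic value references such as `{{job.start_time.iso_date}}` that could populate the parameter; treat that exact reference as an assumption to verify against the current documentation. The helper name `resolve_run_date` is hypothetical.

```python
from datetime import date, datetime, timezone
from typing import Optional


def resolve_run_date(raw_value: Optional[str]) -> date:
    """Return the run date from a job parameter, or today's UTC date if it is missing."""
    if raw_value is None or not raw_value.strip():
        return datetime.now(timezone.utc).date()
    # Expects an ISO date such as "2024-10-17"; raises ValueError otherwise.
    return date.fromisoformat(raw_value.strip())


# In a Databricks notebook task the raw value would typically come from a widget:
# run_date = resolve_run_date(dbutils.widgets.get("run_date"))
```

Keeping the fallback in one function means a manual run with no parameter and a scheduled run with one go through the same code path.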

1 More Reply

by oleprince • New Contributor

10-14-2023 9:03:23 AM

5248 Views
1 reply
0 kudos

Delta table definition - Identity column

Hello, would anyone know if it is possible to create a Delta table using Python that includes a column that is generated by default as identity (an identity column for which the inserted value can be manually overridden)? There seems to be a way to create ...

Community Platform Discussions

Latest Reply

gmiguel
Contributor

10-17-2024 9:29:22 AM

0 kudos

Hi @oleprince, as far as I know, it's not yet possible to create tables with identity columns using PySpark (the DeltaTable API). You can create generated columns, but identity columns are not allowed. The only way to achieve this is through Spark SQL.
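
The Spark SQL route can still be driven from Python by building the DDL and passing it to `spark.sql`. This is a minimal sketch; the table name `demo.customers`, the `name` column, and the helper `identity_table_ddl` are hypothetical.

```python
def identity_table_ddl(table_name: str, id_column: str = "id") -> str:
    """Build a CREATE TABLE statement with an identity column that accepts manual values."""
    return (
        f"CREATE TABLE IF NOT EXISTS {table_name} (\n"
        f"  {id_column} BIGINT GENERATED BY DEFAULT AS IDENTITY,\n"
        "  name STRING\n"
        ") USING DELTA"
    )


ddl = identity_table_ddl("demo.customers")

# In a Databricks notebook, where `spark` is already defined:
# spark.sql(ddl)
```

`GENERATED BY DEFAULT` is what allows explicitly inserted values to override the generated ones; `GENERATED ALWAYS` would reject them.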

by fiverrpromotion • New Contributor

10-14-2024 2:19:25 AM

528 Views
1 reply
0 kudos

Addressing Memory Constraints in Scaling XGBoost and LGBM: A Comprehensive Approach for High-Volume

Scaling XGBoost and LightGBM models to handle exceptionally large datasets—those comprising billions to tens of billions of rows—presents a formidable computational challenge, particularly when constrained by the limitations of in-memory processing o...

Community Platform Discussions

Latest Reply

earntodiessaz
New Contributor II

10-17-2024 3:22:27 AM

0 kudos

Well, that's a superb article! Thank you for this great information, you write very well which I like very much. I am really impressed by your post. run 3

by Chris_Shehu • Valued Contributor III

07-27-2023 12:38:26 PM

1282 Views
1 reply
1 kudos

Feature Request: GUI: Additional Collapse options

When you're using a very large notebook, it sometimes gets frustrating scrolling through all the code blocks. It would be nice to have a few additional options to make this easier. 1) Add a "collapse all code cells" button to the top. 2) Add a collapse a...

Community Platform Discussions

Enhancement

Feature

GUI

Request

Latest Reply

fdawoud
New Contributor II

10-14-2024 2:47:18 PM

1 kudos

this feature please

by qwerty3 • Contributor

10-14-2024 12:58:45 PM

672 Views
1 reply
0 kudos

Resolved! Does a queued Databricks job incur cost?

Does a queued Databricks job incur cost?

Community Platform Discussions

Latest Reply

filipniziol
Esteemed Contributor

10-14-2024 1:05:00 PM

0 kudos

Hi @qwerty3, no, it does not. When a job is queued (waiting for an available cluster or resources), there is no compute usage, so no charges are incurred for Databricks units (DBUs) or cloud infrastructure (VMs). The queued state is essentially a wai...

by tejaswi24 • New Contributor III

09-30-2024 4:52:35 AM

2001 Views
11 replies
1 kudos

Resolved! Databricks Asset Bundle

I came across documentation on asset bundles a while back which states that when you type `databricks bundle init`, it gives us the option to choose a project type. But I see the below error when I do that. Is there a way I can take ...

Community Platform Discussions

Databricks Asset Bundle

databricks bundle

Latest Reply

gchandra
Databricks Employee

10-07-2024 10:50:13 AM

1 kudos

Bash

10 More Replies

by DW • New Contributor II

10-11-2024 9:07:27 AM

1113 Views
1 reply
2 kudos

column mask on <tinyint>Y columns gives error

My table breaks when I try to mask a column with a name like `<tinyint>Y`.
-- Create a table with a masked column
> CREATE FUNCTION mask_int_col(col_val INTEGER) RETURN CASE WHEN is_member('HumanResourceDept') THEN col_val ELSE CAST(NULL as INTEGER) EN...

Community Platform Discussions

Latest Reply

filipniziol
Esteemed Contributor

10-13-2024 4:09:44 AM

2 kudos

Hi @DW, I have replicated your scenario and encountered the same error when applying a column mask to a column named 1Y in Databricks SQL. In short, it makes sense simply to follow Databricks documentation and use the SQL naming conventions, so that c...
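
One way to catch such names before creating a table is a small pre-check. This is a sketch under an assumption the thread does not confirm: that the trouble comes from names like 1Y having the same shape as a Spark SQL numeric literal (digits followed by a type suffix such as Y, S, L, F, D or BD). The pattern is deliberately simplified, and both helper names and the `c_` prefix are hypothetical.

```python
import re

# Simplified pattern: digits optionally followed by a numeric-literal type suffix.
_NUMERIC_LITERAL_LIKE = re.compile(r"\d+(Y|S|L|F|D|BD)?", re.IGNORECASE)


def looks_like_numeric_literal(column_name: str) -> bool:
    """Return True if the whole column name has the shape of a numeric literal."""
    return _NUMERIC_LITERAL_LIKE.fullmatch(column_name) is not None


def safe_column_name(column_name: str, prefix: str = "c_") -> str:
    """Prefix names that look like numeric literals; leave other names unchanged."""
    if looks_like_numeric_literal(column_name):
        return f"{prefix}{column_name}"
    return column_name
```

Renaming at ingestion time keeps the stored column names conventional, so later statements do not depend on quoting the name correctly everywhere.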

by abueno • New Contributor III

10-11-2024 6:03:49 PM

492 Views
1 reply
0 kudos

Databricks Pyspark filter several columns with similar criteria

I am querying a table from the Databricks Catalog in which I have to filter several columns with the same criteria. Below is what I have created so far. I have 10 columns that I have to filter with a set of criteria from (dx_list1) and another 10 that I ...

Community Platform Discussions

Latest Reply

filipniziol
Esteemed Contributor

10-13-2024 3:18:25 AM

0 kudos

Hi @abueno, as I understand it, the logic you want to implement is:
1. For every pair of columns:
   - First Column (DX_i): Must be in dx_list1
   - Second Column (DX_{i+1}): Must be in dx_list2
2. The condition for each pair is:
   col('DX_i').isin(dx_list1) OR col('DX_{...
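
The pairwise logic described above could be assembled programmatically instead of writing each condition by hand. This is a minimal sketch, not the code from the reply, which is cut off. It assumes the second half of each pair condition tests the second column against dx_list2, and that the per-pair conditions are combined with AND (pass `combine=or_` from `operator` if any matching pair should suffice). The column factory is passed in so the builder itself has no Spark dependency; the column names DX_1 to DX_20 are hypothetical.

```python
from functools import reduce
from operator import and_


def build_pair_filter(col, pairs, dx_list1, dx_list2, combine=and_):
    """Combine one condition per column pair into a single filter expression.

    `col` is a column factory such as pyspark.sql.functions.col.
    """
    if not pairs:
        raise ValueError("pairs must contain at least one (first, second) tuple")
    per_pair = [
        col(first).isin(dx_list1) | col(second).isin(dx_list2)
        for first, second in pairs
    ]
    return reduce(combine, per_pair)


# In a Databricks notebook (assumed names):
# from pyspark.sql.functions import col
# pairs = [(f"DX_{i}", f"DX_{i + 1}") for i in range(1, 20, 2)]
# filtered = df.filter(build_pair_filter(col, pairs, dx_list1, dx_list2))
```

Using `reduce` keeps the filter the same length however many column pairs there are, and the AND or OR choice stays in one place.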

by ayush19 • New Contributor III

07-17-2024 11:42:50 PM

1643 Views
3 replies
1 kudos

How to retrieve Spark Session inside java jar library installed on Cluster

I have a Java app in the form of a JAR package. This JAR is installed on a Databricks cluster. It reads from and writes to a few tables in Databricks. To do that, I need a SparkSession available in the code. Given that spark session is a...

Community Platform Discussions

Latest Reply

IslaGray
New Contributor II

10-11-2024 3:13:59 AM

1 kudos

Thanks for the update, I will try it too.

2 More Replies

Databricks Community

Forum Posts

Join Us as a Local Community Builder!

Query: Extracting Resolved 'Input' Parameter from ...

Error when executing an INSERT statement on an Ext...

How best to measure the time-spent-waiting-for-an-...

When is it time to change from ETL in notebooks to...

Deduplication with rocksdb, should old state files...