Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

chethankumar
by New Contributor III
  • 1074 Views
  • 2 replies
  • 0 kudos

How to add existing recipient to existing delta share

I have created a recipient using the Databricks console, and I have also created a Delta Share in the console. Now I want to map the existing recipient to the existing Delta Share. Is there a way to do this using Terraform?

Latest Reply
chethankumar
New Contributor III
  • 0 kudos

@SathyaSDE Thank you for your response. I am looking into a Terraform-based approach for this.
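One possible Terraform-based approach, offered as a hedged sketch: in the Databricks Terraform provider, access to a share can be granted to a recipient with the `databricks_grants` resource. The share and recipient names below are illustrative placeholders for your existing objects.

```hcl
# Sketch only: assumes the share and recipient already exist in the metastore.
# "my_existing_share" and "my_existing_recipient" are illustrative names.
resource "databricks_grants" "share_to_recipient" {
  share = "my_existing_share" # name of the existing Delta Share

  grant {
    principal  = "my_existing_recipient" # name of the existing recipient
    privileges = ["SELECT"]
  }
}
```

If you also want the share and recipient themselves under Terraform management, they can typically be brought in with `terraform import` before applying grants.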

1 More Replies
Brad
by Contributor II
  • 1511 Views
  • 2 replies
  • 1 kudos

Resolved! dropDuplicates inside foreachBatch

Hi, if I use dropDuplicates inside foreachBatch, dropDuplicates becomes stateless: it just drops duplicates within the current micro-batch, so I don't have to specify a watermark. Is this true? Thanks

Latest Reply
navallyemul
New Contributor III
  • 1 kudos

Yes, you're correct! When using dropDuplicates within foreachBatch, it operates only on the current micro-batch, so it removes duplicates in a stateless manner for each batch independently. Since there's no continuous state tracking across batches, y...

1 More Replies
Subhodeep
by New Contributor
  • 1491 Views
  • 2 replies
  • 0 kudos

Fetching queries submitted for review via Genie

Hi All, I wanted to know if there is a way to export the list of queries submitted for review via Genie using an API call. I know there is an API to fetch the query run history, but what I need is to fetch the list of reviews via Genie. It would be great if ...

Latest Reply
SathyaSDE
Contributor
  • 0 kudos

Hi Subhodeep, is the new system table "system.access.assistant_events" closer to what you are looking for? With this new system table introduced, it seems we can expect more around this in a future release. More informatio...
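As a sketch, that system table can be queried from a notebook along these lines. The exact columns of system.access.assistant_events are not shown in this thread, so SELECT * is used, and availability depends on the workspace having system tables enabled.

```python
# Query for the system table mentioned in the reply; LIMIT is illustrative.
ASSISTANT_EVENTS_QUERY = "SELECT * FROM system.access.assistant_events LIMIT 100"

def fetch_assistant_events(spark):
    # Requires a Unity Catalog workspace where this system table is enabled;
    # run inside Databricks, where `spark` is provided.
    return spark.sql(ASSISTANT_EVENTS_QUERY)
```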

1 More Replies
mmenjivar
by New Contributor II
  • 1637 Views
  • 1 replies
  • 0 kudos

How to use SQL Streaming tables

We have been testing the usage of Streaming Tables in our pipelines, with different results depending on the streaming source. For Streaming Tables reading from read_files, everything works as expected. For Streaming Tables reading from read_kafka, we have ...

Latest Reply
SathyaSDE
Contributor
  • 0 kudos

Hi - please note:
1) Structured Streaming and Delta Live Tables are two different options and have different syntaxes.
2) You cannot execute DLT code in a notebook directly. It has to be run as a job.
Please refer below: https://docs.databricks.com/en/delta-live-tables/sq...

alpine
by New Contributor
  • 9091 Views
  • 4 replies
  • 0 kudos

Deploy lock force acquired error when deploying asset bundle using databricks cli

I'm running this command on a DevOps pipeline:
databricks bundle deploy -t dev
I receive this error and have tried using --force-lock, but it still doesn't work:
Error: deploy lock force acquired by name@company.com at 2024-02-20 16:38:34.99794209 +0000 ...

Latest Reply
manish1987c
New Contributor III
  • 0 kudos

You can use the command below:
databricks bundle deploy -t dev --force-lock

3 More Replies
HeronPePrestSer
by New Contributor
  • 1030 Views
  • 1 replies
  • 0 kudos

Does EXECUTE IMMEDIATE work with a JDBC connection?

Hello, I need help. I am trying to use the EXECUTE IMMEDIATE command to perform DELETE or DROP operations on a table located on a remote SQL Server (on-premises) using a JDBC connection from a notebook in the Databricks environment. While I can succes...

Latest Reply
SathyaSDE
Contributor
  • 0 kudos

Hi - what error are you getting? Do you have sufficient permissions to drop or delete the table?

sebasv
by New Contributor II
  • 1365 Views
  • 2 replies
  • 0 kudos

Inconsistent behaviour in group by and order by

Consider this minimal example:
with t as (select explode(sequence(1, 10, 1)) as id)
select (id % 2) as id from t
group by id
order by id
I would expect an ambiguous column name exception, since the grouping and sorting could apply to 2 different `id` columns....

Latest Reply
SathyaSDE
Contributor
  • 0 kudos

Hi, this is not an issue; please consider the order of execution of SQL clauses. The ORDER BY clause will always refer to the columns selected/displayed (as you refer to id everywhere, I guess there is confusion). An ambiguous column name exception occurs...

1 More Replies
sebasv
by New Contributor II
  • 1430 Views
  • 2 replies
  • 0 kudos

NullpointerException when creating a notebook widget

To reproduce, execute this line in a notebook (runtime 15.3):
dbutils.widgets.multiselect("foo", None, [None])
Exception raised:
Py4JJavaError: An error occurred while calling o427.createMultiselectWidget. : java.lang.NullPointerException at com.databr...

Latest Reply
SathyaSDE
Contributor
  • 0 kudos

Hi - please see below. I hope it helps!

1 More Replies
StephanKnox
by New Contributor III
  • 10999 Views
  • 4 replies
  • 2 kudos

Unit Testing with PyTest in Databricks - ModuleNotFoundError

Dear all, I am following the guide in this article: https://docs.databricks.com/en/notebooks/testing.html. However, I am unable to run pytest due to the following error: ImportError while importing test module '/Workspace/Users/deadmanhide@gmail.com/test...

Latest Reply
saurabh18cs
Honored Contributor II
  • 2 kudos

Hi, after trying a lot I was able to see some success; see if this is what you are looking for.
notebook_test.py (this is the Python code file):
from pyspark.sql import functions as F
def sum_values(df):
    return df.agg(F.sum("value")).first()[0]
def ...
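A common cause of the ImportError when pytest runs from a Databricks workspace folder is that the repo root is not on sys.path when test modules are collected. One hedged workaround (the helper name and paths are illustrative, not a Databricks API) is a small helper in conftest.py:

```python
# conftest.py sketch: make sibling modules importable during pytest
# collection. This is generic Python path handling, nothing Databricks-specific.
import os
import sys

def add_repo_root_to_path(root: str) -> None:
    # Prepend the folder so `import my_module` resolves; idempotent, so
    # repeated pytest runs don't grow sys.path.
    if root not in sys.path:
        sys.path.insert(0, root)

# In a real conftest.py you would call it at import time, e.g.:
# add_repo_root_to_path(os.path.dirname(os.path.abspath(__file__)))
```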

3 More Replies
nengen
by New Contributor II
  • 1450 Views
  • 3 replies
  • 0 kudos

Debugging difference between "task time" and execution time for SQL query

I have a pretty large SQL query that has the following stats from the query profiler:
Tasks total time: 1.93s
Executing: 27s
Based on the information in the query profiler, this can be due to tasks waiting for available nodes. How should I approach this t...

Latest Reply
Panda
Valued Contributor
  • 0 kudos

@nengen Try using EXPLAIN EXTENDED: this provides a detailed breakdown of the logical and physical plan of a query in Spark SQL. Based on the EXPLAIN EXTENDED output, here are a few things to consider:
Broadcast Exchange: If the join causes data skew, ...

2 More Replies
pranav_k1
by New Contributor III
  • 2755 Views
  • 2 replies
  • 2 kudos

Merging data into table using temp view

I am trying to append data into a table which already exists with some data in it. I need to create a view by joining multiple tables, which will later be used to append data to the final table. I am able to alter the table schema and then run a query to insert data...

Latest Reply
pranav_k1
New Contributor III
  • 2 kudos

Hi @filipniziol, thanks for your reply. My issue is resolved: my fellow developer ran the same commands with a different name, and after some time it worked successfully. FYI - I was running the query in the same notebook, just in different cells, and I was running cells ...
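For reference, the pattern from the question (register the joined DataFrame as a temp view, then feed the final table) can be sketched like this. The table name "final_table" and key column "id" are placeholders, and MERGE requires the target to be a Delta table:

```python
# MERGE pattern sketch; names are illustrative placeholders.
MERGE_SQL = """
MERGE INTO final_table AS t
USING updates AS s
ON t.id = s.id
WHEN MATCHED THEN UPDATE SET *
WHEN NOT MATCHED THEN INSERT *
"""

def merge_updates(spark, updates_df) -> None:
    # Expose the joined DataFrame to SQL as a temp view, then merge it
    # into the existing target table.
    updates_df.createOrReplaceTempView("updates")
    spark.sql(MERGE_SQL)
```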

1 More Replies
confused_dev
by New Contributor II
  • 43059 Views
  • 7 replies
  • 5 kudos

Python mocking dbutils in unittests

I am trying to write some unit tests using pytest, but I am coming across the problem of how to mock my dbutils method when dbutils isn't defined in my notebook. Is there a way to do this so that I can unit test individual functions that are uti...

Latest Reply
pavlosskev
New Contributor III
  • 5 kudos

Fermin_vicente's answer is pretty good already. Below is how you can do something similar with conftest.py:
# conftest.py
import pytest
from unittest.mock import MagicMock
from pyspark.sql import SparkSession

@pytest.fixture(scope="session")
def dbuti...
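Along the same lines, here is a self-contained sketch that mocks dbutils with unittest.mock alone (the helper name, widget values, and the example function are illustrative). Functions under test receive dbutils as a parameter instead of relying on the notebook-provided global:

```python
from unittest.mock import MagicMock

def make_dbutils_mock(widget_values=None):
    # Stand-in for the notebook-provided dbutils object; configure behavior
    # only for the calls your code actually makes.
    widget_values = widget_values or {}
    dbutils = MagicMock()
    dbutils.widgets.get.side_effect = lambda name: widget_values[name]
    dbutils.secrets.get.return_value = "fake-secret"
    return dbutils

def read_env(dbutils) -> str:
    # Example function under test: it takes dbutils as an argument, so it
    # can be exercised without a Databricks runtime.
    return dbutils.widgets.get("env")
```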

6 More Replies
johnb1
by Contributor
  • 35143 Views
  • 16 replies
  • 15 kudos

Problems with pandas.read_parquet() and path

I am doing the "Data Engineering with Databricks V2" learning path. I cannot run "DE 4.2 - Providing Options for External Sources", as the first code cell does not run successfully:
%run ../Includes/Classroom-Setup-04.2
Screenshot 1: Inside the setup note...

Latest Reply
hebied
New Contributor II
  • 15 kudos

Thanks for sharing, it really helped.

15 More Replies
SRK
by Contributor III
  • 5838 Views
  • 5 replies
  • 7 kudos

How to handle schema validation for Json file. Using Databricks Autoloader?

Following are the details of the requirement:
1. I am using a Databricks notebook to read data from a Kafka topic and write into an ADLS Gen2 container, i.e., my landing layer.
2. I am using Spark code to read data from Kafka and write into landing...

Latest Reply
maddy08
New Contributor II
  • 7 kudos

Just to clarify: are you reading from Kafka and writing into ADLS as JSON files? I.e., is each message from Kafka one JSON file in ADLS?

4 More Replies
