cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

Mustafa91
by New Contributor II
  • 2316 Views
  • 2 replies
  • 3 kudos

Databricks job api validation

Hey everyone,​I am creating databricks Jobs using ADO pipelines. I am creating the json content using python and in thr release pipeline i call databricks cli create command with the json. ​W​hat I would like to do is that in my CI pipeline, I need t...

  • 2316 Views
  • 2 replies
  • 3 kudos
Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi @Mustafa Akilli​ Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedba...

  • 3 kudos
1 More Replies
kkawka1
by New Contributor III
  • 5010 Views
  • 2 replies
  • 1 kudos

Resolved! Data explorer in the community edition

Hi,Does anyone know how to access data explorer in the community edition? I would like to have an overview of what files are saved in the FileStore. This is what happens when I select "Data" in the left-hand side menu

image
  • 5010 Views
  • 2 replies
  • 1 kudos
Latest Reply
Anonymous
Not applicable
  • 1 kudos

Hi @Konrad Kawka​ Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.Please help us select the best solution by clicking on "Select As Best" if it does.Your feedback...

  • 1 kudos
1 More Replies
Sanmati
by New Contributor II
  • 3098 Views
  • 2 replies
  • 2 kudos

Resolved! Request for reattempt voucher. Databricks Certified Data Engineer Associate exam

HiOn Feb 27th ,I attempted the Databricks Certified Data Engineer Associate exam for 1st time , unfortunately I ended up by failing grade. My passing grade was 70%, and I received 64.88%.I am planning to reattempt the exam, Could you kindly give me a...

  • 3098 Views
  • 2 replies
  • 2 kudos
Latest Reply
Anonymous
Not applicable
  • 2 kudos

Hi @Sanmati Mahesh Undodi​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from ...

  • 2 kudos
1 More Replies
gangs
by New Contributor
  • 2639 Views
  • 3 replies
  • 3 kudos

Resolved! Getting OOM error while loading huge zipped CSV file to the databricks Hive_metasore table

Is any better way to load huge zipped CSV file to hive_metastore table ?????​

  • 2639 Views
  • 3 replies
  • 3 kudos
Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi @Ankit Gangwal​ Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers...

  • 3 kudos
2 More Replies
Naeem_K
by New Contributor III
  • 2761 Views
  • 5 replies
  • 0 kudos

Data Engineer Associate Certificate and badge not yet received

I have cleared the certification exam on 26th January 2023, but still haven't received the certificate. I had given the exam with a different mail ID but I'm not receiving any emails from Databricks to that mail ID.​Kindly help me resolve the issue.

  • 2761 Views
  • 5 replies
  • 0 kudos
Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hi @Naeemah Khatib​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Tha...

  • 0 kudos
4 More Replies
Galdino
by New Contributor II
  • 6471 Views
  • 3 replies
  • 1 kudos

How to read a json from BytesIO with PySpark?

I want read a json from IO variable using PySpark.My code using pandas:io = BytesIO()ftp.retrbinary('RETR '+ file_name, io.write)io.seek(0)# With pandasdf = pd.read_json(io)What I tried using PySpark, but don't work: io = BytesIO() ftp.retrbinary('...

  • 6471 Views
  • 3 replies
  • 1 kudos
Latest Reply
Erik_L
Contributor II
  • 1 kudos

Just use pandas and follow with spark.createDataFrame(df)

  • 1 kudos
2 More Replies
iwan_aucamp
by New Contributor III
  • 2425 Views
  • 2 replies
  • 2 kudos

Are there any python API bindings for the Databricks Account API?

All I could find in terms of API bindings for python is https://pypi.org/project/databricks-cli/, and this does not include the Account API and it is also not official.   I will just use the OpenAPI spec, but just want to be sure I'm not doing unnece...

  • 2425 Views
  • 2 replies
  • 2 kudos
Latest Reply
Anonymous
Not applicable
  • 2 kudos

@Iwan Aucamp​ : Yes, there are Python API bindings available for the Databricks Account API.For Databricks Account API with Python, please refer to the Databricks documentation: https://docs.databricks.com/dev-tools/api/latest/accounts.html#python-ap...

  • 2 kudos
1 More Replies
Sagar1
by New Contributor III
  • 9641 Views
  • 3 replies
  • 5 kudos

Notebook dropdown widget

I have created a dropdown (say B) in my notebook whose input depend on dropdown( say B). So if select some value in dropdown A, it corresponding value appears in B dropdown & i'm selecting one amongst it. Now if i change the value in dropdown A, then...

  • 9641 Views
  • 3 replies
  • 5 kudos
Latest Reply
nic_paul24
New Contributor II
  • 5 kudos

If the previously selected value of B is not meant to be in the list of values for newly selected dropdown A value, then you could set a default value (ie: 'No selection') that the B dropdown should have when first created. In a method to define how ...

  • 5 kudos
2 More Replies
weldermartins
by Honored Contributor
  • 29894 Views
  • 7 replies
  • 35 kudos

Resolved! pyspark - regexp_extract

hello everyone, I'm creating a regex expression to fetch only the value of a string, but some values ​​are negative. I am not able to create the rule to compose the negative value. can you help me?from pyspark.sql.functions import regexp_extract fro...

image
  • 29894 Views
  • 7 replies
  • 35 kudos
Latest Reply
ErinArmistead
New Contributor II
  • 35 kudos

Have you found the answer? If you are a student in college or school searching for free essay examples online, you may want to visit the website https://writinguniverse.com/free-essay-examples/soccer/ here you will find a vast collection of free essa...

  • 35 kudos
6 More Replies
Baumeister
by New Contributor II
  • 7683 Views
  • 2 replies
  • 0 kudos

Error when importing .dbc of a complete Workspace

I saved the content of an older Databricks Workspace by clicking on the Dropdown next to Workspace -> Export -> DBC Archive and saved it on my local machine.In a new Databricks Workspace, I now want to import That .DBC archive to restore the previous...

dbcerror
  • 7683 Views
  • 2 replies
  • 0 kudos
Latest Reply
Anonymous
Not applicable
  • 0 kudos

@Sebastian K​ :It looks like the error you are facing while importing the DBC archive could be due to the version incompatibility between the Databricks instance where you created the DBC archive and the one where you are trying to import it. Can you...

  • 0 kudos
1 More Replies
bluesky111
by New Contributor II
  • 2973 Views
  • 1 replies
  • 3 kudos

Resolved! I Input the wrong schedule time for the exams can it be reschedule ?

Helo today ,i think i was scheduled to do an exams at 2.15 PM but unfortunately i made a mistake put the time to 2.15 AM, could it be rescheduled? i already submit a ticket to https://help.databricks.com/s/contact-us?ReqType=training but no reply yet...

  • 2973 Views
  • 1 replies
  • 3 kudos
Latest Reply
APadmanabhan
Databricks Employee
  • 3 kudos

Hello @heron halim,​ If the exam time and date have already passed, we cannot help in the situation; we can only change the time/date of the exam if we are notified a minimum of 30 hours before the exam date/time. Test-takers must ensure they check t...

  • 3 kudos
Harun
by Honored Contributor
  • 6692 Views
  • 5 replies
  • 6 kudos

how to load structured stream data into delta table whose location is in ADLS Gen2

Hi All,I am working on a streaming data processing. As a intial step i have read the data from azure eventhub using readstream. now i want to writestream this into a delta table. My requirement is, The data should present in external location (adls g...

  • 6692 Views
  • 5 replies
  • 6 kudos
Latest Reply
Anonymous
Not applicable
  • 6 kudos

There are a couple ways to connect to ADLS Gen2. Please refer to below doc. For instance, if you decide to go by service principal method, you need to add below storage account configurations details to the cluster or notebooks. Same goes for storag...

  • 6 kudos
4 More Replies
KVNARK
by Honored Contributor II
  • 4660 Views
  • 1 replies
  • 4 kudos

Resolved! Deploying global parameters from lower to higher env in ADF

how can we deploy global parameters from dev to higher environments in ADF. Could anyone throw some light on this.I'm using GIT in DEV and deploying it to PROD using Azure CICD pipeline.

  • 4660 Views
  • 1 replies
  • 4 kudos
Latest Reply
Anonymous
Not applicable
  • 4 kudos

@KVNARK .​ : To deploy global parameters from dev to higher environments in Azure Data Factory (ADF), you can follow these steps:In your DEV environment, create the global parameters in ADF and save them.Commit and push the changes to your Git reposi...

  • 4 kudos
alvaro_databric
by New Contributor III
  • 3622 Views
  • 1 replies
  • 2 kudos

Resolved! Fastest Azure VM for Databricks Big Data workload

Hi All,It is well known that Azure provides a wide variety of VM for Databricks, some of which provide powerful features such as Photon and Delta Caching. I would like to ask the community which do you think is the fastests cluster for performing Big...

  • 3622 Views
  • 1 replies
  • 2 kudos
Latest Reply
Anonymous
Not applicable
  • 2 kudos

@Alvaro Moure​ :The performance of a Databricks cluster for big data operations depends on many factors, such as the amount and structure of the data, the nature of the operations being performed, the configuration of the cluster, and the specific re...

  • 2 kudos
Sujitha
by Databricks Employee
  • 7982 Views
  • 0 replies
  • 2 kudos

Weekly Release Notes RecapHere’s a quick recap of the latest release notes updates from the past one week. Databricks platform release notesMarch 13 -...

Weekly Release Notes RecapHere’s a quick recap of the latest release notes updates from the past one week.Databricks platform release notesMarch 13 - 17, 2023Execute SQL cells in the notebook in parallelYou can now run SQL cells in Databricks noteboo...

  • 7982 Views
  • 0 replies
  • 2 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels