- 1195 Views
- 2 replies
- 0 kudos
Hi,Does anyone know what's the difference of V3 exam for Databricks Certified Data Engineer Associate, comparing with V2?Looks like there is no practice exam for V3?Which version covers more stuff?Thanks,h_aloha
- 1195 Views
- 2 replies
- 0 kudos
Latest Reply
Hi @Helen Morgen Thank you for reaching out! Please submit a ticket to our Training Team here: https://help.databricks.com/s/contact-us?ReqType=training and our team will get back to you shortly.
1 More Replies
- 788 Views
- 2 replies
- 0 kudos
I have a table with a timestamp column (t) and a list of columns for which I would like to compute the difference over time (v), by some key(k): v_diff(t) = v(t)-v(t-1) for each k independently.Normally I would write:lag_window = Window.partitionBy(C...
- 788 Views
- 2 replies
- 0 kudos
Latest Reply
I found this but could not make it work https://www.databricks.com/blog/2022/10/18/python-arbitrary-stateful-processing-structured-streaming.html
1 More Replies
- 401 Views
- 1 replies
- 1 kudos
Difference between “ And ‘ in Spark Dataframe APIYou must tell your compiler that you want to represent a string inside a string using a different symbol for the inner string.Here is an example.“ Name = “HARI” “The above is wrong. Why? Because the in...
- 401 Views
- 1 replies
- 1 kudos
Latest Reply
sher
Valued Contributor II
- 1817 Views
- 5 replies
- 3 kudos
B1123451020-502,"","{""m"": {""difference"": 60}}","","","",2022-02-12T15:40:00.783Z
B1456741975-266,"","{""m"": {""difference"": 60}}","","","",2022-02-04T17:03:59.566Z
B1789753479-460,"","",",","","",2022-02-18T14:46:57.332Z
B1456741977-123,"","{""...
- 1817 Views
- 5 replies
- 3 kudos
Latest Reply
Hi @Tarique Anwer Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Than...
4 More Replies
- 1695 Views
- 3 replies
- 1 kudos
I have observed a very strange behavior with some of our integration pipelines. This week one of the csv files was getting broken when read with read function given below.def ReadCSV(files,schema_struct,header,delimiter,timestampformat,encode="utf8...
- 1695 Views
- 3 replies
- 1 kudos
Latest Reply
Hi @nafri A ,What is the error you are getting, can you share it please? Like @Hubert Dudek mentioned, both will call the same APIs
2 More Replies
- 1345 Views
- 1 replies
- 0 kudos
I see a significant performance difference when calling spark.sessionState.catalog.list compared to spark.catalog.list. Is that expected?
- 1345 Views
- 1 replies
- 0 kudos
Latest Reply
spark.sessionState.catalog.listTables is a more lazy implementation.. it does not pull the column details when listing the tables. Hence it's faster. Whereas catalog.listTables will pull the column details as well. If the database has many Delta tabl...
- 1719 Views
- 1 replies
- 0 kudos
Normalization typically means rescales the values into a range of [0,1].Standardization typically means rescales data to have a mean of 0 and a standard deviation of 1 (unit variance).
- 1719 Views
- 1 replies
- 0 kudos
Latest Reply
Normalization typically means rescales the values into a range of [0,1]. Standardization typically means rescales data to have a mean of 0 and a standard deviation of 1 (unit variance).A link which explains better is - https://towardsdatascience.com...