- 488 Views
- 0 replies
- 0 kudos
Queue based Autoloader processes files in the order they are received only when the job is up and running. However when the job is down, the files that queue up are processed in lexical order once the job is up. Since autoloader jobs need to be stopp...
If I run a query as "SELECT fare_amount FROM nyctaxi.trips where fare_amount > 1.5". The query results will be cached for 24 hours.I then compose a second query using the previous query as a subquery "SELECT * FROM nyctaxi.trips WHERE fare_amount IN...
Loved the delta live tables training
Loving the summit so far, awesome keynote speakers, great trainers and paid courses. Finished certification #databrickslearning
Hi Databricks community!I have previsouly worked on a project that easily could be optimized with Databricks. It is currently running on Azure Synapse, but the premise is the same.Ill describe the scenario here:1. Data owners send a constant flow of ...
How can I propagate a deletion to all tables where a customer requests to be removed from the database as part of the GDPR compliance ?
We use a python script that enables and removes access to tables based on role-group, but can be user as well. Also have a script that removes all access- can be executes in seconds.
Happy to be part of the data summit 2023. Wondering if DBSQL is enabled for DLT tables
Attended the Data and AI Summit 2023 and gained insights into the utility catalog and services that it has to offer, definitely going to try the data governance as it's a game changer.
Can I reference theUnity Catalog through my Glue Serverless jobs?
Hi,I want to convert column of XML strings to column of Json in PySpark., using withcolumn and xmltodict method as UDF, is giving Json with '=' instead of ':' in the dictionary. Please let me know if there is any alternative for this.
To convert a column of XML strings to a column of JSON in PySpark, you can use the `from_json` function along with the `xmltodict` library. However, instead of using a UDF with `withColumn`, you can use the `select` function to transform the column.
Having a great time at the community hub at the Summit. Highly recommend!
Today I walked into a session that talked about a fairly new language - Rust. The name can mislead you, I believe taking a look at the roots of how to best use CPU cycles is a game changer and Rust is traversing new areas that others might have ignor...
Yes, Rust is definitely part of the future. It brings performance and simplicity to us. I think it will add to the community, rather than replacing. Scala and R will never go away, Python will always be strong, but Rust gives us one other tool in ...
Attended training and few sessions.. great experience
How does one implement a databricks pipeline in a classified environment
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up NowUser | Count |
---|---|
1614 | |
771 | |
349 | |
286 | |
253 |