Hi, my exam got suspended in the middle of answering. My name is Koushik Nandiraju and my email is koushiknandiraju@gmail.com. Can you help me out, please? Here is the request ID: #00466317
This can be resolved by defining the schema structure explicitly and using that schema to read the file:

from pyspark.sql.types import StructType, StructField, StringType, IntegerType, ArrayType

# Define the schema according to the JSON structure
sch...
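As a rough illustration of that approach, here is a minimal sketch that defines an explicit schema and uses it to read a JSON file. The field names (id, name, tags) and the file path are placeholders and should be adjusted to match your actual JSON structure.

```python
from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, StringType, IntegerType, ArrayType

spark = SparkSession.builder.getOrCreate()

# Placeholder schema -- replace the fields with the ones in your JSON.
schema = StructType([
    StructField("id", IntegerType(), True),
    StructField("name", StringType(), True),
    StructField("tags", ArrayType(StringType()), True),
])

# Read with the explicit schema instead of letting Spark infer it.
df = spark.read.schema(schema).json("/path/to/input.json")
df.printSchema()
```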
Please give me a kudos if this works.

Efficiency in Data Collection: Using .collect() on large datasets can lead to out-of-memory errors, as it collects all rows to the driver node. If the dataset is large, consider alternatives such as extracting only...
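For illustration, a minimal sketch of a few such alternatives, assuming a large DataFrame loaded from a placeholder path and a placeholder column name (some_column): limit the rows brought back to the driver, aggregate on the cluster before collecting, or write the result out instead of collecting it.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Placeholder load -- substitute your own large DataFrame here.
df = spark.read.parquet("/path/to/large_dataset")

# 1. Bring back only a bounded preview instead of every row.
preview_rows = df.limit(100).collect()   # or df.take(100)

# 2. Reduce on the cluster first, then collect the (small) aggregate.
summary = df.groupBy("some_column").count().collect()

# 3. Keep the full result distributed by writing it out rather than collecting it.
df.write.mode("overwrite").parquet("/tmp/output/large_result")
```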
Hi Verr,

In short, it depends on how your child notebook is configured, but I would start with the following points:

Output Logging Settings: Check the logging settings for your notebook cells. Ensure that the cells are configured to display output. In...
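As a related sketch, assuming the child notebook is launched with dbutils.notebook.run: output printed in the child appears in the child's own run page rather than in the parent, so a common pattern is to return the value the parent needs explicitly with dbutils.notebook.exit. The notebook path and return value below are placeholders.

```python
# Parent notebook (sketch): launch the child and capture whatever it returns.
# The notebook path and timeout (in seconds) are placeholders.
result = dbutils.notebook.run("/path/to/child_notebook", 600)
print(result)

# --- In the child notebook (sketch) ---
# print() output shows up in the child's own run page, not in the parent,
# so return anything the parent needs explicitly, e.g.:
# dbutils.notebook.exit("rows_processed=42")
```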
This is what my medallion architecture looks like:

1) Bronze Layer - append raw data.
2) Silver Layer - reflects the current (active) data, and this is where I apply the business logic transformations. The Silver layer should serve as your cleaned and transformed staging area. H...
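For illustration, a minimal sketch of what the Bronze append and Silver refresh could look like, assuming Delta tables; the table names, key column, filter condition, and transformations are placeholders and not part of the original post.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

# Bronze: append the raw data as-is (placeholder source path and table name).
raw_df = spark.read.json("/path/to/landing/")
raw_df.write.format("delta").mode("append").saveAsTable("bronze.events_raw")

# Silver: keep only current/active records and apply business logic, then
# overwrite so the table reflects only the active data.
bronze_df = spark.read.table("bronze.events_raw")
silver_df = (
    bronze_df
    .dropDuplicates(["event_id"])                      # one row per key (placeholder key)
    .filter(F.col("status") == "active")               # placeholder "active" condition
    .withColumn("ingested_at", F.current_timestamp())  # example transformation
)
silver_df.write.format("delta").mode("overwrite").saveAsTable("silver.events_current")
```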