Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Databricks External table row maximum size

Mano99
New Contributor II

Hi Databricks Team/ Community,

We have created Databricks external tables on top of ADLS Gen2, as both Parquet and Delta tables. We are loading nested JSON structures into a table, and a few columns contain very large nested JSON data. I'm getting a "results too large" error, but transformations and other operations work fine; only displaying the data fails.
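
Roughly, our setup looks like the sketch below (the storage account, container, table, and column names are placeholders rather than our real ones):

```python
# Rough sketch of the setup; paths and names are placeholders.
# (`spark` and `display` are provided by the Databricks notebook environment.)

# The source files contain deeply nested JSON; a few fields can be very large per row.
raw_df = spark.read.json("abfss://raw@<storageaccount>.dfs.core.windows.net/events/")

# External Delta table on ADLS Gen2 (we also keep a Parquet variant).
(raw_df.write.format("delta")
    .mode("append")
    .option("path", "abfss://curated@<storageaccount>.dfs.core.windows.net/events_delta/")
    .saveAsTable("bronze.events_external"))

# Transformations run fine, but displaying the table fails with "results too large".
display(spark.table("bronze.events_external"))
```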

What I want to know is: what is the maximum size (in MB or GB) per row that Databricks can accept or store?
I saw some references on Google and from AI tools saying up to 2.5 GB. Is that true? If anyone knows the exact number, please share it here, and leave a comment on the issue above so I can understand it better.

Thanks & Regards,
Manohar G 

 

1 ACCEPTED SOLUTION

Accepted Solutions

dennis65
New Contributor II

Databricks/Spark can generally store rows up to around 2-2.5 GB; that practical ceiling comes from the JVM's roughly 2 GB limit on a single array, which Spark's internal row, string, and binary representations rely on. However, the "results too large" error you're seeing is a limitation on the driver node's ability to *display* large result sets, especially with huge nested JSON columns. To resolve this, avoid displaying the entire table directly; instead, use `.limit()`, filter for specific rows, project only the necessary columns, sample the data, or write the data to a file for external analysis (see the sketch below). The storage limit is separate from the display limitation.
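
A minimal sketch of those workarounds, assuming a table like the one in the question (table and column names are hypothetical):

```python
# Minimal sketch of the suggested workarounds; table and column names are hypothetical.
df = spark.table("bronze.events_external")

# Project only the columns you need and cap the number of rows returned to the driver.
display(df.select("event_id", "event_type", "event_ts").limit(100))

# Filter down to the specific rows you care about before displaying.
display(df.filter(df.event_type == "order_created").limit(100))

# Or inspect a small random sample instead of the whole table.
display(df.sample(fraction=0.001, seed=42).limit(100))

# For full inspection of the huge JSON columns, write the data out and analyze it externally.
df.write.mode("overwrite").json(
    "abfss://scratch@<storageaccount>.dfs.core.windows.net/events_debug/"
)
```

The common thread is that only a small, explicitly bounded result is ever collected back to the driver for display; the full rows stay in storage.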


2 REPLIES


Mano99
New Contributor II

Hi Dennis, just to confirm once more: the 2-2.5 GB limit is per row, right?
