Confuse about large memory usage of cluster

Data Engineering

Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

We set up a demo DLT pipeline with no data involved:

@Dlt.table(
    name="demo"
)

def sample():
    df = spark.sql("SELECT 'silver' as Layer")
    return df

However, when we check the metric of the cluster, it looks like 10GB memory has already been used which doesn’t make sense.

I noticed that the access mode for the cluster is “shard”. Does this mean the 10GB memory was consumed by other users maybe?

If so, do we use the cluster at the same time or do I take over this one after the other user finishes?

0 REPLIES 0

never-displayed

You must be signed in to add attachments

never-displayed

Announcements

Databricks Named a Leader in the 2024 Gartner® Magic Quadrant™ for Cloud Database Management Systems

Announcing the new Meta Llama 3.3 model on Databricks

Milestone: DatabricksTV Reaches 100 Videos!

Dotmatics and Databricks Partner to Advance Scientific Intelligence in Life Sciences

Databricks Community Champion - December 2024 - Sujesh Menon

Databricks Community

Confuse about large memory usage of cluster

Connect with Databricks Users in Your Area

Databricks Named a Leader in the 2024 Gartner® Magic Quadrant™ for Cloud Database Management Systems

Announcing the new Meta Llama 3.3 model on Databricks

Milestone: DatabricksTV Reaches 100 Videos!

Dotmatics and Databricks Partner to Advance Scientific Intelligence in Life Sciences

Databricks Community Champion - December 2024 - Sujesh Menon