<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Delta sharing vs CosmosDB in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102467#M4657</link>
    <description>&lt;H3 class="_1jeaq5e0 _1t7bu9h9 heading3"&gt;Costs:&lt;/H3&gt;
&lt;OL&gt;
&lt;LI&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Egress Costs&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;Delta Sharing&lt;/STRONG&gt;: Within the same region, Delta Sharing incurs no egress costs. However, sharing data across different regions or clouds may result in egress fees charged by the cloud provider. Databricks supports sharing from Cloudflare R2, which incurs no egress fees. Tools and recommendations are available to monitor and avoid egress fees (source: "Monitor and manage Delta Sharing egress costs (for providers)").&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;CosmosDB&lt;/STRONG&gt;: Egress costs apply when data is read from CosmosDB and transferred out of the Azure region. The exact costs depend on the amount of data transferred and the regions involved.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;STRONG&gt;Computation Costs&lt;/STRONG&gt;:&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;Delta Sharing&lt;/STRONG&gt;: The data provider incurs storage costs for Delta or Parquet files, but there are no compute costs on the provider's side for sharing data. The recipient incurs data processing costs when they access and process the shared data (source: "Delta Sharing Cost Breakdown").&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;CosmosDB&lt;/STRONG&gt;: Costs are associated with the provisioned throughput (RU/s) for read and write operations, as well as storage costs.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3 class="_1jeaq5e0 _1t7bu9h9 heading3"&gt;Performance Issues:&lt;/H3&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;Delta Sharing&lt;/STRONG&gt;: There can be performance issues when accessing the Delta Sharing table concurrently through API calls. For example, if the number of API requests exceeds the predefined rate limit, it can result in "Too Many Requests" errors (HTTP 429).&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;CosmosDB&lt;/STRONG&gt;: CosmosDB is designed to handle high throughput and low latency for concurrent operations. However, performance can be impacted if the provisioned throughput is not sufficient for the workload&lt;/LI&gt;
&lt;/UL&gt;</description>
    <pubDate>Wed, 18 Dec 2024 11:39:50 GMT</pubDate>
    <dc:creator>Walter_C</dc:creator>
    <dc:date>2024-12-18T11:39:50Z</dc:date>
    <item>
      <title>Delta sharing vs CosmosDB</title>
      <link>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102452#M4656</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;We have a situation where we write data to CosmosDB and create JSON data for a transaction table, which includes a mini statement in JSON format.&lt;/P&gt;&lt;P&gt;Now, we want to introduce the concept of delta sharing and share the transaction table. The Java application will access the delta table to generate JSON data, and there could be hundreds of API calls to the delta sharing table.&lt;/P&gt;&lt;P&gt;We would like to understand the costs involved in this scenario when comparing delta sharing(computation costs, egress cost etc ) to CosmosDB. Additionally, we want to know if there are any performance issues when accessing the delta sharing table concurrently through API calls to generate JSON data.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Phani&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Dec 2024 07:09:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102452#M4656</guid>
      <dc:creator>Phani1</dc:creator>
      <dc:date>2024-12-18T07:09:42Z</dc:date>
    </item>
    <item>
      <title>Re: Delta sharing vs CosmosDB</title>
      <link>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102467#M4657</link>
      <description>&lt;H3 class="_1jeaq5e0 _1t7bu9h9 heading3"&gt;Costs:&lt;/H3&gt;
&lt;OL&gt;
&lt;LI&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;SPAN&gt;&lt;STRONG&gt;Egress Costs&lt;/STRONG&gt;:&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;Delta Sharing&lt;/STRONG&gt;: Within the same region, Delta Sharing incurs no egress costs. However, sharing data across different regions or clouds may result in egress fees charged by the cloud provider. Databricks supports sharing from Cloudflare R2, which incurs no egress fees. Tools and recommendations are available to monitor and avoid egress fees (source: "Monitor and manage Delta Sharing egress costs (for providers)").&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;CosmosDB&lt;/STRONG&gt;: Egress costs apply when data is read from CosmosDB and transferred out of the Azure region. The exact costs depend on the amount of data transferred and the regions involved.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;STRONG&gt;Computation Costs&lt;/STRONG&gt;:&lt;/P&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;Delta Sharing&lt;/STRONG&gt;: The data provider incurs storage costs for Delta or Parquet files, but there are no compute costs on the provider's side for sharing data. The recipient incurs data processing costs when they access and process the shared data (source: "Delta Sharing Cost Breakdown").&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;CosmosDB&lt;/STRONG&gt;: Costs are associated with the provisioned throughput (RU/s) for read and write operations, as well as storage costs.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3 class="_1jeaq5e0 _1t7bu9h9 heading3"&gt;Performance Issues:&lt;/H3&gt;
&lt;UL class="_1t7bu9h7 _1t7bu9h2"&gt;
&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;Delta Sharing&lt;/STRONG&gt;: There can be performance issues when accessing the Delta Sharing table concurrently through API calls. For example, if the number of API requests exceeds the predefined rate limit, it can result in "Too Many Requests" errors (HTTP 429).&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;CosmosDB&lt;/STRONG&gt;: CosmosDB is designed to handle high throughput and low latency for concurrent operations. However, performance can be impacted if the provisioned throughput is not sufficient for the workload&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Wed, 18 Dec 2024 11:39:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102467#M4657</guid>
      <dc:creator>Walter_C</dc:creator>
      <dc:date>2024-12-18T11:39:50Z</dc:date>
    </item>
    <item>
      <title>Re: Delta sharing vs CosmosDB</title>
      <link>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102762#M4665</link>
      <description>&lt;P&gt;Thanks for&amp;nbsp; your reply,&lt;/P&gt;&lt;P&gt;Right now, the team is transferring data from Databricks to Cosmos DB, and then they're using REST APIs to access that data. They handle about 100 requests per minute, with some tables needing around 100 requests per second due to high transactions.&lt;/P&gt;&lt;P&gt;However, the customer wants to move away from Cosmos DB because it requires a lot of effort to write the data,&lt;BR /&gt;and it's becoming too expensive. They are looking for alternative ways to access the delta table data using REST APIs through Java. Can you please provide any approach on this.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Dec 2024 09:24:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102762#M4665</guid>
      <dc:creator>Phani1</dc:creator>
      <dc:date>2024-12-20T09:24:48Z</dc:date>
    </item>
    <item>
      <title>Re: Delta sharing vs CosmosDB</title>
      <link>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102783#M4666</link>
      <description>&lt;P&gt;You could use a JDBC connection to connect to a cluster or SQL warehouse and from there you could run your SQL commands to query the delta tables&amp;nbsp;&lt;A href="https://docs.databricks.com/en/integrations/jdbc/index.html" target="_blank"&gt;https://docs.databricks.com/en/integrations/jdbc/index.html&lt;/A&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 20 Dec 2024 12:14:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/delta-sharing-vs-cosmosdb/m-p/102783#M4666</guid>
      <dc:creator>Walter_C</dc:creator>
      <dc:date>2024-12-20T12:14:06Z</dc:date>
    </item>
  </channel>
</rss>

