<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24633#M17154</link>
    <description>&lt;P&gt;I get it.  I think you have another understanding of 'container' than me.&lt;/P&gt;&lt;P&gt;Basically you are talking about 2 'tables', a realtime one and a batch one.&lt;/P&gt;&lt;P&gt;Whereas that is certainly an option, you could also pull it off in one single table using partitioning on date.  For the realtime queries you can then apply a filter on the partitioning column to apply partition pruning (only relevant partitions are selected).&lt;/P&gt;</description>
    <pubDate>Thu, 31 Mar 2022 09:54:22 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-03-31T09:54:22Z</dc:date>
    <item>
      <title>Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24624#M17145</link>
      <description>&lt;P&gt;Delta lake have 3 levels to maintain data quality ( bronze , silver and gold tables ) but this supports the reporting and&amp;nbsp;&lt;/P&gt;&lt;P&gt;BI solutions how does this supports the streaming analytics ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;example : I have an app that loads all the operational data in adls gen 2 and i want use this data&amp;nbsp;which will help me to support real time&amp;nbsp;&amp;nbsp;reporting needs and also i need to build a report with all the historical data stored in ADLS gen 2.&lt;/P&gt;&lt;P&gt;So For this do i need to store data into two different container of ADLS gen2 ? one container will have all history data and other will store&amp;nbsp;&amp;nbsp;&amp;nbsp;day of data to support real time reporting ?&amp;nbsp;&amp;nbsp;and for above&amp;nbsp;requirement i want make use of same spark cluster to support both real time reporting needs as well has descriptive&amp;nbsp;analytics with all history data.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 27 Mar 2022 01:58:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24624#M17145</guid>
      <dc:creator>BasavarajAngadi</dc:creator>
      <dc:date>2022-03-27T01:58:11Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24625#M17146</link>
      <description>&lt;P&gt;delta lake does not has levels.  The bronze/silver/gold is a concept to structure the  data lake.&lt;/P&gt;&lt;P&gt;You do not need a separate ADLS container, I would even advice against it. Why?  Well, you have to ingest the data anyway, so it's best to do this only once (one container).  Within a single container you can structure the data any way you want.  It does not have to be bronze/silver/gold but can be anything that works for you/your company.  I do suggest to foresee some kind of structure, it will help you in the long run.&lt;/P&gt;&lt;P&gt;For the reporting part: if you want to run reports, Databricks SQL is better than the classic spark clusters (which excel in engineering/ML).  Databricks SQL supports concurrency ofc.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 08:17:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24625#M17146</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-28T08:17:52Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24626#M17147</link>
      <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp; Then Do i need to use Databricks SQL end point to Connect to BI tools ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If yes then i should create separate Spark Runtime Cluster for it ?  So this means i need two spark clusters one to run ETL or ELT jobs and other one to Query Delta Tables ?&lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 08:23:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24626#M17147</guid>
      <dc:creator>BasavarajAngadi</dc:creator>
      <dc:date>2022-03-28T08:23:54Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24627#M17148</link>
      <description>&lt;P&gt;You certainly can connect BI tools to the 'classic' clusters.  That works fine, but databricks SQL is optimized for fast read queries (Analytics tools) .  So you will get better performance.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 08:42:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24627#M17148</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-28T08:42:50Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24628#M17149</link>
      <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp; So in this context how data bricks eliminates the need of lamda architecture? we anyway need two spark clusters one to process etl or elt jobs and other to connect the BI tools ( SQL end point )&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 14:00:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24628#M17149</guid>
      <dc:creator>BasavarajAngadi</dc:creator>
      <dc:date>2022-03-28T14:00:21Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24629#M17150</link>
      <description>&lt;P&gt;Not exactly.  Lambda basically is streaming + batch simultaneously.&lt;/P&gt;&lt;P&gt;You can achieve that with databricks.&lt;/P&gt;&lt;P&gt;ingest real time data into your blob storage, and run reports on that data.&lt;/P&gt;&lt;P&gt;Also transform the data into f.e. golden layer in batch and also report on that.&lt;/P&gt;&lt;P&gt;But you don't need separate storage for the realtime (or near realtime) and batch.&lt;/P&gt;&lt;P&gt;Having separate clusters for SQL workloads and ETL/ELT is a good idea, although technically not necessary.  But I would do it.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 14:06:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24629#M17150</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-28T14:06:12Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24630#M17151</link>
      <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp; i think we i need to connect with support team on this … need to connect SQL end point of data bricks to MicroStrategy .&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Databricks is our one hub going forward and I want to understand the elimination of lambda architecture.&lt;/P&gt;&lt;P&gt;though the storage is same ADLS gen 2 .....to do reporting for real time where Kafka stores always one day worth of data in the kafka cluster  or a week of data max to support real time analytics  so we are thinking of storing one day  of data in separate container and to support real time and years worth of data in separate container to support batch and historical  reporting.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Need to understand ....let us know how to connect support team &lt;/P&gt;&lt;P&gt;  &lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 16:22:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24630#M17151</guid>
      <dc:creator>BasavarajAngadi</dc:creator>
      <dc:date>2022-03-28T16:22:56Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24631#M17152</link>
      <description>&lt;P&gt;&lt;A href="https://docs.microsoft.com/en-us/azure/databricks/integrations/bi/microstrategy" alt="https://docs.microsoft.com/en-us/azure/databricks/integrations/bi/microstrategy" target="_blank"&gt;https://docs.microsoft.com/en-us/azure/databricks/integrations/bi/microstrategy&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Your proposed solution to have 2 containers (1 for real time and one for long term) is certainly possible, but not required.&lt;/P&gt;&lt;P&gt;You could land ALL data into a bronze layer (streaming or not) for example.  And move on from that layer. &lt;/P&gt;&lt;P&gt;The main attention point is that you should try to minimize the query load on the source system (so avoid ingesting the same data twice).&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 06:41:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24631#M17152</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-29T06:41:30Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24632#M17153</link>
      <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp; if i keep anything in one container how does that support batch and real time analytics ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Example : i have a container with delta table which has 2 years of data that supports BI reporting for historical data.&lt;/P&gt;&lt;P&gt;if i use same delta table how does that supports real time it unnecessary scans all 2 years data which is not needed and would definitely hamper performance.&lt;/P&gt;&lt;P&gt;to support real time i need have 1 or 2 days worth of data right ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;are you getting what i am addressing here ?  &lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 09:50:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24632#M17153</guid>
      <dc:creator>BasavarajAngadi</dc:creator>
      <dc:date>2022-03-29T09:50:10Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24633#M17154</link>
      <description>&lt;P&gt;I get it.  I think you have another understanding of 'container' than me.&lt;/P&gt;&lt;P&gt;Basically you are talking about 2 'tables', a realtime one and a batch one.&lt;/P&gt;&lt;P&gt;Whereas that is certainly an option, you could also pull it off in one single table using partitioning on date.  For the realtime queries you can then apply a filter on the partitioning column to apply partition pruning (only relevant partitions are selected).&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 09:54:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24633#M17154</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-31T09:54:22Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24634#M17155</link>
      <description>&lt;P&gt;@Werner Stinckens​&amp;nbsp; why i would do that ?  rather i will prefer storing the tables in two containers one supports for real time which holds only one day worth of data and one more container which  will hold all historical data.&lt;/P&gt;&lt;P&gt;This way can we not make BI work? that supports real time analytics and historical needs of data for reporting?  &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Please consider of connecting  traditional BI tool to modern data platform i.e Databricks .&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 10:44:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24634#M17155</guid>
      <dc:creator>BasavarajAngadi</dc:creator>
      <dc:date>2022-03-31T10:44:31Z</dc:date>
    </item>
    <item>
      <title>Re: Hi Experts i am new to data bricks and i want to know how data bricks supports real time reporting needs in Business intelligence?</title>
      <link>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24635#M17156</link>
      <description>&lt;P&gt;@Basavaraj Angadi​&amp;nbsp;, Why? For simplicity, cost savings etc.&lt;/P&gt;&lt;P&gt;You can make it work with 2 'containers' but it is not necessary.&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 10:45:58 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/hi-experts-i-am-new-to-data-bricks-and-i-want-to-know-how-data/m-p/24635#M17156</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-31T10:45:58Z</dc:date>
    </item>
  </channel>
</rss>

