<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Lakeflow Connect - Column filtering in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/121930#M46605</link>
    <description>&lt;P&gt;Hi community , I am interested in learning more about the feature that was mentioned in recent summit about query pushdown in lakeflow connect for SQL server. I believe this feature will allow to select only the required columns from source tables. Is there any indication when this feature will be available for trial? Until we wait for this feature are there other alternatives that anyone recommends?&lt;/P&gt;</description>
    <pubDate>Tue, 17 Jun 2025 06:45:05 GMT</pubDate>
    <dc:creator>amitpm</dc:creator>
    <dc:date>2025-06-17T06:45:05Z</dc:date>
    <item>
      <title>Lakeflow Connect - Column filtering</title>
      <link>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/121930#M46605</link>
      <description>&lt;P&gt;Hi community , I am interested in learning more about the feature that was mentioned in recent summit about query pushdown in lakeflow connect for SQL server. I believe this feature will allow to select only the required columns from source tables. Is there any indication when this feature will be available for trial? Until we wait for this feature are there other alternatives that anyone recommends?&lt;/P&gt;</description>
      <pubDate>Tue, 17 Jun 2025 06:45:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/121930#M46605</guid>
      <dc:creator>amitpm</dc:creator>
      <dc:date>2025-06-17T06:45:05Z</dc:date>
    </item>
    <item>
      <title>Re: Lakeflow Connect - Column filtering</title>
      <link>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/122463#M46782</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/170062"&gt;@amitpm&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;According to the documentation, this feature is currently in Public Preview, so if your Databricks account has access to public preview features, you can reach out to support to enable it and start testing performance.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://docs.databricks.com/aws/en/query-federation/sql-server" target="_self"&gt;Setup guide for Lakehouse Federation with SQL Server&lt;/A&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://docs.databricks.com/aws/en/query-federation/sql-server#supported-pushdowns" target="_self"&gt;Supported pushdowns&lt;/A&gt; (predicates, projections, etc.)&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;While &lt;SPAN class=""&gt;&lt;STRONG&gt;Lakehouse Federation&lt;/STRONG&gt;&lt;/SPAN&gt; has some &lt;SPAN class=""&gt;&lt;STRONG&gt;performance limitations&lt;/STRONG&gt;&lt;/SPAN&gt;, it’s a solid alternative if you want to avoid using the Spark connector and prefer a declarative connection model.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://docs.databricks.com/aws/en/query-federation/#lakehouse-federation-vs-lakeflow-connect" target="_self"&gt;&lt;SPAN class=""&gt;Comparison between &lt;/SPAN&gt;&lt;STRONG&gt;Lakehouse Federation vs. LakeFlow Connect&lt;/STRONG&gt;&lt;/A&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;&lt;A href="https://docs.databricks.com/aws/en/query-federation/#limitations" target="_self"&gt;Known limitations of Lakehouse Federation&lt;/A&gt;&lt;/P&gt;&lt;P class=""&gt;Hope this helps, &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Isi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 22 Jun 2025 12:01:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/122463#M46782</guid>
      <dc:creator>Isi</dc:creator>
      <dc:date>2025-06-22T12:01:31Z</dc:date>
    </item>
    <item>
      <title>Re: Lakeflow Connect - Column filtering</title>
      <link>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/158406#M54701</link>
      <description>&lt;P class="wnfdntt _1ibi0s3f5 _1ibi0s3ce _1ibi0s3ea" data-pm-slice="1 1 []"&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/170062"&gt;@amitpm&lt;/a&gt;,&lt;/P&gt;
&lt;P class="wnfdntt _1ibi0s3f5 _1ibi0s3ce _1ibi0s3ea" data-pm-slice="1 1 []"&gt;Just closing the loop on this thread, and apologies for the very late follow-up.&lt;/P&gt;
&lt;P class="wnfdntt _1ibi0s3f5 _1ibi0s3ce _1ibi0s3ea"&gt;The feature being discussed here maps to &lt;A href="https://docs.databricks.com/aws/en/ingestion/lakeflow-connect/query-based-overview" rel="noopener noreferrer nofollow" target="_blank"&gt;Lakeflow Connect query-based connectors&lt;/A&gt; for SQL Server, and that capability is now available. Query-based connectors for SQL Server were announced in Public Preview in April 2026, and more recently, the team confirmed they were going GA and that the documentation had been updated accordingly.&lt;/P&gt;
&lt;P class="wnfdntt _1ibi0s3f5 _1ibi0s3ce _1ibi0s3ea"&gt;If your original goal was to avoid copying every column from the source, Lakeflow Connect now has documented support for &lt;A href="https://docs.databricks.com/gcp/en/ingestion/lakeflow-connect/column-selection" rel="noopener noreferrer nofollow" target="_blank"&gt;selecting columns to ingest&lt;/A&gt;, including include_columns and exclude_columns, and it also supports &lt;A href="https://docs.databricks.com/aws/en/ingestion/lakeflow-connect/row-filtering" rel="noopener noreferrer nofollow" target="_blank"&gt;row filtering&lt;/A&gt; for query-based connectors so filters can be applied at the source.&lt;/P&gt;
&lt;P class="wnfdntt _1ibi0s3f5 _1ibi0s3ce _1ibi0s3ea"&gt;So, in short, you no longer need to wait on this. If you are evaluating SQL Server ingestion, the query-based path is the most relevant option when you want scheduled incremental ingestion with a cursor column and more control over what data is brought across. If you need the traditional CDC or CT approach instead, the standard &lt;A href="https://docs.databricks.com/aws/en/ingestion/lakeflow-connect/sql-server-pipeline" rel="noopener noreferrer nofollow" target="_blank"&gt;SQL Server Lakeflow Connect connector&lt;/A&gt; is still available, and if the requirement is zero-copy access rather than ingestion, &lt;A href="https://docs.databricks.com/aws/en/query-federation/sql-server" rel="noopener noreferrer nofollow" target="_blank"&gt;Lakehouse Federation for SQL Server&lt;/A&gt; is also worth considering.&lt;/P&gt;
&lt;P class="p1"&gt;&lt;FONT size="2" color="#FF6600"&gt;&lt;STRONG&gt;&lt;I&gt;If this answer resolves your question, could you mark it as “Accept as Solution”? That helps other users quickly find the correct fix.&lt;/I&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;I&gt;&lt;/I&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Jun 2026 13:39:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/lakeflow-connect-column-filtering/m-p/158406#M54701</guid>
      <dc:creator>Ashwin_DSA</dc:creator>
      <dc:date>2026-06-05T13:39:45Z</dc:date>
    </item>
  </channel>
</rss>

