<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Schema issue while fetching data from oracle in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/schema-issue-while-fetching-data-from-oracle/m-p/103342#M41411</link>
    <description>&lt;P&gt;Thanks for your question!&lt;BR /&gt;To address schema issues when fetching Oracle data in Databricks, use JDBC schema inference to define data types programmatically or batch-cast columns dynamically after loading. For performance, enable predicate pushdown and partitioning during the read, minimizing the data load per query. If the trailing zeros or scientific notation persist during writes, configure specific decimalOptions or cast columns explicitly to maintain consistency. Hope it helps!&lt;/P&gt;</description>
    <pubDate>Fri, 27 Dec 2024 18:26:32 GMT</pubDate>
    <dc:creator>VZLA</dc:creator>
    <dc:date>2024-12-27T18:26:32Z</dc:date>
    <item>
      <title>Schema issue while fetching data from oracle</title>
      <link>https://community.databricks.com/t5/data-engineering/schema-issue-while-fetching-data-from-oracle/m-p/83820#M37011</link>
      <description>&lt;P&gt;I dont have the complete context of the issue.&lt;BR /&gt;&lt;BR /&gt;But Here it is what I know, a friend of mine facing this&lt;BR /&gt;""&lt;BR /&gt;I am fetching data from Oracle data in databricks using python.But every time i do it the schema gets changes&lt;BR /&gt;so if the column is of type decimal&amp;nbsp;&lt;SPAN&gt;for col value is 0.125 then its writting as 0.125000000&lt;BR /&gt;another example -&amp;nbsp;20240821 it is returning it as&amp;nbsp;20240821.0000000,&amp;nbsp;&lt;BR /&gt;if a column has value 0 it shows 0E-10. ""&lt;BR /&gt;&lt;BR /&gt;I suggested a solution that you can specify the data types while reading it ,he said&lt;BR /&gt;- I am having 15 tables and 150 columns each of approx size 20-40 million.&lt;BR /&gt;-&amp;nbsp;I am casting every column as string before query the data then its taking hell amount of time&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 14:32:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/schema-issue-while-fetching-data-from-oracle/m-p/83820#M37011</guid>
      <dc:creator>ashraf1395</dc:creator>
      <dc:date>2024-08-21T14:32:39Z</dc:date>
    </item>
    <item>
      <title>Re: Schema issue while fetching data from oracle</title>
      <link>https://community.databricks.com/t5/data-engineering/schema-issue-while-fetching-data-from-oracle/m-p/103342#M41411</link>
      <description>&lt;P&gt;Thanks for your question!&lt;BR /&gt;To address schema issues when fetching Oracle data in Databricks, use JDBC schema inference to define data types programmatically or batch-cast columns dynamically after loading. For performance, enable predicate pushdown and partitioning during the read, minimizing the data load per query. If the trailing zeros or scientific notation persist during writes, configure specific decimalOptions or cast columns explicitly to maintain consistency. Hope it helps!&lt;/P&gt;</description>
      <pubDate>Fri, 27 Dec 2024 18:26:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/schema-issue-while-fetching-data-from-oracle/m-p/103342#M41411</guid>
      <dc:creator>VZLA</dc:creator>
      <dc:date>2024-12-27T18:26:32Z</dc:date>
    </item>
  </channel>
</rss>

