<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Recommended database when using R in databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26112#M18231</link>
    <description>&lt;P&gt;Hi, have you found how to convert it?&lt;/P&gt;</description>
    <pubDate>Mon, 18 Apr 2022 09:30:16 GMT</pubDate>
    <dc:creator>Hubert-Dudek</dc:creator>
    <dc:date>2022-04-18T09:30:16Z</dc:date>
    <item>
      <title>Recommended database when using R in databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26107#M18226</link>
      <description>&lt;P&gt;I'm new to integrating the sparklyr / R interface in databricks. In particular it appears  that sparklyr and R commands and functions are dependent upon the type of dataframe one is working with (hive, Spark R etc). Is there a recommend best practice as to which dataframe I should start with while working with R in databricks?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Jeff&lt;/P&gt;</description>
      <pubDate>Wed, 09 Mar 2022 15:57:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26107#M18226</guid>
      <dc:creator>Jeff1</dc:creator>
      <dc:date>2022-03-09T15:57:30Z</dc:date>
    </item>
    <item>
      <title>Re: Recommended database when using R in databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26108#M18227</link>
      <description>&lt;P&gt;Recommended is delta format in data lake. Here is code example &lt;A href="https://docs.databricks.com/delta/quick-start.html#language-r" target="test_blank"&gt;https://docs.databricks.com/delta/quick-start.html#language-r&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Mar 2022 16:00:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26108#M18227</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-03-09T16:00:38Z</dc:date>
    </item>
    <item>
      <title>Re: Recommended database when using R in databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26109#M18228</link>
      <description>&lt;P&gt;Ok then as I'm reading through the reference material I'm not finding how to convert a Hive table to the delta format. I'm assuming my initial data is a Hive table as I've had to use tbl() to read in the data. Would I simply us a SQL statement to read in the data as a delta table then write it back out? &lt;/P&gt;</description>
      <pubDate>Wed, 09 Mar 2022 17:26:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26109#M18228</guid>
      <dc:creator>Jeff1</dc:creator>
      <dc:date>2022-03-09T17:26:28Z</dc:date>
    </item>
    <item>
      <title>Re: Recommended database when using R in databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26110#M18229</link>
      <description>&lt;P&gt;Hi, if your hive table is registered in metastore yes you can use SQL syntax.&lt;/P&gt;&lt;P&gt;Than is enough to use COPY INTO..&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;if your table is not registered please map it in metastore&lt;/P&gt;&lt;P&gt;CREATE TABLE IF NOT EXISTS tableName (fields) USING data_format LOCATION (path=)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;then you can create another table USING delta format and than copy between tables.&lt;/P&gt;</description>
      <pubDate>Wed, 09 Mar 2022 18:17:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26110#M18229</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-03-09T18:17:12Z</dc:date>
    </item>
    <item>
      <title>Re: Recommended database when using R in databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26111#M18230</link>
      <description>&lt;P&gt;@Hubert Dudek​&amp;nbsp;, Ok - that's helpful. As I'm reading the databricks documentation it appears when I'm reading in my file using the sparklyr tbl() function in databrick it returns a sparklyr&lt;/P&gt;&lt;P&gt;object ("tbl_spark" "tbl_sql" "tbl_lazy" "tbl ''). So does your previous reply still hold true. Either way based upon you oridginal reploy it woudl be to my benefir to convert the sparklyr object into a delta table - yes. If that's true that's what I'm seeking in the documentation or how to do that.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Mar 2022 18:55:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26111#M18230</guid>
      <dc:creator>Jeff1</dc:creator>
      <dc:date>2022-03-09T18:55:45Z</dc:date>
    </item>
    <item>
      <title>Re: Recommended database when using R in databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26112#M18231</link>
      <description>&lt;P&gt;Hi, have you found how to convert it?&lt;/P&gt;</description>
      <pubDate>Mon, 18 Apr 2022 09:30:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/recommended-database-when-using-r-in-databricks/m-p/26112#M18231</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-04-18T09:30:16Z</dc:date>
    </item>
  </channel>
</rss>

