<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Unable to convert R dataframe to spark dataframe in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110800#M3973</link>
    <description>&lt;P&gt;Thank you for your response, I just tried this line, it did not work:&lt;BR /&gt;&lt;SPAN&gt;spark.sql("CREATE OR REPLACE TEMP VIEW matched_view AS SELECT * FROM matched_spark")&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;it gives me an error on my notebook saids spark.sql can't be found, by the way, I'm writing this in R cell.&lt;/P&gt;&lt;P&gt;However, the following syntax works:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;matched_spark %&amp;gt;% sparklyr::sdf_register(&lt;/SPAN&gt;&lt;SPAN&gt;"matched_view"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;and then I can use SQL on the next cell to work with matched_view&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 20 Feb 2025 22:32:27 GMT</pubDate>
    <dc:creator>Paddy_chu</dc:creator>
    <dc:date>2025-02-20T22:32:27Z</dc:date>
    <item>
      <title>Unable to convert R dataframe to spark dataframe</title>
      <link>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110798#M3971</link>
      <description>&lt;P&gt;Hi All,&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does anyone knows how to convert R dataframe to spark dataframe to Pandas dataframe? I wanted to get a Pandas dataframe ultimately but I guess I need to convert to spark first. I've been using this sparklyr library but my code did not work. This is the code I used in my R cell:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;%r&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;library&lt;/SPAN&gt;&lt;SPAN&gt;(sparklyr)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;library&lt;/SPAN&gt;&lt;SPAN&gt;(SparkR)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;sc = spark_connect(method = &lt;/SPAN&gt;&lt;SPAN&gt;"databricks"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;matched_rdf = psm_tbl %&amp;gt;% select(c(code_treat, code_control)) %&amp;gt;% data.frame()&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;# write the R dataframe to spark&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;matched_spark = copy_to(sc, matched_rdf, overwrite = TRUE)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;I suppose&amp;nbsp;matched_spark is spark dataframe already and on the next cell I write:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;select * from&amp;nbsp;matched_spark, but there's an error saids "matched_spark" object not found.&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Appreciate if anyone could help!&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 20 Feb 2025 22:17:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110798#M3971</guid>
      <dc:creator>Paddy_chu</dc:creator>
      <dc:date>2025-02-20T22:17:37Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to convert R dataframe to spark dataframe</title>
      <link>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110799#M3972</link>
      <description>&lt;P&gt;Hello &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/68565"&gt;@Paddy_chu&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Here's an updated version of the R code:&lt;/P&gt;
&lt;P class="p1"&gt;%r&lt;/P&gt;
&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;library(sparklyr)&lt;/P&gt;
&lt;P class="p1"&gt;library(SparkR)&lt;/P&gt;
&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;sc &amp;lt;- spark_connect(method = "databricks")&lt;/P&gt;
&lt;P class="p1"&gt;matched_rdf &amp;lt;- psm_tbl %&amp;gt;% select(c(code_treat, code_control)) %&amp;gt;% data.frame()&lt;/P&gt;
&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;# Write the R dataframe to Spark&lt;/P&gt;
&lt;P class="p1"&gt;matched_spark &amp;lt;- copy_to(sc, matched_rdf, overwrite = TRUE)&lt;/P&gt;
&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="p1"&gt;# Register the Spark DataFrame as a temporary view to query it using SQL&lt;/P&gt;
&lt;P class="p1"&gt;spark.sql("CREATE OR REPLACE TEMP VIEW matched_view AS SELECT * FROM matched_spark")&lt;/P&gt;</description>
      <pubDate>Thu, 20 Feb 2025 22:25:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110799#M3972</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-02-20T22:25:34Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to convert R dataframe to spark dataframe</title>
      <link>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110800#M3973</link>
      <description>&lt;P&gt;Thank you for your response, I just tried this line, it did not work:&lt;BR /&gt;&lt;SPAN&gt;spark.sql("CREATE OR REPLACE TEMP VIEW matched_view AS SELECT * FROM matched_spark")&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;it gives me an error on my notebook saids spark.sql can't be found, by the way, I'm writing this in R cell.&lt;/P&gt;&lt;P&gt;However, the following syntax works:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;matched_spark %&amp;gt;% sparklyr::sdf_register(&lt;/SPAN&gt;&lt;SPAN&gt;"matched_view"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;and then I can use SQL on the next cell to work with matched_view&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Feb 2025 22:32:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/unable-to-convert-r-dataframe-to-spark-dataframe/m-p/110800#M3973</guid>
      <dc:creator>Paddy_chu</dc:creator>
      <dc:date>2025-02-20T22:32:27Z</dc:date>
    </item>
  </channel>
</rss>

