<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Method iterableAsScalaIterable does not exist Pydeequ in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26855#M18860</link>
    <description>&lt;P&gt;Hi @Jhonatan Reyes​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Just checking if you still need help on this issue? did @Werner Stinckens​&amp;nbsp;'s response helped? if it did, please mark it as best response.&lt;/P&gt;</description>
    <pubDate>Mon, 11 Apr 2022 20:53:17 GMT</pubDate>
    <dc:creator>jose_gonzalez</dc:creator>
    <dc:date>2022-04-11T20:53:17Z</dc:date>
    <item>
      <title>Method iterableAsScalaIterable does not exist Pydeequ</title>
      <link>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26851#M18856</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I'm using Databricks and pydeequ to build a QA step in structured streaming.&lt;/P&gt;&lt;P&gt;One of the Analyzers that I need to use is the Uniqueness.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If I try to add another one like Completeness, work properly, but if y add the Uniqueness I get an error:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;py4j.Py4JException: Method iterableAsScalaIterable([class java.lang.String]) does not exist&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Log:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;Py4JError                                 Traceback (most recent call last)
&amp;lt;[command-1299007449178928]()&amp;gt; in &amp;lt;module&amp;gt;
      1 from pydeequ.analyzers import *
      2 
----&amp;gt; 3 analysisResult = AnalysisRunner(spark) \
      4                     .onData(df) \
      5                     .addAnalyzer(Size()) \
&amp;nbsp;
/local_disk0/.ephemeral_nfs/envs/pythonEnv-3e8b052e-7811-4908-bcc3-79a52e659d2d/lib/python3.8/site-packages/pydeequ/analyzers.py in addAnalyzer(self, analyzer)
    132         """
    133         analyzer._set_jvm(self._jvm)
--&amp;gt; 134         _analyzer_jvm = analyzer._analyzer_jvm
    135         self._AnalysisRunBuilder.addAnalyzer(_analyzer_jvm)
    136         return self
&amp;nbsp;
/local_disk0/.ephemeral_nfs/envs/pythonEnv-3e8b052e-7811-4908-bcc3-79a52e659d2d/lib/python3.8/site-packages/pydeequ/analyzers.py in _analyzer_jvm(self)
    773         """
    774         return self._deequAnalyzers.Uniqueness(
--&amp;gt; 775             to_scala_seq(self._jvm, self.columns), self._jvm.scala.Option.apply(self.where)
    776         )
    777 
&amp;nbsp;
/local_disk0/.ephemeral_nfs/envs/pythonEnv-3e8b052e-7811-4908-bcc3-79a52e659d2d/lib/python3.8/site-packages/pydeequ/scala_utils.py in to_scala_seq(jvm, iterable)
     77         Scala sequence
     78     """
---&amp;gt; 79     return jvm.scala.collection.JavaConversions.iterableAsScalaIterable(iterable).toSeq()
     80 
     81 
&amp;nbsp;
/databricks/spark/python/lib/py4j-0.10.9.1-src.zip/py4j/java_gateway.py in __call__(self, *args)
   1302 
   1303         answer = self.gateway_client.send_command(command)
-&amp;gt; 1304         return_value = get_return_value(
   1305             answer, self.gateway_client, self.target_id, self.name)
   1306 
&amp;nbsp;
/databricks/spark/python/pyspark/sql/utils.py in deco(*a, **kw)
    115     def deco(*a, **kw):
    116         try:
--&amp;gt; 117             return f(*a, **kw)
    118         except py4j.protocol.Py4JJavaError as e:
    119             converted = convert_exception(e.java_exception)
&amp;nbsp;
/databricks/spark/python/lib/py4j-0.10.9.1-src.zip/py4j/protocol.py in get_return_value(answer, gateway_client, target_id, name)
    328                     format(target_id, ".", name), value)
    329             else:
--&amp;gt; 330                 raise Py4JError(
    331                     "An error occurred while calling {0}{1}{2}. Trace:\n{3}\n".
    332                     format(target_id, ".", name, value))
&amp;nbsp;
Py4JError: An error occurred while calling z:scala.collection.JavaConversions.iterableAsScalaIterable. Trace:
py4j.Py4JException: Method iterableAsScalaIterable([class java.lang.String]) does not exist
	at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:341)
	at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:362)
	at py4j.Gateway.invoke(Gateway.java:289)
	at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)
	at py4j.commands.CallCommand.execute(CallCommand.java:79)
	at py4j.GatewayConnection.run(GatewayConnection.java:251)
	at java.lang.Thread.run(Thread.java:748)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;To Reproduce&lt;/P&gt;&lt;P&gt;I'm using the example provided on the main page:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="155309688-d4d03acc-1012-42ec-8d40-9cbf4b8d12c3.png"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2066i580A884E9948E42E/image-size/large?v=v2&amp;amp;px=999" role="button" title="155309688-d4d03acc-1012-42ec-8d40-9cbf4b8d12c3.png" alt="155309688-d4d03acc-1012-42ec-8d40-9cbf4b8d12c3.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="155311239-2259d89e-e2b2-45c1-b57c-1a841ebe189e"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2080iD0956C4BD50EF4DD/image-size/large?v=v2&amp;amp;px=999" role="button" title="155311239-2259d89e-e2b2-45c1-b57c-1a841ebe189e" alt="155311239-2259d89e-e2b2-45c1-b57c-1a841ebe189e" /&gt;&lt;/span&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm using this version of:&lt;/P&gt;&lt;P&gt;Databricks:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="155309988-fd6ec25f-53ec-4f7a-a37a-e3596cefe10e"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2068i3D788BE95DE4C28E/image-size/large?v=v2&amp;amp;px=999" role="button" title="155309988-fd6ec25f-53ec-4f7a-a37a-e3596cefe10e" alt="155309988-fd6ec25f-53ec-4f7a-a37a-e3596cefe10e" /&gt;&lt;/span&gt;Any idea why is this happening?? &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Feb 2022 11:06:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26851#M18856</guid>
      <dc:creator>Jreco</dc:creator>
      <dc:date>2022-02-28T11:06:20Z</dc:date>
    </item>
    <item>
      <title>Re: Method iterableAsScalaIterable does not exist Pydeequ</title>
      <link>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26852#M18857</link>
      <description>&lt;P&gt;I think it is because you did not attach the libraries to the cluster.&lt;/P&gt;&lt;P&gt;When you work with a notebook, the sparksession is already created.&lt;/P&gt;&lt;P&gt;To add libraries, you should install them on the cluster (in the compute tab) using f.e. pypi/maven etc.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Feb 2022 12:57:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26852#M18857</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-02-28T12:57:19Z</dc:date>
    </item>
    <item>
      <title>Re: Method iterableAsScalaIterable does not exist Pydeequ</title>
      <link>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26853#M18858</link>
      <description>&lt;P&gt;Thanks for your replay @Werner Stinckens​&amp;nbsp;, I already hae the libraries installed:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/2067i7E891C5E99B15F74/image-size/large?v=v2&amp;amp;px=999" role="button" title="image" alt="image" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Feb 2022 13:09:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26853#M18858</guid>
      <dc:creator>Jreco</dc:creator>
      <dc:date>2022-02-28T13:09:28Z</dc:date>
    </item>
    <item>
      <title>Re: Method iterableAsScalaIterable does not exist Pydeequ</title>
      <link>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26854#M18859</link>
      <description>&lt;P&gt;ok can you try again without creating a sparksession?&lt;/P&gt;&lt;P&gt;​&lt;/P&gt;&lt;P&gt;It could also be pydeequ which is imcompatible, I have never used it.&lt;/P&gt;&lt;P&gt;But first let's try without the sparksession part.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Feb 2022 13:15:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26854#M18859</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-02-28T13:15:33Z</dc:date>
    </item>
    <item>
      <title>Re: Method iterableAsScalaIterable does not exist Pydeequ</title>
      <link>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26855#M18860</link>
      <description>&lt;P&gt;Hi @Jhonatan Reyes​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Just checking if you still need help on this issue? did @Werner Stinckens​&amp;nbsp;'s response helped? if it did, please mark it as best response.&lt;/P&gt;</description>
      <pubDate>Mon, 11 Apr 2022 20:53:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/method-iterableasscalaiterable-does-not-exist-pydeequ/m-p/26855#M18860</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2022-04-11T20:53:17Z</dc:date>
    </item>
  </channel>
</rss>

