<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to create connection between Databricks &amp; BigQuery in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19496#M13073</link>
    <description>&lt;P&gt;without the pointy brackets. they are placeholders for values.&lt;/P&gt;&lt;P&gt;so unless you want to enter a variable which you already declared (like credentials in your example), put the double quotes.&lt;/P&gt;</description>
    <pubDate>Thu, 01 Dec 2022 11:46:22 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-12-01T11:46:22Z</dc:date>
    <item>
      <title>How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19489#M13066</link>
      <description>&lt;P&gt;Hi, &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I would like to connect our BigQuery env to Databricks, So I created a service account but where should I configure the service account in Databricks? I read databricks documention and it`s not clear at all. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your help &lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 09:42:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19489#M13066</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T09:42:12Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19490#M13067</link>
      <description>&lt;P&gt;&lt;A href="https://docs.databricks.com/external-data/bigquery.html" target="test_blank"&gt;https://docs.databricks.com/external-data/bigquery.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can you elaborate what is not clear?&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:17:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19490#M13067</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-12-01T11:17:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19491#M13068</link>
      <description>&lt;P&gt;yeah, part number 2 - setup Databricks, there is the below code &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;credentials &amp;lt;base64-keys&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.google.cloud.auth.service.account.enable true&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.auth.service.account.email &amp;lt;client_email&amp;gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.project.id &amp;lt;project_id&amp;gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.auth.service.account.private.key &amp;lt;private_key&amp;gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.auth.service.account.private.key.id &amp;lt;private_key_id&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;what should it replace instead of &amp;lt;base64-keys&amp;gt; ? the google service account key (json) ? if yes what part of it ? &lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:21:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19491#M13068</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T11:21:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19492#M13069</link>
      <description>&lt;P&gt;Here are the docs : &lt;A href="https://docs.databricks.com/external-data/bigquery.html?_ga=2.254305484.510683761.1669885489-463474086.1669885489" target="test_blank"&gt;https://docs.databricks.com/external-data/bigquery.html?_ga=2.254305484.510683761.1669885489-463474086.1669885489&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:21:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19492#M13069</guid>
      <dc:creator>mcwir</dc:creator>
      <dc:date>2022-12-01T11:21:20Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19493#M13070</link>
      <description>&lt;P&gt;I familar with this doc, it is not clear (please find my previous comment) &lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:23:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19493#M13070</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T11:23:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19494#M13071</link>
      <description>&lt;P&gt;the base64-keys is generated from the json key file:&lt;/P&gt;&lt;P&gt;&lt;I&gt;To configure a cluster to access BigQuery tables, you must provide your JSON key file as a Spark configuration. Use a local tool to Base64-encode your JSON key file. For security purposes do not use a web-based or remote tool that could access your keys.&lt;/I&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The JSON key file is created right above the following section:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/external-data/bigquery.html#create-a-google-cloud-storage-gcs-bucket-for-temporary-storage" target="test_blank"&gt;https://docs.databricks.com/external-data/bigquery.html#create-a-google-cloud-storage-gcs-bucket-for-temporary-storage&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:24:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19494#M13071</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-12-01T11:24:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19495#M13072</link>
      <description>&lt;P&gt;So basically it should look like this : &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;credentials &amp;lt;adfasdfsadfadsfsdafsd&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.google.cloud.auth.service.account.enable true&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.auth.service.account.email &amp;lt;user@service.com&amp;gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.project.id &amp;lt;project-dd&amp;gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.auth.service.account.private.key &amp;lt;fdsfsdfsdgfd&amp;gt;&lt;/P&gt;&lt;P&gt;spark.hadoop.fs.gs.auth.service.account.private.key.id &amp;lt;gsdfgsdgdsg&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;? Do I need to add "" ? &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:35:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19495#M13072</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T11:35:22Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19496#M13073</link>
      <description>&lt;P&gt;without the pointy brackets. they are placeholders for values.&lt;/P&gt;&lt;P&gt;so unless you want to enter a variable which you already declared (like credentials in your example), put the double quotes.&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 11:46:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19496#M13073</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-12-01T11:46:22Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19497#M13074</link>
      <description>&lt;P&gt;Thanks werners.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;it now working, when I'm runnning the below script:&lt;/P&gt;&lt;P&gt;df = spark.read.format("bigquery").option("table","sandbox.test").load()&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;im getting the below error: &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 12:17:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19497#M13074</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T12:17:42Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19498#M13075</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image.png"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/1073i429A3A3820C13AC3/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="image.png"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/1066iB1E5B6290AD316AB/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 12:18:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19498#M13075</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T12:18:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19499#M13076</link>
      <description>&lt;P&gt;are you sure the path to the table is correct?&lt;/P&gt;&lt;P&gt;the example is a bit different:&lt;/P&gt;&lt;P&gt;"bigquery-public-data.samples.shakespeare"&lt;/P&gt;&lt;P&gt;&amp;lt;catalog&amp;gt;.&amp;lt;db&amp;gt;.&amp;lt;table&amp;gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 12:25:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19499#M13076</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-12-01T12:25:25Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19500#M13077</link>
      <description>&lt;P&gt;I also changed the path to "test_proj.sandbox.test". &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;the error is :&lt;/P&gt;&lt;P&gt;A project ID is required for this service but could not be determined from the builder or the environment. Please set a project ID using the builder.&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 12:33:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19500#M13077</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T12:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19501#M13078</link>
      <description>&lt;P&gt;I guess something still has to be configured on BigQuery.&lt;/P&gt;&lt;P&gt;can you check this thread?&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/GoogleCloudDataproc/spark-bigquery-connector/issues/40" target="test_blank"&gt;https://github.com/GoogleCloudDataproc/spark-bigquery-connector/issues/40&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 12:38:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19501#M13078</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-12-01T12:38:05Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19502#M13079</link>
      <description>&lt;P&gt;Works &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;/P&gt;&lt;P&gt;Thanks werners, many thanks .&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 12:43:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19502#M13079</guid>
      <dc:creator>519776</dc:creator>
      <dc:date>2022-12-01T12:43:23Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19503#M13080</link>
      <description>&lt;P&gt;@kfiry​&amp;nbsp;adding to @Werner Stinckens​&amp;nbsp;did you added projectid in read spark query , projectid should be one where big query instance running. also please follow best practices in terms of egress data cost &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;spark.read.format("bigquery") \&lt;/P&gt;&lt;P&gt;  .option("table", table) \&lt;/P&gt;&lt;P&gt;  .option("project", &amp;lt;project-id&amp;gt;) \&lt;/P&gt;&lt;P&gt;  .option("parentProject", &amp;lt;parent-project-id&amp;gt;) \&lt;/P&gt;&lt;P&gt;  .load()&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 01 Dec 2022 21:24:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19503#M13080</guid>
      <dc:creator>karthik_p</dc:creator>
      <dc:date>2022-12-01T21:24:09Z</dc:date>
    </item>
    <item>
      <title>Re: How to create connection between Databricks &amp; BigQuery</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19504#M13081</link>
      <description>&lt;P&gt;Thank you. For me, setting parent project ID solved it. This is also in the documentation&lt;/P&gt;&lt;P&gt;spark.read.format("bigquery") \&lt;/P&gt;&lt;P&gt;  .option("table", table) \&lt;/P&gt;&lt;P&gt;  .option("project", &amp;lt;project-id&amp;gt;) \&lt;/P&gt;&lt;P&gt;  .option("parentProject", &amp;lt;parent-project-id&amp;gt;) \&lt;/P&gt;&lt;P&gt;  .load()&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I didn't have to set the various spark.hadoop.fs.gs config variables for the cluster, as it seemed content with the base64 credentials.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Feb 2023 03:53:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-create-connection-between-databricks-bigquery/m-p/19504#M13081</guid>
      <dc:creator>308655</dc:creator>
      <dc:date>2023-02-03T03:53:04Z</dc:date>
    </item>
  </channel>
</rss>

