
Access file location problem

semi
New Contributor II
import pandas as pd
from apiclient.discovery import build
from oauth2client.service_account import ServiceAccountCredentials

# Spark can read the key file from DBFS without any problem.
df = spark.read.json("/FileStore/tables/cert.json")

SCOPES = ['https://www.googleapis.com/auth/analytics.readonly']
KEY_FILE_LOCATION = "/FileStore/tables/cert.json"
VIEW_ID = '####'


def initialize_analyticsreporting():
  # Build the Analytics Reporting API v4 service object from the service-account key file.
  credentials = ServiceAccountCredentials.from_json_keyfile_name(KEY_FILE_LOCATION, SCOPES)
  analytics = build('analyticsreporting', 'v4', credentials=credentials)
  return analytics


def get_report(analytics):
  # Query sessions by country for the last 7 days.
  return analytics.reports().batchGet(
      body={
        'reportRequests': [
        {
          'viewId': VIEW_ID,
          'dateRanges': [{'startDate': '7daysAgo', 'endDate': 'today'}],
          'metrics': [{'expression': 'ga:sessions'}],
          'dimensions': [{'name': 'ga:country'}]
        }]
      }
  ).execute()


def print_response(response):
  # Print each dimension/metric pair from the API response.
  for report in response.get('reports', []):
    columnHeader = report.get('columnHeader', {})
    dimensionHeaders = columnHeader.get('dimensions', [])
    metricHeaders = columnHeader.get('metricHeader', {}).get('metricHeaderEntries', [])

    for row in report.get('data', {}).get('rows', []):
      dimensions = row.get('dimensions', [])
      dateRangeValues = row.get('metrics', [])

      for header, dimension in zip(dimensionHeaders, dimensions):
        print(header + ': ', dimension)

      for i, values in enumerate(dateRangeValues):
        print('Date range:', str(i))
        for metricHeader, value in zip(metricHeaders, values.get('values')):
          print(metricHeader.get('name') + ':', value)


def main():
  analytics = initialize_analyticsreporting()
  global df_response
  response = get_report(analytics)
  df_response = pd.DataFrame(list(response))

if __name__ == '__main__':
  main()
df_response

I'm getting an error when I try to access the file location where I stored my credentials.

Error:

FileNotFoundError: [Errno 2] No such file or directory: '/FileStore/tables/cert.json'

But I can open the file on its own without any issue:

df = spark.read.json("/FileStore/tables/cert.json")

display(df)

3 REPLIES

-werners-
Esteemed Contributor III

Looks like it is because oauth2client.service_account does not know about DBFS (whereas Spark does).

Is it an option to manage your secrets in Databricks?

https://docs.databricks.com/security/secrets/secrets.html
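
A minimal sketch of that idea, assuming secret scopes are available in your workspace and that the service-account JSON is stored in a hypothetical scope ga-creds under the key cert-json:

import json
from oauth2client.service_account import ServiceAccountCredentials

SCOPES = ['https://www.googleapis.com/auth/analytics.readonly']

# Hypothetical scope/key names; create them with the Databricks CLI or Secrets API.
key_json = dbutils.secrets.get(scope="ga-creds", key="cert-json")

# from_json_keyfile_dict takes the parsed key as a dict, so no file path has to be resolved.
credentials = ServiceAccountCredentials.from_json_keyfile_dict(json.loads(key_json), SCOPES)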

Hi @Reddy Massidi (semi),

Just a friendly follow-up. Did you try to follow @Werner Stinckens's recommendation? Do you still have issues? Please let us know.

semi
New Contributor II

Thank you, you are right: ServiceAccountCredentials.from_json_keyfile_name doesn't know about DBFS. I'm on Databricks Community Edition, so I assigned the key to a variable called key_variable and passed it into this function:

ServiceAccountCredentials.from_json_keyfile_dict(key_variable, SCOPES)

I know it's not recommended to assign the key directly in the code.
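
For reference, a minimal sketch of that workaround. Building key_variable by reading cert.json through Spark (which, unlike oauth2client, does understand DBFS) is an assumption here; any other source that yields the key as a dict would work the same way:

import json
from oauth2client.service_account import ServiceAccountCredentials

SCOPES = ['https://www.googleapis.com/auth/analytics.readonly']

# Read the whole key file as a single string via Spark, then parse it into a dict.
key_text = spark.read.text("/FileStore/tables/cert.json", wholetext=True).first()[0]
key_variable = json.loads(key_text)

# from_json_keyfile_dict accepts the parsed key directly, so no local file path is needed.
credentials = ServiceAccountCredentials.from_json_keyfile_dict(key_variable, SCOPES)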
