Hubert-Dudek
Databricks MVP

You need to join and pivot

df
.join(df2, on=[df.company == df2.job_company]))
.groupBy("company", "name")
.pivot("job_company")
.count()

image.png


My blog: https://databrickster.medium.com/