Databricks Community

plynton · ‎10-08-2022

I tried pulling a single row from a .csv file using df.query().

However, the data being returned doesn't match what I'm expecting: it is pulling a different row. Here is my code:

    df = spark.read.option("header", True).csv(data_fldr + "config/CHRGConfig.csv")
    df = df.toPandas()
    hdrlist = list(df)
    xstr = "VERSION == \"" + "STPP" + "\""
    print(xstr)
    planlist = df.query(xstr)
    for zz in planlist:
        #
        # I'm looking for non-null values
        #
        if not pd.isnull(df.loc[0,zz]):
            print(zz)
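One possible explanation, offered as an assumption since the thread never shows the CSV or the eventual workaround: the loop iterates over the columns of planlist, but the null check reads df.loc[0, zz], which is row label 0 of the unfiltered frame. df.query() keeps the original index labels, so unless the STPP row happens to be the first row of the file, the check looks at a different row. The sketch below reproduces that with a small made-up frame (the column names RATE_A and RATE_B and all values are hypothetical) and reads from the filtered frame by position instead.

```python
import pandas as pd

# Hypothetical stand-in for CHRGConfig.csv; the real columns are not shown in the thread.
df = pd.DataFrame(
    {
        "VERSION": ["BASE", "STPP", "OTHER"],
        "RATE_A": ["1.0", None, "3.0"],
        "RATE_B": [None, "2.5", None],
    }
)

planlist = df.query('VERSION == "STPP"')

# query() keeps the original index labels, so the matching row is labelled 1, not 0.
# df.loc[0, col] therefore reads the first row of the unfiltered frame.
wrong = [col for col in planlist if not pd.isnull(df.loc[0, col])]

# Read from the filtered frame by position instead.
row = planlist.iloc[0]
right = [col for col in planlist.columns if not pd.isnull(row[col])]

print(wrong)  # ['VERSION', 'RATE_A']  (non-null columns of the BASE row)
print(right)  # ['VERSION', 'RATE_B']  (non-null columns of the STPP row)
```

If no row matches, planlist.iloc[0] raises an IndexError, so checking planlist.empty first may be worthwhile. Calling planlist.reset_index(drop=True) and then using planlist.loc[0, zz] would be another way to get the same result.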

Hubert-Dudek · ‎10-16-2022

Can you include a few rows of your CSV (at least one shouldn't be pulled, and one should)?

My blog: https://databrickster.medium.com/

plynton · ‎10-18-2022

Hubert - I've found a workaround for this, so we can close the discussion.

Thank you!

Anonymous · ‎11-16-2022

Hi @Peter Ott

Does @Hubert Dudek's response answer your question? If so, would you mark it as best so that other members can find the solution more quickly?

We'd love to hear from you.

Thanks!

Incorrect results with df.query()
