Bug? Genie Space benchmark wrongly evaluates "Score reason" as "Empty result" in agent mode
First, some context: I'm setting up a Genie Space and have defined multiple benchmark questions with business-validated SQL ground truths. Currently no configuration is defined apart from metadata and SQL joins. I haven't found any other way of reporting...