For a research project, we are using the foundation model endpoints (provisioned and pay-per-token) for the gpt-oss models. We have been experiencing consistent tool call failures: gpt-oss-20b was failing ~1 out of every 12, while gpt-oss-120b was fai...