Evaluate the tool calls according to the rubric and return the JSON score.
