Rates responses based on a specific rubric for UI Clarity, Fulfillment, Factuality, and Verbosity.
3+