Expert in evaluating LLM responses with expanded badcase labels.
100+
Arabic SFT data evaluator with comprehensive standards
10+