Evaluation
2 skills with this tag
K-Dense-AI
Passed
Scholar Evaluation
Systematically evaluate scholarly work using the ScholarEval framework, providing structured assessment across research quality dimensions including problem formulation, methodology, analysis, and writing with quantitative scoring and actionable feedback.
ResearchAcademicEvaluation+3
902.6k
NeoLabHQ
Passed
Agent Evaluation
A comprehensive evaluation framework for assessing Claude Code agents, commands, and skills. Provides LLM-as-Judge implementation patterns, multi-dimensional rubrics, bias mitigation techniques, and metrics for measuring agent quality across instruction following, completeness, tool efficiency, reasoning, and coherence.
EvaluationQuality AssuranceLlm As Judge+3
70160