tag

evaluator

4 indexed skills · max 10 per page

skills (4)

validate-evaluator

hamelsmu/evals-skills · Productivity

Calibrate an LLM judge against human judgment.

tech-stack-evaluator

alirezarezvani/claude-skills · Productivity

../../../engineering-team/tech-stack-evaluator/SKILL.md

langsmith-evaluator

langchain-ai/langsmith-skills · Productivity

Build evaluation pipelines for LangSmith with LLM-as-Judge and custom code evaluators.
Three core components: creating evaluators (LLM-as-Judge or custom code), defining run functions to capture agent outputs and trajectories, and running evaluations locally or auto-running via uploaded evaluators
Supports both offline evaluators (comparing run outputs to dataset examples) and online evaluators (real-time quality checks on production runs)
Requires LangSmith API key and project confi

stock-evaluator-v3

sundial-org/awesome-openclaw-skills · Productivity

Every analysis MUST include ALL of these: