evaluator
4 indexed skills · max 10 per page
validate-evaluator
hamelsmu/evals-skills · Productivity
Calibrate an LLM judge against human judgment.
tech-stack-evaluator
alirezarezvani/claude-skills · Productivity
../../../engineering-team/tech-stack-evaluator/SKILL.md
langsmith-evaluator
langchain-ai/langsmith-skills · Productivity
Build evaluation pipelines for LangSmith with LLM-as-Judge and custom code evaluators.
Three core components: creating evaluators (LLM-as-Judge or custom code), defining run functions to capture agent outputs and trajectories, and running evaluations locally or auto-running via uploaded evaluators.
Supports both offline evaluators (comparing run outputs to dataset examples) and online evaluators (real-time quality checks on production runs).
Requires LangSmith API key and project confi…
stock-evaluator-v3
sundial-org/awesome-openclaw-skills · Productivity
Every analysis MUST include ALL of these: