tag

evaluating▌

5 indexed skills · max 10 per page

skills (5)

evaluating-candidates

refoundai/lenny-skills · Productivity

Structured hiring framework from 94 product leaders to make stronger candidate decisions. \n \n Apply 12 core principles covering reference checks, work trials, agency assessment, and T-shaped hiring to evaluate candidates systematically \n Use diagnostic questions to understand hiring stage, team gaps, and whether decisions are based on structured rubrics or intuition alone \n Challenge common biases like pedigree shortcuts, gut-feel-only decisions, and unicorn hiring; prioritize references, pa

evaluating-new-technology

refoundai/lenny-skills · Productivity

Framework for evaluating emerging technologies using insights from 22 product leaders. \n \n Start by clarifying the problem being solved, then assess technology maturity and stability for your specific use case \n Adopt a \"build and buy\" mindset: purchase tools for standard 90% functionality, build custom solutions for your unique 10% \n Prioritize mental bandwidth and core competencies over cost savings; constantly re-test assumptions about what new tools can actually do \n Design for modula

evaluating-code-models

davila7/claude-code-templates · Productivity

BigCode Evaluation Harness evaluates code generation models across 15+ benchmarks including HumanEval, MBPP, and MultiPL-E (18 languages).

evaluating-llms-harness

davila7/claude-code-templates · AI/ML

lm-evaluation-harness evaluates LLMs across 60+ academic benchmarks using standardized prompts and metrics.

evaluating-trade-offs

refoundai/lenny-skills · Productivity

Structured frameworks for evaluating competing options and making clearer trade-off decisions. \n \n Applies mental models from 40 product leaders covering decision context, constraint identification, cost quantification, and framework selection \n Core principles include optimizing for order-of-magnitude over precision, applying the \"would I start this today?\" test to avoid sunk cost fallacy, and using weighted criteria matrices for multi-factor decisions \n Helps surface hidden costs like ma