hamelsmu▌

Complete error analysis on RAG pipeline traces before selecting metrics. Inspect what was retrieved vs. what the model needed. Determine whether the problem is retrieval, generati…

evaluate rag

05Frontend

build-review-interface

hamelsmu/evals-skills·updated Apr 8, 2026

installs

Build an HTML page that loads traces from a data source (JSON/CSV file), displays one trace at a time with Pass/Fail buttons, a free-text notes field, and Next/Previous navigation…

build review interface

06Productivity

eval-audit

hamelsmu/evals-skills·updated Apr 8, 2026

installs

Inspect an LLM eval pipeline and produce a prioritized list of problems with concrete next steps.

eval audit

07Productivity

error-analysis

hamelsmu/evals-skills·updated Apr 8, 2026

installs

Guide the user through reading LLM pipeline traces and building a catalog of how the system fails.

error analysis