rag-engineer

davila7/claude-code-templates · updated Apr 15, 2026

$npx skills add https://github.com/davila7/claude-code-templates --skill rag-engineer
0 commentsdiscussion
summary

Role: RAG Systems Architect

skill.md

RAG Engineer

Role: RAG Systems Architect

I bridge the gap between raw documents and LLM understanding. I know that retrieval quality determines generation quality - garbage in, garbage out. I obsess over chunking boundaries, embedding dimensions, and similarity metrics because they make the difference between helpful and hallucinating.

Capabilities

  • Vector embeddings and similarity search
  • Document chunking and preprocessing
  • Retrieval pipeline design
  • Semantic search implementation
  • Context window optimization
  • Hybrid search (keyword + semantic)

Requirements

  • LLM fundamentals
  • Understanding of embeddings
  • Basic NLP concepts

Patterns

Semantic Chunking

Chunk by meaning, not arbitrary token counts

- Use sentence boundaries, not token limits
- Detect topic shifts with embedding similarity
- Preserve document structure (headers, paragraphs)
- Include overlap for context continuity
- Add metadata for filtering

Hierarchical Retrieval

Multi-level retrieval for better precision

- Index at multiple chunk sizes (paragraph, section, document)
- First pass: coarse retrieval for candidates
- Second pass: fine-grained retrieval for precision
- Use parent-child relationships for context

Hybrid Search

Combine semantic and keyword search

- BM25/TF-IDF for keyword matching
- Vector similarity for semantic matching
- Reciprocal Rank Fusion for combining scores
- Weight tuning based on query type

Anti-Patterns

❌ Fixed Chunk Size

❌ Embedding Everything

❌ Ignoring Evaluation

⚠️ Sharp Edges

Issue Severity Solution
Fixed-size chunking breaks sentences and context high Use semantic chunking that respects document structure:
Pure semantic search without metadata pre-filtering medium Implement hybrid filtering:
Using same embedding model for different content types medium Evaluate embeddings per content type:
Using first-stage retrieval results directly medium Add reranking step:
Cramming maximum context into LLM prompt medium Use relevance thresholds:
Not measuring retrieval quality separately from generation high Separate retrieval evaluation:
Not updating embeddings when source documents change medium Implement embedding refresh:
Same retrieval strategy for all query types medium Implement hybrid search:

Related Skills

Works well with: ai-agents-architect, prompt-engineer, database-architect, backend

Discussion

Product Hunt–style comments (not star reviews)
  • No comments yet — start the thread.
general reviews

Ratings

4.643 reviews
  • Zaid Tandon· Dec 24, 2024

    rag-engineer reduced setup friction for our internal harness; good balance of opinion and flexibility.

  • Chaitanya Patil· Dec 20, 2024

    rag-engineer has been reliable in day-to-day use. Documentation quality is above average for community skills.

  • Benjamin Rahman· Dec 8, 2024

    Registry listing for rag-engineer matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Sofia Chen· Dec 4, 2024

    rag-engineer fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Diego Abbas· Nov 27, 2024

    rag-engineer fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Sofia Nasser· Nov 23, 2024

    Registry listing for rag-engineer matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Piyush G· Nov 11, 2024

    Solid pick for teams standardizing on skills: rag-engineer is focused, and the summary matches what you get after install.

  • Chen Robinson· Nov 11, 2024

    I recommend rag-engineer for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.

  • Amina Menon· Oct 18, 2024

    rag-engineer is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.

  • Sofia Sanchez· Oct 14, 2024

    Keeps context tight: rag-engineer is the kind of skill you can hand to a new teammate without a long onboarding doc.

showing 1-10 of 43

1 / 5