grepai-embeddings-ollama

yoanbernabeu/grepai-skills · updated Apr 8, 2026

MDX-style export adds YAML metadata + attribution linking explainx.ai and this canonical listing URL.

$npx skills add https://github.com/yoanbernabeu/grepai-skills --skill grepai-embeddings-ollama
0 commentsdiscussion
summary

This skill covers using Ollama as the embedding provider for GrepAI, enabling 100% private, local code search.

skill.md

GrepAI Embeddings with Ollama

This skill covers using Ollama as the embedding provider for GrepAI, enabling 100% private, local code search.

When to Use This Skill

  • Setting up private, local embeddings
  • Choosing the right Ollama model
  • Optimizing Ollama performance
  • Troubleshooting Ollama connection issues

Why Ollama?

Advantage Description
🔒 Privacy Code never leaves your machine
💰 Free No API costs or usage limits
Speed No network latency
🔌 Offline Works without internet
🔧 Control Choose your model

Prerequisites

  1. Ollama installed and running
  2. An embedding model downloaded
# Install Ollama
brew install ollama  # macOS
# or
curl -fsSL https://ollama.com/install.sh | sh  # Linux

# Start Ollama
ollama serve

# Download model
ollama pull nomic-embed-text

Configuration

Basic Configuration

# .grepai/config.yaml
embedder:
  provider: ollama
  model: nomic-embed-text
  endpoint: http://localhost:11434

With Custom Endpoint

embedder:
  provider: ollama
  model: nomic-embed-text
  endpoint: http://192.168.1.100:11434  # Remote Ollama server

With Explicit Dimensions

embedder:
  provider: ollama
  model: nomic-embed-text
  endpoint: http://localhost:11434
  dimensions: 768  # Usually auto-detected

Available Models

Recommended: nomic-embed-text

ollama pull nomic-embed-text
Property Value
Dimensions 768
Size ~274 MB
Speed Fast
Quality Excellent for code
Language English-optimized

Configuration:

embedder:
  provider: ollama
  model: nomic-embed-text

Multilingual: nomic-embed-text-v2-moe

ollama pull nomic-embed-text-v2-moe
Property Value
Dimensions 768
Size ~500 MB
Speed Medium
Quality Excellent
Language Multilingual

Best for codebases with non-English comments/documentation.

Configuration:

embedder:
  provider: ollama
  model: nomic-embed-text-v2-moe

High Quality: bge-m3

ollama pull bge-m3
Property Value
Dimensions 1024
Size ~1.2 GB
Speed Slower
Quality Very high
Language Multilingual

Best for large, complex codebases where accuracy is critical.

Configuration:

embedder:
  provider: ollama
  model: bge-m3
  dimensions: 1024

Maximum Quality: mxbai-embed-large

ollama pull mxbai-embed-large
Property Value
Dimensions 1024
Size ~670 MB
Speed Medium
Quality Highest
Language English

Configuration:

embedder:
  provider: ollama
  model: mxbai-embed-large
  dimensions: 1024

Model Comparison

Model Dims Size Speed Quality Use Case
nomic-embed-text 768 274MB ⚡⚡⚡ ⭐⭐⭐ General use
nomic-embed-text-v2-moe 768 500MB ⚡⚡ ⭐⭐⭐⭐ Multilingual
bge-m3 1024 1.2GB ⭐⭐⭐⭐⭐ Large codebases
mxbai-embed-large 1024 670MB ⚡⚡ ⭐⭐⭐⭐⭐ Maximum accuracy

Performance Optimization

Memory Management

Models load into RAM. Ensure sufficient memory:

Model RAM Required
nomic-embed-text ~500 MB
nomic-embed-text-v2-moe ~800 MB
bge-m3 ~1.5 GB
mxbai-embed-large ~1 GB

GPU Acceleration

Ollama automatically uses:

  • macOS: Metal (Apple Silicon)
  • Linux/Windows: CUDA (NVIDIA GPUs)

Check GPU usage:

ollama ps

Keeping Model Loaded

By default, Ollama unloads models after 5 minutes of inactivity. Keep loaded:

# Keep model loaded indefinitely
curl http://localhost:11434/api/generate -d '{
  "model": "nomic-embed-text",
  "keep_alive": -1
}'

Verifying Connection

Check Ollama is Running

curl http://localhost:11434/api/tags

List Available Models

ollama list

Test Embedding

curl http://localhost:11434/api/embeddings -d '{
  "model": "nomic-embed-text",
  "prompt": "function authenticate(user, password)"
}'

Running Ollama as a Service

macOS (launchd)

Ollama app runs automatically on login.

Linux (systemd)

# Enable service
sudo systemctl enable ollama

# Start service
sudo systemctl start ollama

# Check status
sudo systemctl status ollama

Manual Background

nohup ollama serve > /dev/null 2>&1 &

Remote Ollama Server

Run Ollama on a powerful server and connect remotely:

On the Server

# Allow remote connections
OLLAMA_HOST=0.0.0.0 ollama serve

On the Client

# .grepai/config.yaml
embedder:
  provider: ollama
  model: nomic-embed-text
  endpoint: http://server-ip:11434

Common Issues

Problem: Connection refused ✅ Solution:

# Start Ollama
ollama serve

Problem: Model not found ✅ Solution:

# Pull the model
ollama pull nomic-embed-text

Problem: Slow embedding generation ✅ Solutions:

  • Use a smaller model (nomic-embed-text)
  • Ensure GPU is being used (ollama ps)
  • Close memory-intensive applications
  • Consider a remote server with better hardware

Problem: Out of memory ✅ Solutions:

  • Use a smaller model
  • Close other applications
  • Upgrade RAM
  • Use remote Ollama server

Problem: Embeddings differ after model update ✅ Solution: Re-index after model updates:

rm .grepai/index.gob
grepai watch

Best Practices

  1. Start with nomic-embed-text: Best balance of speed/quality
  2. Keep Ollama running: Background service recommended
  3. Match dimensions: Don't mix models with different dimensions
  4. Re-index on model change: Delete index and re-run watch
  5. Monitor memory: Embedding models use significant RAM

Output Format

Successful Ollama configuration:

✅ Ollama Embedding Provider Configured

   Provider: Ollama
   Model: nomic-embed-text
   Endpoint: http://localhost:11434
   Dimensions: 768 (auto-detected)
   Status: Connected

   Model Info:
   - Size: 274 MB
   - Loaded: Yes
   - GPU: Apple Metal
how to use grepai-embeddings-ollama

How to use grepai-embeddings-ollama on Cursor

AI-first code editor with Composer

1

Prerequisites

Before installing skills in Cursor, ensure your development environment meets these requirements:

  • Cursor installed and configured on your development machine
  • Node.js version 16.0+ with npm package manager (verify with node --version)
  • Active project directory or workspace where you want to add grepai-embeddings-ollama
2

Execute installation command

Execute the skills CLI command in your project's root directory to begin installation:

$npx skills add https://github.com/yoanbernabeu/grepai-skills --skill grepai-embeddings-ollama

The skills CLI fetches grepai-embeddings-ollama from GitHub repository yoanbernabeu/grepai-skills and configures it for Cursor.

3

Select Cursor when prompted

The CLI will show a list of available agents. Use arrow keys to navigate and space to select Cursor:

◆ Which agents do you want to install to?
│ ── Universal (.agents/skills) ── always included ────
│ • Amp
│ • Antigravity
│ • Cline
│ • Codex
│ ●Cursor(selected)
│ • Cursor
│ • Windsurf
4

Verify installation

Confirm successful installation by checking the skill directory location:

.cursor/skills/grepai-embeddings-ollama

Reload or restart Cursor to activate grepai-embeddings-ollama. Access the skill through slash commands (e.g., /grepai-embeddings-ollama) or your agent's skill management interface.

Security & Verification Notice

We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.

Skills execute code in your development environment. Always verify the publisher's identity, review recent commits, and test in isolated environments before production deployment.

List & Monetize Your Skill

Submit your Claude Code skill and start earning

GET_STARTED →

Use Cases

Task Automation & Efficiency

Automate repetitive workflows and reduce manual effort

Example

Generate reports, summarize documents, draft communications

Save 3-5 hours per week on routine tasks

Knowledge Enhancement

Learn new skills, understand complex topics, get expert guidance

Example

Explain concepts, provide examples, suggest learning resources

Accelerate learning and skill development by 2x

Quality Improvement

Enhance output quality through reviews, suggestions, and refinements

Example

Review drafts, suggest improvements, catch errors

Improve work quality by 30-40% with less effort

Implementation Guide

Prerequisites

  • Claude Desktop or compatible AI client with skill support
  • Clear understanding of task or problem to solve
  • Willingness to iterate and refine outputs

Time Estimate

15-45 minutes depending on use case complexity

Installation Steps

  1. 1.Install skill using provided installation command
  2. 2.Test with simple use case relevant to your work
  3. 3.Evaluate output quality and relevance
  4. 4.Iterate on prompts to improve results
  5. 5.Integrate into regular workflow if valuable

Common Pitfalls

  • Expecting perfect results without iteration
  • Not providing enough context in prompts
  • Using skill for tasks outside its intended scope
  • Accepting outputs without review and validation

Best Practices

✓ Do

  • +Start with clear, specific prompts
  • +Provide relevant context and constraints
  • +Review and refine all outputs before using
  • +Iterate to improve output quality
  • +Document successful prompt patterns

✗ Don't

  • Don't use without understanding skill limitations
  • Don't skip validation of outputs
  • Don't share sensitive information in prompts
  • Don't expect skill to replace human judgment

💡 Pro Tips

  • Be specific about desired format and style
  • Ask for multiple options to choose from
  • Request explanations to understand reasoning
  • Combine AI efficiency with human expertise

When to Use This

✓ Use When

Use when skill capabilities match your task, clear ROI on time saved, and you can validate outputs. Best for repetitive tasks, learning, and quality improvement.

✗ Avoid When

Avoid when task requires deep expertise you can't validate, involves sensitive decisions, or when learning process is more valuable than speed of completion.

Learning Path

  1. 1Familiarize yourself with skill capabilities and limitations
  2. 2Start with low-risk, non-critical tasks
  3. 3Progress to more complex and valuable use cases
  4. 4Build expertise through regular use and experimentation

Discussion

Product Hunt–style comments (not star reviews)
  • No comments yet — start the thread.
general reviews

Ratings

4.868 reviews
  • Kaira Menon· Dec 12, 2024

    grepai-embeddings-ollama fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Chen Abbas· Dec 8, 2024

    Registry listing for grepai-embeddings-ollama matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Chen Kapoor· Dec 4, 2024

    I recommend grepai-embeddings-ollama for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.

  • Xiao Okafor· Dec 4, 2024

    grepai-embeddings-ollama fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Amelia Brown· Nov 27, 2024

    Solid pick for teams standardizing on skills: grepai-embeddings-ollama is focused, and the summary matches what you get after install.

  • Aditi Desai· Nov 23, 2024

    Useful defaults in grepai-embeddings-ollama — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • Xiao Thompson· Nov 23, 2024

    We added grepai-embeddings-ollama from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • Nikhil Torres· Nov 3, 2024

    We added grepai-embeddings-ollama from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • Chen Rahman· Oct 22, 2024

    Solid pick for teams standardizing on skills: grepai-embeddings-ollama is focused, and the summary matches what you get after install.

  • Aditi Ghosh· Oct 18, 2024

    We added grepai-embeddings-ollama from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

showing 1-10 of 68

1 / 7