Confirm successful installation by checking the skill directory location:
.cursor/skills/embedding-strategies
Restart Cursor to activate embedding-strategies. Access via /embedding-strategies in your agent's command palette.
โ
Security Notice
We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.
Skills execute code in your environment. Always review source, verify the publisher, and test in isolation before production.
from openai import OpenAI
from typing import List
import numpy as np
client = OpenAI()defget_embeddings( texts: List[str], model:str="text-embedding-3-small", dimensions:int=None)-> List[List[float]]:"""Get embeddings from OpenAI."""# Handle batching for large lists batch_size =100 all_embeddings =[]for i inrange(0,len(texts), batch_size): batch = texts[i:i + batch_size] kwargs ={"input": batch,"model": model}if dimensions: kwargs["dimensions"]= dimensions
response = client.embeddings.create(**kwargs) embeddings =[item.embedding for item in response.data] all_embeddings.extend(embeddings)return all_embeddings
defget_embedding(text:str,**kwargs)-> List[float]:"""Get single embedding."""return get_embeddings([text],**kwargs)[0]# Dimension reduction with OpenAIdefget_reduced_embedding(text:str, dimensions:int=512)-> List[float]:"""Get embedding with reduced dimensions (Matryoshka)."""return get_embedding( text, model="text-embedding-3-small", dimensions=dimensions
)
Template 2: Local Embeddings with Sentence Transformers
from sentence_transformers import SentenceTransformer
from typing import List, Optional
import numpy as np
classLocalEmbedder:"""Local embedding with sentence-transformers."""def__init__( self, model_name:str="BAAI/bge-large-en-v1.5", device:str="cuda"): self.model = SentenceTransformer(model_name, device=device)defembed( self, texts: List[str], normalize:bool=True, show_progress:bool=False)-> np.ndarray:"""Embed texts with optional normalization.""" embeddings = self.model.encode( texts, normalize_embeddings=normalize, show_progress_bar=show_progress, convert_to_numpy=True)return embeddings
defembed_query(self, query:str)-> np.ndarray:"""Embed a query with BGE-style prefix."""# BGE models benefit from query prefixif"bge"in self.model.get_sentence_embedding_dimension(): query =f"Represent this sentence for searching relevant passages: {query}"return self.embed([query])[0]defembed_documents(self, documents: List[str])-> np.ndarray:"""Embed documents for indexing."""return self.embed(documents)# E5 model with instructionsclassE5Embedder:def__init__(self, model_name:str="intfloat/multilingual-e5-large"): self.model = SentenceTransformer(model_name)defembed_query(self, query:str)-> np.ndarray:return self.model.encode(f"query: {query}")defembed_document(self, document:str)-> np.ndarray:return self.model.encode(f"passage: {document}")
Template 3: Chunking Strategies
from typing import List, Tuple
import re
defchunk_by_tokens( text:str, chunk_size:int=512, chunk_overlap:int=50, tokenizer=None)-> List[str]:"""Chunk text by token count."""import tiktoken
tokenizer = tokenizer or tiktoken.get_encoding("cl100k_base") tokens = tokenizer.encode(text) chunks =[] start =0while start <len(tokens): end = start + chunk_size
chunk_tokens = tokens[start:end] chunk_text = tokenizer
Implementation Guide
Prerequisites
โบClaude Desktop or compatible AI client with skill support
โบClear understanding of task or problem to solve
โบWillingness to iterate and refine outputs
Time Estimate
15-45 minutes depending on use case complexity
Steps
1Install skill using provided installation command
2Test with simple use case relevant to your work
3Evaluate output quality and relevance
4Iterate on prompts to improve results
5Integrate into regular workflow if valuable
Common Pitfalls
โ Expecting perfect results without iteration
โ Not providing enough context in prompts
โ Using skill for tasks outside its intended scope
โ Accepting outputs without review and validation
Best Practices
โ Do
+Start with clear, specific prompts
+Provide relevant context and constraints
+Review and refine all outputs before using
+Iterate to improve output quality
+Document successful prompt patterns
โ Don't
โDon't use without understanding skill limitations
โDon't skip validation of outputs
โDon't share sensitive information in prompts
โDon't expect skill to replace human judgment
๐ก Pro Tips
โ Be specific about desired format and style
โ Ask for multiple options to choose from
โ Request explanations to understand reasoning
โ Combine AI efficiency with human expertise
When to Use This
โ Use when
Use when skill capabilities match your task, clear ROI on time saved, and you can validate outputs. Best for repetitive tasks, learning, and quality improvement.
โ Avoid when
Avoid when task requires deep expertise you can't validate, involves sensitive decisions, or when learning process is more valuable than speed of completion.
Learning Path
1Familiarize yourself with skill capabilities and limitations
2Start with low-risk, non-critical tasks
3Progress to more complex and valuable use cases
4Build expertise through regular use and experimentation