explainx.ai


© 2026 AISOLO Technologies Pvt Ltd

tag: text

20 indexed skills · max 10 per page

skills (20)

text-to-image-prompt-optimizer

manzxiao/text-to-image-prompt-optimizer · Productivity

0

Generate professional, optimized prompts for AI image generation tools with primary support for Google Gemini (Nano Banana), plus Midjourney, Stable Diffusion, DALL-E, Leonardo.ai, and others.

speech-to-text

inference-sh/skills · Productivity

0

Transcribe audio to text via inference.sh CLI.

extracting-pdf-text

letta-ai/skills · Documents

0

This skill provides tools and guidance for extracting text from PDFs in formats suitable for language model consumption.

alicloud-ai-text-document-mind

cinience/alicloud-skills · Cloud

0

Category: provider

regex-vs-llm-structured-text

affaan-m/everything-claude-code · AI/ML

0

Hybrid regex-and-LLM framework for parsing structured text, optimizing cost by handling 95–98% with regex and reserving LLM calls for edge cases.
Combines regex extraction with confidence scoring to flag low-confidence items, then validates only those items with an LLM, reducing LLM calls by ~95% versus all-LLM approaches.
Includes production-ready Python patterns for regex parsing, confidence scoring, and hybrid pipeline orchestration with real metrics from a 410-item quiz parsing examp…
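The hybrid idea described in this entry can be sketched roughly as follows. This is an illustration only, not the skill's actual code: the quiz-line format, `ITEM_RE`, the confidence penalties, the 0.8 threshold, and `stub_llm` (a stand-in for a real model call) are all assumptions made for the example.

```python
import re
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional, Tuple

# Hypothetical input format: "12. What is 2+2? Answer: B"
ITEM_RE = re.compile(r"^\s*(\d+)[.)]\s*(.+?)\s*(?:Answer:\s*([A-D]))?\s*$")


@dataclass
class Item:
    number: int
    question: str
    answer: Optional[str]
    confidence: float


def parse_line(line: str) -> Optional[Item]:
    """Regex pass: extract fields and attach a heuristic confidence score."""
    m = ITEM_RE.match(line)
    if m is None:
        return None
    number, question, answer = m.groups()
    confidence = 1.0
    if answer is None:               # no answer key found
        confidence -= 0.5
    if not question.endswith("?"):   # question text looks incomplete
        confidence -= 0.3
    return Item(int(number), question, answer, confidence)


def stub_llm(line: str, draft: Optional[Item]) -> Optional[Item]:
    """Stand-in for an LLM validator (assumption): accepts any draft as reviewed."""
    if draft is None:
        return None
    return Item(draft.number, draft.question, draft.answer, 1.0)


def hybrid_parse(
    lines: Iterable[str],
    llm_validate: Callable[[str, Optional[Item]], Optional[Item]],
    threshold: float = 0.8,
) -> Tuple[List[Item], int]:
    """Send only unparsed or low-confidence lines to the (expensive) validator."""
    results: List[Item] = []
    llm_calls = 0
    for line in lines:
        item = parse_line(line)
        if item is None or item.confidence < threshold:
            llm_calls += 1
            item = llm_validate(line, item)
        if item is not None:
            results.append(item)
    return results, llm_calls


if __name__ == "__main__":
    sample = ["1. What is 2+2? Answer: B", "2. Capital of France", "garbage"]
    items, calls = hybrid_parse(sample, stub_llm)
    print(len(items), "items parsed;", calls, "validator calls")
```

In this sketch the validator is passed in as a callable, so the regex pass and the scoring rules can be tested without any model access.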

text-to-speech

elevenlabs/skills · Productivity

0

Natural speech synthesis from text across 70+ languages with multiple quality and latency models.
Six models available ranging from highest-quality eleven_v3 to ultra-low-latency eleven_flash_v2_5 (~75ms), with language and speed tradeoffs documented.
Supports 13+ output formats including MP3, PCM, WAV, Opus, and telephony codecs (μ-law, A-law) for web, streaming, and real-time applications.
Fine-tune voice characteristics via stability, similarity boost, style, speaker boost, and spee…

alicloud-ai-text-document-mind-test

cinience/alicloud-skills · Cloud

0

Category: test

text-to-speech

inferen-sh/skills · Productivity

0

Multiple text-to-speech models via inference.sh CLI for voiceovers, podcasts, and accessibility.
Six models available: ElevenLabs (premium, 22+ voices, 32 languages), DIA TTS (conversational), Kokoro TTS (fast), Chatterbox, Higgs Audio (emotional control), and VibeVoice (long-form podcasts).
Core capabilities include basic speech synthesis, expressive speech with emotion control, and conversational dialogue generation.
Easily combine with video tools like OmniHuman to create talking he…

paddleocr-text-recognition

aidenwu0209/paddleocr-skills · Productivity

0

Extract text from images, PDFs, and documents via PaddleOCR API with structured JSON output.
Supports URLs and local file paths for images and PDFs; returns complete recognized text in JSON format.
Mandatory API-only approach: executes python scripts/ocr_caller.py with --file-url or --file-path parameters.
Requires initial configuration with PADDLEOCR_OCR_API_URL and PADDLEOCR_ACCESS_TOKEN; displays full extracted text without truncation or summarization.
Handles authentication, rat…

speech-to-text

inferen-sh/skills · Productivity

0

Transcribe audio to text using ElevenLabs Scribe or Whisper models via inference.sh CLI.
Three model options: ElevenLabs Scribe v2 (98%+ accuracy with diarization), Fast Whisper V3, and Whisper V3 Large for varying speed/accuracy tradeoffs.
Supports 99+ languages, optional timestamps, speaker diarization, and translation to English.
Common workflows include meeting transcription, podcast transcripts, video subtitles, and voice note conversion.
Requires inference.sh CLI (infsh) inst…

page 2 / 2