transcript-fixer

daymade/claude-code-skills · updated May 7, 2026

MDX-style export adds YAML metadata + attribution linking explainx.ai and this canonical listing URL.

$npx skills add https://github.com/daymade/claude-code-skills --skill transcript-fixer
0 commentsdiscussion
summary

Two-phase correction pipeline: deterministic dictionary rules (instant, free) followed by AI-powered error detection. Corrections accumulate in ~/.transcript-fixer/corrections.db, improving accuracy over time.

skill.md

Transcript Fixer

Two-phase correction pipeline: deterministic dictionary rules (instant, free) followed by AI-powered error detection. Corrections accumulate in ~/.transcript-fixer/corrections.db, improving accuracy over time.

Prerequisites

All scripts use PEP 723 inline metadata — uv run auto-installs dependencies. Requires uv (install guide).

Quick Start

# First time: Initialize database
uv run scripts/fix_transcription.py --init

# Phase 1: Dictionary corrections (instant, free)
uv run scripts/fix_transcription.py --input meeting.md --stage 1

After Stage 1, Claude reads the output and fixes remaining ASR errors natively (no API key needed):

  1. Read Stage 1 output in ~200-line chunks using the Read tool
  2. Identify ASR errors — homophones, garbled terms, broken sentences
  3. Present corrections in a table for user review before applying
  4. Save stable patterns to dictionary for future reuse

See references/example_session.md for a concrete input/output walkthrough.

Alternative: API batch processing (for automation without Claude Code):

export GLM_API_KEY="<api-key>"  # From https://open.bigmodel.cn/
uv run scripts/fix_transcript_enhanced.py input.md --output ./corrected

Core Workflow

Two-phase pipeline with persistent learning:

  1. Initialize (once): uv run scripts/fix_transcription.py --init
  2. Add domain corrections: --add "错误词" "正确词" --domain <domain>
  3. Phase 1 — Dictionary: --input file.md --stage 1 (instant, free)
  4. Phase 2 — AI Correction: Claude reads output and fixes errors natively, or --stage 3 with GLM_API_KEY for API mode
  5. Save stable patterns: --add "错误词" "正确词" after each session
  6. Review learned patterns: --review-learned and --approve high-confidence suggestions

Domains: general, embodied_ai, finance, medical, or custom (e.g., 火星加速器) Learning: Patterns appearing ≥3 times at ≥80% confidence auto-promote from AI to dictionary

After fixing, always save reusable corrections to dictionary. This is the skill's core value — see references/iteration_workflow.md for the complete checklist.

False Positive Prevention

Adding wrong dictionary rules silently corrupts future transcripts. Read references/false_positive_guide.md before adding any correction rule, especially for short words (≤2 chars) or common Chinese words that appear correctly in normal text.

Native AI Correction (Default Mode)

When running inside Claude Code, use Claude's own language understanding for Phase 2:

  1. Run Stage 1 (dictionary): --input file.md --stage 1
  2. Verify Stage 1 — diff original vs output. If dictionary introduced false positives, work from the original file
  3. Read the full text in ~200-line chunks. Read the entire transcript before proposing corrections — later context often disambiguates earlier errors
  4. Identify ASR errors:
    • Product/tool names: "close code" → "Claude Code", "get Hub" → "GitHub"
    • Technical terms: "Web coding" → "Vibe Coding", "happy pass" → "happy path"
    • Homophone errors: "上海文" → "上下文", "分值" → "分支"
    • English ASR garbling: "Pre top" → "prototype", "rapper" → "repo"
    • Broken sentences: "很大程。路上" → "很大程度上"
  5. Present corrections in high/medium confidence tables with line numbers
  6. Apply with sed on a copy, verify with diff, replace original
  7. Generate word diff: uv run scripts/generate_word_diff.py original.md corrected.md diff.html
  8. Save stable patterns to dictionary
  9. Remove false positives if Stage 1 had any

Enhanced Capabilities (Native Mode Only)

  • Intelligent paragraph breaks: Add \n\n at logical topic transitions
  • Filler word reduction: "这个这个这个" → "这个"
  • Interactive review: Corrections confirmed before applying
  • Context-aware judgment: Full document context resolves ambiguous errors

When to Use API Mode Instead

Use GLM_API_KEY + Stage 3 for batch processing, standalone usage without Claude Code, or reproducible automated processing.

Legacy Fallback

When the script outputs [CLAUDE_FALLBACK] (GLM API error), switch to native mode automatically.

Utility Scripts

Timestamp repair:

uv run scripts/fix_transcript_timestamps.py meeting.txt --in-place

Split transcript into sections (rebase each to 00:00:00):

uv run scripts/split_transcript_sections.py meeting.txt \
  --first-section-name "课前聊天" \
  --section "正式上课::好,无缝切换嘛。" \
  --rebase-to-zero

Word-level diff (recommended for reviewing corrections):

uv run scripts/generate_word_diff.py original.md corrected.md output.html

Output Files

  • *_stage1.md — Dictionary corrections applied
  • *_corrected.txt — Final version (native mode) or *_stage2.md (API mode)
  • *_对比.html — Visual diff (open in browser)

Database Operations

Read references/database_schema.md before any database operations.

sqlite3 ~/.transcript-fixer/corrections.db "SELECT * FROM active_corrections;"
sqlite3 ~/.transcript-fixer/corrections.db "SELECT value FROM system_config WHERE key='schema_version';"

Stages

Stage Description Speed Cost
1 Dictionary only Instant Free
1 + Native Dictionary + Claude AI (default) ~1min Free
3 Dictionary + API AI + diff report ~10s API calls

Bundled Resources

Scripts:

  • fix_transcription.py — Core CLI (dictionary, add, audit, learning)
  • fix_transcript_enhanced.py — Enhanced wrapper for interactive use
  • fix_transcript_timestamps.py — Timestamp normalization and repair
  • generate_word_diff.py — Word-level diff HTML generation
  • split_transcript_sections.py — Split transcript by marker phrases

References (load as needed):

  • Safety: false_positive_guide.md (read before adding rules), database_schema.md (read before DB ops)
  • Workflow: iteration_workflow.md, workflow_guide.md, example_session.md
  • CLI: quick_reference.md, script_parameters.md
  • Advanced: dictionary_guide.md, sql_queries.md, architecture.md, best_practices.md
  • Operations: troubleshooting.md, installation_setup.md, glm_api_setup.md, team_collaboration.md

Troubleshooting

uv run scripts/fix_transcription.py --validate checks setup health. See references/troubleshooting.md for detailed resolution.

Next Step: Structure into Meeting Minutes

After correcting a transcript, if the content is from a meeting, lecture, or interview, suggest structuring it:

Transcript corrected: [N] errors fixed, saved to [output_path].

Want to turn this into structured meeting minutes with decisions and action items?

Options:
A) Yes — run /meeting-minutes-taker (Recommended for meetings/lectures)
B) Export as PDF — run /pdf-creator on the corrected text
C) No thanks — the corrected transcript is all I need
how to use transcript-fixer

How to use transcript-fixer on Cursor

AI-first code editor with Composer

1

Prerequisites

Before installing skills in Cursor, ensure your development environment meets these requirements:

  • Cursor installed and configured on your development machine
  • Node.js version 16.0+ with npm package manager (verify with node --version)
  • Active project directory or workspace where you want to add transcript-fixer
2

Execute installation command

Execute the skills CLI command in your project's root directory to begin installation:

$npx skills add https://github.com/daymade/claude-code-skills --skill transcript-fixer

The skills CLI fetches transcript-fixer from GitHub repository daymade/claude-code-skills and configures it for Cursor.

3

Select Cursor when prompted

The CLI will show a list of available agents. Use arrow keys to navigate and space to select Cursor:

◆ Which agents do you want to install to?
│ ── Universal (.agents/skills) ── always included ────
│ • Amp
│ • Antigravity
│ • Cline
│ • Codex
│ ●Cursor(selected)
│ • Cursor
│ • Windsurf
4

Verify installation

Confirm successful installation by checking the skill directory location:

.cursor/skills/transcript-fixer

Reload or restart Cursor to activate transcript-fixer. Access the skill through slash commands (e.g., /transcript-fixer) or your agent's skill management interface.

Security & Verification Notice

We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.

Skills execute code in your development environment. Always verify the publisher's identity, review recent commits, and test in isolated environments before production deployment.

List & Monetize Your Skill

Submit your Claude Code skill and start earning

GET_STARTED →

Use Cases

User Story & Requirements Generation

Create detailed user stories, acceptance criteria, and feature specs

Example

Generate user stories for 'password reset feature' with acceptance criteria, edge cases, and test scenarios

Reduce spec writing time by 50%, ensure comprehensive coverage

Competitive Analysis

Research competitors, compare features, identify gaps

Example

Analyze 5 competitor products, create feature comparison matrix, suggest differentiation opportunities

Complete competitive research in 2 hours instead of 2 days

Roadmap Prioritization

Evaluate features using frameworks (RICE, ICE, Kano) and create prioritized backlogs

Example

Score 20 feature ideas using RICE framework, generate prioritized roadmap with rationale

Make data-driven prioritization decisions faster

Stakeholder Communication

Draft PRDs, status updates, and stakeholder presentations

Example

Create executive summary of Q3 roadmap, monthly progress report, feature launch announcement

Save 3-5 hours/week on communication overhead

Implementation Guide

Prerequisites

  • Claude Desktop or compatible AI client
  • Access to product documentation and roadmap tools (Jira, Notion, etc.)
  • Understanding of product management frameworks (RICE, Jobs-to-be-Done, etc.)
  • Stakeholder contact information and communication channels

Time Estimate

30-60 minutes to see productivity improvements

Installation Steps

  1. 1.Install product management skill
  2. 2.Start with user story generation for known feature
  3. 3.Progress to competitive analysis: research 2-3 competitors
  4. 4.Use for roadmap prioritization: apply RICE/ICE scoring
  5. 5.Draft stakeholder communications and refine based on feedback
  6. 6.Build template library for recurring PM tasks
  7. 7.Share effective prompts with product team

Common Pitfalls

  • Not validating competitive research—verify facts before sharing
  • Accepting user stories without involving engineering team
  • Over-relying on frameworks without qualitative judgment
  • Not customizing outputs to company culture and communication style
  • Skipping stakeholder validation of generated requirements

Best Practices

✓ Do

  • +Validate research and competitive analysis with real data
  • +Collaborate with engineering when generating technical requirements
  • +Customize frameworks and templates to your company context
  • +Use skill for first drafts, refine with stakeholder input
  • +Document successful prompt patterns for PM tasks
  • +Combine AI efficiency with human judgment and intuition

✗ Don't

  • Don't publish competitive analysis without fact-checking
  • Don't finalize user stories without engineering review
  • Don't make prioritization decisions solely on AI scoring
  • Don't skip customer validation of generated requirements
  • Don't ignore company-specific context and culture

💡 Pro Tips

  • Provide context: company goals, constraints, customer feedback
  • Ask for alternatives: 'Show 3 ways to prioritize this roadmap'
  • Request stakeholder-specific formatting: 'Executive summary vs. engineering spec'
  • Use skill for 70% generation + 30% customization to company needs

When to Use This

✓ Use When

Use for user story writing, competitive research, roadmap prioritization, stakeholder communication, and PRD drafting. Best for reducing repetitive documentation and research work.

✗ Avoid When

Avoid for strategic product vision (requires deep customer empathy), pricing decisions (needs market and financial expertise), or when face-to-face customer discovery is more valuable than speed.

Learning Path

  1. 1Basic: user stories, feature specs, status updates
  2. 2Intermediate: competitive analysis, prioritization frameworks, PRDs
  3. 3Advanced: product strategy, go-to-market planning, OKR setting
  4. 4Expert: product vision, market positioning, business model innovation

Discussion

Product Hunt–style comments (not star reviews)
  • No comments yet — start the thread.
general reviews

Ratings

4.565 reviews
  • Chaitanya Patil· Dec 24, 2024

    Keeps context tight: transcript-fixer is the kind of skill you can hand to a new teammate without a long onboarding doc.

  • Meera Verma· Dec 24, 2024

    Registry listing for transcript-fixer matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Yuki Yang· Dec 16, 2024

    transcript-fixer is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.

  • Lucas Gupta· Dec 8, 2024

    Useful defaults in transcript-fixer — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • Pratham Ware· Dec 4, 2024

    transcript-fixer fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Amina Ghosh· Nov 27, 2024

    Registry listing for transcript-fixer matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Piyush G· Nov 15, 2024

    transcript-fixer has been reliable in day-to-day use. Documentation quality is above average for community skills.

  • Meera Menon· Nov 15, 2024

    Useful defaults in transcript-fixer — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • Amina Torres· Nov 11, 2024

    transcript-fixer fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Yuki Haddad· Nov 7, 2024

    Solid pick for teams standardizing on skills: transcript-fixer is focused, and the summary matches what you get after install.

showing 1-10 of 65

1 / 7