kernel
14 indexed skills · max 10 per page
tilegym-adding-cutile-kernel
nvidia/skills · tilegym
Add a new cuTile GPU kernel operator to TileGym. Covers dispatch registration in ops.py, cuTile backend implementation, __init__.py exports, test creation, and benchmark in tests/benchmark. Use when adding, creating, or implementing a new cuTile operator/kernel in TileGym, or when asking how to register a new cuTile op.
tilegym-converting-cutile-to-julia
nvidia/skills · tilegym
Converts cuTile Python GPU kernels (@ct.kernel) to cuTile.jl Julia equivalents. Handles kernel syntax translation, 0-indexed to 1-indexed conversion, broadcasting differences, memory layout (row-major to column-major), type system mapping, and launch API differences. Use when converting, porting, or translating cuTile Python kernels to Julia cuTile.jl, or debugging/optimizing existing Julia cuTile translations.
tilegym-improve-cutile-kernel-perf
nvidia/skills · tilegym
Iteratively optimize cuTile kernel performance through systematic profiling, bottleneck analysis, IR comparison, and targeted tuning. Covers tile sizes, occupancy, autotune configs, TMA, latency hints, persistent scheduling, num_ctas, flush_to_zero, and IR-level debugging. Use when asked to "optimize cutile kernel", "improve kernel perf", "tune cutile performance", "make kernel faster", or iteratively benchmark and refine a cuTile GPU kernel in the TileGym project.
tilegym-converting-cutile-to-triton
nvidia/skills · tilegym
Converts cuTile GPU kernels (@ct.kernel) to Triton (@triton.jit). Handles standard in-repo conversion, debugging (cudaErrorIllegalAddress, shape mismatch, numerical mismatch), and mapping cuTile idioms (ct.load/ct.store, ct.Constant, ct.launch) to Triton equivalents. Covers dual-kernel layout flags (e.g. transpose=True/False + autotune grid via META) per translations/advanced-patterns.md. Use when converting, porting, or translating cuTile kernels to Triton, or debugging existing Triton translations.
tilegym-monkey-patch-kernels-to-transformers
nvidia/skills · tilegym
Integrate TileGym kernels into Hugging Face `transformers` models by replacing the library's submodules and selected class implementations, and by patching selected classes' init/forward/weight-loading methods before models are instantiated. Use when the user needs to integrate TileGym kernels into `transformers` models.
jetson-init-source
nvidia/skills · jetson
analyzing-linux-kernel-rootkits
mukul975/Anthropic-Cybersecurity-Skills · analyzing-linux-kernel-rootkits
Detect kernel-level rootkits in Linux memory dumps using Volatility3 linux plugins (check_syscall, lsmod, hidden_modules), rkhunter system scanning, and /proc vs /sys discrepancy analysis to identify hooked syscalls, hidden kernel modules, and tampered system structures.
kernel-auth
kernel/skills · Productivity
Setup and manage Kernel managed authentication connections for any website with safety checks and reauthentication support.
kernel-typescript-sdk
kernel/skills · Backend
Use the Kernel TypeScript SDK when you need to:
kernel-agent-browser
kernel/skills · Productivity
This skill documents best practices for using agent-browser's built-in Kernel provider (-p kernel) for cloud browser automation.