Tuesday, June 23, 2026
Merged timeline of 267 items — blog publish times and listing timestamps, cut at midnight . Page 4 of 6.
- Skilljetsonjetson-print-device-info
Use when you need to print Jetson device info (module model, L4T version, kernel, OS version, current power mode) from a running Jetson target. This is an example skill.
by Yash @ Explainx0 comments - Skilljetsonjetson-speculative-decoding
Add EAGLE-3 or draft-model speculative decoding to a Jetson vLLM server when TPOT is the bottleneck.
by Yash @ Explainx0 comments - Skillnemo-mbridgenemo-mbridge-perf-cuda-graphs
Validate and use CUDA graph capture in Megatron Bridge, including local full-iteration graphs and Transformer Engine scoped graphs for attention, MLP, and MoE modules.
by Yash @ Explainx0 comments - Skillnemo-mbridgenemo-mbridge-perf-cpu-offloading
Validate and use CPU offloading in Megatron Bridge, including layer-level activation offloading and fractional optimizer state offloading with HybridDeviceOptimizer.
by Yash @ Explainx0 comments - Skillnemo-mbridgenemo-mbridge-perf-activation-recompute
Validate and use selective and full activation recompute in Megatron Bridge to reduce GPU memory usage at the cost of extra compute.
by Yash @ Explainx0 comments - Skilljetsonjetson-customize-uphy
Configure Jetson UPHY lane allocation (uphy0/uphy1-config) on Orin/Thor custom carriers. Do NOT use for pinmux or PCIe-only edits.
by Yash @ Explainx0 comments - Skillnemo-mbridgenemo-mbridge-mlm-bridge-training
Run Megatron-LM (MLM) and Megatron Bridge training with mock or real data. Covers correlation testing, available recipes, and multi-GPU examples.
by Yash @ Explainx0 comments - Skillmcoremcore-split-pr
Split a PR into multiple PRs to reduce the number of required CODEOWNERS reviewer groups.
by Yash @ Explainx0 comments - Skillnemonemo-evaluator-plugin
Use when working on the Evaluator plugin CLI, jobs, SDK-backed specs, metric types, or plugin-owned Evaluator skills.
by Yash @ Explainx0 comments - Skillnemonemo-automodel-recipe-development
Create and modify NeMo AutoModel training and evaluation recipes, including YAML structure, builders, and execution flow.
by Yash @ Explainx0 comments - Skillnemo-mbridgenemo-mbridge-multi-node-slurm
Convert single-node scripts to multi-node Slurm sbatch jobs and debug common multi-node failures. Covers srun-native vs uv run torch.distributed approaches, container setup, NCCL timeouts, OOM sizing for MoE models, and…
by Yash @ Explainx0 comments - Skillnemonemo-data-designer-plugin
Use when the user wants to create a dataset, generate synthetic data, or build a data generation pipeline.
by Yash @ Explainx0 comments - Skillnemonemo-automodel-model-onboarding
Guide for onboarding new model architectures into NeMo AutoModel, including architecture discovery, implementation patterns, registration, and validation.
by Yash @ Explainx0 comments - Skilljetsonjetson-package
Pick Jetson-compatible containers, vLLM runtime images, and Jetson AI Lab PyPI indexes; maps Orin SM 8.7 vs Thor SM 11.0 and JetPack-specific package choices.
by Yash @ Explainx0 comments - Skillnemonemo-automodel-launcher-config
Configure NeMo AutoModel job launches for interactive runs, Slurm clusters, and SkyPilot cloud execution.
by Yash @ Explainx0 comments - Skillnemonemo-automodel-distributed-training
Guide for selecting and configuring distributed training strategies in NeMo AutoModel, including FSDP2, Megatron FSDP, DDP, and parallelism settings.
by Yash @ Explainx0 comments - Skilljetsonjetson-memory-audit
Measure Jetson DRAM/NvMap usage and verify before/after memory reclamation with live audit data.
by Yash @ Explainx0 comments - Skilljetsonjetson-llm-serve
Stand up vLLM or SGLang serving on Jetson, using upstream vLLM on Thor and Orin JetPack 7.2+, and NVIDIA-AI-IOT vLLM on older Orin.
by Yash @ Explainx0 comments - Skillmcoremcore-testing
Test system for Megatron-LM. Covers test layout, recipe YAML structure, adding and running unit and functional tests, golden values, marker filters, and CI parity.
by Yash @ Explainx0 comments - Skillmcoremcore-create-issue
Investigate a failing GitHub Actions run or job and create a GitHub issue for the failure.
by Yash @ Explainx0 comments - Skilljetsonjetson-inference-mem-tune
Pick the serving stack and per-runtime memory flags (vLLM, SGLang, llama.cpp, TensorRT Edge-LLM) for an LLM/VLM workload on any NVIDIA Jetson.
by Yash @ Explainx0 comments - Skillmcoremcore-linting-and-formatting
Linting and formatting for Megatron-LM. Covers running autoformat.sh, tools (ruff, black, isort, pylint, mypy), and code style rules.
by Yash @ Explainx0 comments - Skillmcoremcore-run-on-slurm
How to launch distributed Megatron-LM training jobs on a SLURM cluster. Covers a minimal sbatch skeleton, environment-variable setup for torch.distributed.run, CUDA_DEVICE_MAX_CONNECTIONS rules across hardware and paral…
by Yash @ Explainx0 comments - Skilljetsonjetson-headless-mode
Plan and apply safe Jetson headless-mode changes to reclaim GUI and daemon memory.
by Yash @ Explainx0 comments - Skilljetsonjetson-customize-usb
Enable/disable Jetson USB2/USB3 SS ports via kernel-DT overlay. Do NOT use for UPHY lane allocation or ODMDATA edits.
by Yash @ Explainx0 comments - Skilljetsonjetson-print-bsp-info
Use when you need to print Jetson BSP info (L4T version, board configs, rootfs state) from a Linux_for_Tegra root on the host PC. This is an example skill.
by Yash @ Explainx0 comments - Skilljetsonjetson-llm-benchmark
Benchmark Jetson LLM/VLM serving performance across vLLM, llama.cpp, and Ollama with structured JSON output.
by Yash @ Explainx0 comments - Skillaccelerated-computinglaunch-nemo-rl
Playbook for launching, monitoring, stopping, and debugging NeMo-RL recipes on a Kubernetes cluster via the nrl-k8s CLI. Covers ephemeral vs long-lived RayCluster modes, iterating on runs, and debugging hung or failed t…
by Yash @ Explainx0 comments - Skillcuoptcuopt-numerical-optimization-formulation
LP, MILP, QP — concepts, problem-text parsing, and formulation patterns (parameters, constraints, decisions, objective). Concepts only; no API.
by Yash @ Explainx0 comments - Skillhsbhsb-setup
Clone the latest NVIDIA Holoscan Sensor Bridge repo, ask which supported devkit is being used, configure the host per platform, build the correct demo container, run it, and verify HSB connectivity by pinging 192.168.0.…
by Yash @ Explainx0 comments - Skillhsbhsb-test
Execute QA test plans on Holoscan Sensor Bridge hardware. Reads a user-provided test document, filters tests by the user's setup, determines which tests can run automatically, executes them with pass/fail evaluation, an…
by Yash @ Explainx0 comments - Skilldigital-healthdigital-health-clinical-asr-build
Stage 2 of the Clinical ASR Flywheel. Use when curating clinical terms, tagging IPA, and synthesizing a NeMo manifest. NOT for scoring (use /digital-health-clinical-asr-eval).
by Yash @ Explainx0 comments - Skillholoscanholoscan-install-container
Install Holoscan SDK via the NGC Docker container. Use for container-based installs; not for native apt/pip/Conda installs.
by Yash @ Explainx0 comments - Skillhsbhsb-flash
Flash the FPGA on an HSB board connected to an NVIDIA devkit. Supports HSB Lattice boards (FPGA versions 2407, 2412, 2507, 2510) and Leopard Imaging VB1940 "all-in-one" cameras (FPGA versions 2507, 2510). Uses release-s…
by Yash @ Explainx0 comments - Skillholoscanholoscan-setup
Guides Holoscan SDK installation: inspects the host, assesses platform compatibility, recommends an install method, and delegates to the matching install skill.
by Yash @ Explainx0 comments - Skilldicomdicom-series-to-volume
Used for converting one CT DICOM series folder to a HU NIfTI volume with affine evidence. Not for multi-frame DICOM or clinical use.
by Yash @ Explainx0 comments - Skillhsbhsb-app
Discover and run Holoscan Sensor Bridge example applications on a connected devkit. Filters available apps by the user's platform, HSB software version, board type, and sensors. Supports timed execution, failure analysi…
by Yash @ Explainx0 comments - Skillholoscanholoscan-install-wheel
Install Holoscan SDK Python wheel via pip into a venv. Use for Python installs; not for native C++/apt or Conda installs.
by Yash @ Explainx0 comments