tao▌
57 indexed skills · max 10 per page
tao-train-mask-auto-label
nvidia/skills · tao
MAL (Mask Auto-Label) for weakly-supervised segmentation. Produces segmentation masks from minimal annotations
tao-analyze-gaps-vlm-bcq
nvidia/skills · tao
Extract false-positive and false-negative gaps from VLM binary-classification-question (BCQ, yes/no) predictions.
tao-generate-image-grounding
nvidia/skills · tao
Two-step image grounding pipeline: extracts referring expressions from (image, caption) pairs and grounds them
tao-train-fast-foundation-stereo
nvidia/skills · tao
Real-time stereo depth estimation using FastFoundationStereo (FFS), the distilled bp2 commercial variant of
tao-mine-aoi-images
nvidia/skills · tao
Runs the DEFT embed-then-mine workflow for VCN AOI iterations — embeds the gap-analysis target parquet, embeds a source pool, and mines nearest-neighbour source images for downstream augmentation. Use as the immediate next step after `tao-route-visual-changenet-samples` when expanding a real-image augmentation queue from the mining subset.
tao-train-mask-auto-encoder
nvidia/skills · tao
Masked Auto-Encoder (MAE) for self-supervised pretraining and fine-tuning. Masks random patches and reconstructs
tao-train-image-classification
nvidia/skills · tao
PyTorch-based TAO image classification. Supports a wide range of backbones (FAN, EfficientNet, ResNet, etc.)
tao-train-grounding-dino
nvidia/skills · tao
Grounding DINO for open-set object detection. Combines DINO-style detection with a BERT text encoder for
tao-train-foundation-stereo
nvidia/skills · tao
Stereo depth estimation using FoundationStereo. Predicts disparity maps from stereo image pairs for 3D
tao-finetune-huggingface-model
nvidia/skills · tao
>