tag

tao

57 indexed skills · max 10 per page

skills (57)

tao-train-mask-auto-label

nvidia/skills · tao

MAL (Mask Auto-Label) for weakly-supervised segmentation. Produces segmentation masks from minimal annotations

tao-analyze-gaps-vlm-bcq

nvidia/skills · tao

Extract false-positive and false-negative gaps from VLM binary-classification-question (BCQ, yes/no) predictions.

tao-generate-image-grounding

nvidia/skills · tao

Two-step image grounding pipeline: extracts referring expressions from (image, caption) pairs and grounds them

tao-train-fast-foundation-stereo

nvidia/skills · tao

Real-time stereo depth estimation using FastFoundationStereo (FFS), the distilled bp2 commercial variant of

tao-mine-aoi-images

nvidia/skills · tao

Runs the DEFT embed-then-mine workflow for VCN AOI iterations — embeds the gap-analysis target parquet, embeds a source pool, and mines nearest-neighbour source images for downstream augmentation. Use as the immediate next step after `tao-route-visual-changenet-samples` when expanding a real-image augmentation queue from the mining subset.

tao-train-mask-auto-encoder

nvidia/skills · tao

Masked Auto-Encoder (MAE) for self-supervised pretraining and fine-tuning. Masks random patches and reconstructs

tao-train-image-classification

nvidia/skills · tao

PyTorch-based TAO image classification. Supports a wide range of backbones (FAN, EfficientNet, ResNet, etc.)

tao-train-grounding-dino

nvidia/skills · tao

Grounding DINO for open-set object detection. Combines DINO-style detection with a BERT text encoder for

tao-train-foundation-stereo

nvidia/skills · tao

Stereo depth estimation using FoundationStereo. Predicts disparity maps from stereo image pairs for 3D

tao-finetune-huggingface-model

nvidia/skills · tao
