tao
57 indexed skills · max 10 per page
tao-train-pointpillars
nvidia/skills · tao
PointPillars for 3D object detection from LiDAR point clouds. Encodes point clouds into a pseudo-image via a
tao-train-segformer
nvidia/skills · tao
SegFormer for semantic segmentation. Lightweight transformer-based architecture with hierarchical feature
tao-generate-referring-expressions
nvidia/skills · tao
Four-step image referring-expression pipeline: turns images plus KITTI bounding-box labels into region
tao-train-ocdnet
nvidia/skills · tao
OCDNet for scene text detection. Detects arbitrary-oriented text regions in natural images using a
tao-convert-dataset-format
nvidia/skills · tao
Run `tao-daft convert` to convert NVIDIA TAO DAFT datasets between supported formats. Do not use for non-DAFT data.
tao-train-nvpanoptix3d
nvidia/skills · tao
NVPanoptix3D for panoptic 3D scene reconstruction from posed RGB images. Produces 3D panoptic segmentation
tao-train-nvdinov2
nvidia/skills · tao
NVDINOv2 for self-supervised visual representation learning. Trains vision transformers via self-distillation
tao-train-metric-learning-recognition
nvidia/skills · tao
Metric-learning recognition (ml-recog) for fine-grained visual recognition. Learns embeddings for
tao-train-mask2former
nvidia/skills · tao
Mask2Former for universal image segmentation (panoptic, instance, and semantic). Transformer-based with
tao-train-mask-grounding-dino
nvidia/skills · tao
Mask Grounding DINO for grounded instance segmentation. Extends Grounding DINO with a mask-prediction head for