>_Skillful
Need help with advanced AI agent engineering?Contact FirmAdapt
NVIDIA Corporation

NVIDIA Corporation

Organization

@nvidia

2788 San Tomas Expressway, Santa Clara, CA, 95051 nvidia.com On GitHub since May 2012

47

Published Tools

83,416

Total Stars

0

Weekly Downloads

26,651

GitHub Followers

749

Public Repos

100/100

Avg Security

Published Tools

2 MCP Servers19 Skills26 Agentsacross 8 categories

NeMo Framework

Generative AI framework built for researchers and PyTorch developers working on Large Language Models (LLMs), Multimodal Models (MMs), Automatic Speech Recognition (ASR), Text to Speech (TTS), and Computer Vision (CV) domains.

...more
AgentLLM Training Frameworks
17K1 dir

Megatron-LM

Ongoing research training transformer models at scale.

AgentLLM Training Frameworks
16K1 dir

TensorRT-LLM

Nvidia Framework for LLM Inference

AgentLLM Inference
13K2 dirs

garak

nv052193, Mads Kongsbak, Tianhao Li, Phyllis Poh, Razvan Dinu, Zander Mackie, Greg Stephens, Ahsan Ayub, Jonathan Liberman, Gustav Fredrikson, Oh Tien Cheng, Brain John, Naman Mishra, Soumili Nandi, Arjun Krishna, Mihailo Milenkovic, Kai Greshake, Martin Borup-Larsen, Emmanuel Ferdman, Eric Therond, Zoe Nolan, Harsh Raj, Shine-afk, Rafael Sandroni, Eric Hacker, Blessed Uyo, Ikko Eltociear Ashimine, iamnotcj, Dwight Temple, Shane Rosse, Masaya Ogushi, Viktor T. Zetterberg, Erwan Roussel, Matthew Rowe, Aishwarya Padmakumar, Marco Rosa, Ian Chu, Mike McKiernan, Divya Chitimalla, Katherine Luna, Dave Baker, Jack Kelly, Amrit Prakash, Cássia Sampaio, Nakul Rajpal, Noah Oeksuez, Dhruv Malik, Patricia Pampanelli, Joseph Davis Chamdani, Rob Geada, Ashish RajAnand, Paulina Kalicka, Gal Moshkovitz, Jack Smith, Paul A. Parkanzky, Leif Hancox-Li, Fabrizio Rocco, Sai Chandra Pandraju, Harish Kolla, Snehal Vartak, Abhiraj Sinha, Harsh Motla, Otavio Padovani, Siddhant Mishra, dyrtyData, Leone Lage Perdigão, Lucas Wang

LLM vulnerability scanner

Skillai-ml
7.4K1 dir

nvidia-eval-factory-garak

nv052193, Mads Kongsbak, Tianhao Li, Phyllis Poh, Razvan Dinu, Zander Mackie, Greg Stephens, Ahsan Ayub, Jonathan Liberman, Gustav Fredrikson, Oh Tien Cheng, Brain John, Naman Mishra, Soumili Nandi, Arjun Krishna, Mihailo Milenkovic, Kai Greshake, Martin Borup-Larsen, Emmanuel Ferdman, Eric Therond, Zoe Nolan, Harsh Raj, Shine-afk, Rafael Sandroni, Eric Hacker, Blessed Uyo, Ikko Eltociear Ashimine, iamnotcj, Dwight Temple, Shane Rosse, Masaya Ogushi, Viktor T. Zetterberg, Erwan Roussel, Matthew Rowe, Aishwarya Padmakumar, Marco Rosa, Ian Chu

garak (LLM vulnerability scanner) - packaged by NVIDIA Eval Factory

Skillai-ml
7.3K1 dir

FasterTransformer

NVIDIA Framework for LLM Inference(Transitioned to TensorRT-LLM)

AgentLLM Inference
6.4K1 dir

ai-dynamo

"NVIDIA Inc." <[email protected]>

Distributed Inference Framework

Skilluncategorised
6.3K1 dir

nemoguardrails

NVIDIA

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Skilluncategorised
5.8K2 dirs

Transformer Engine

A library for accelerating Transformer model training on NVIDIA GPUs.

AgentLLM Training Frameworks
3.2K1 dir

libkvikio-cu12

NVIDIA Corporation

KvikIO - GPUDirect Storage (C++)

Skillai-ml
2551 dir

libkvikio-cu13

NVIDIA Corporation

KvikIO - GPUDirect Storage (C++)

Skillai-ml
2551 dir

kvikio-cu12

NVIDIA Corporation

KvikIO - GPUDirect Storage

Skillai-ml
2551 dir

kvikio-cu13

NVIDIA Corporation

KvikIO - GPUDirect Storage

Skillai-ml
2551 dir

voice-agent-examples

nvidia

AI Space: nvidia/voice-agent-examples

AgentHF Space
192 dirs

sphinx-llm

None

Skillai-ml
161 dir

nvidia-profbench

NVIDIA Corporation

Professional domain benchmark for evaluating LLMs on Physics PhD, Chemistry PhD, Finance MBA, and Consulting MBA tasks

Skillai-ml
1 dir

nvidia-nat-ragaai

NVIDIA Corporation

Subpackage for RagaAI Catalyst integration in NeMo Agent Toolkit

Agentuncategorised
3 dirs

vss-ctx-rag

None

Skillai-ml
1 dir

nemo-text-processing

NVIDIA

NeMo text processing for ASR and TTS

Skilluncategorised
1 dir

NVIDIA: Nemotron 3 Super (free)

nvidia

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50% higher token generation compared to leading open models. The model features a 1M token context window for long-term agent coherence, cross-document reasoning, and multi-step task planning. Latent

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron 3 Nano 30B A3B (free)

nvidia

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully open with open-weights, datasets and recipes so developers can easily customize, optimize, and deploy the model on their infrastructure for maximum privacy and security.

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron 3 Nano 30B A3B

nvidia

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully open with open-weights, datasets and recipes so developers can easily customize, optimize, and deploy the model on their infrastructure for maximum privacy and security.

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron Nano 12B 2 VL (free)

nvidia

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s memory-efficient sequence modeling for significantly higher throughput and lower latency. The model supports inputs of text and multi-image documents, producing natural-language outputs. It is trained on high-quality NVIDIA-curated synthetic datasets

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron Nano 12B 2 VL

nvidia

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s memory-efficient sequence modeling for significantly higher throughput and lower latency. The model supports inputs of text and multi-image documents, producing natural-language outputs. It is trained on high-quality NVIDIA-curated synthetic datasets

...more
AgentLLM Model
1 dir

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and multi-turn chat, followed by multiple RL stages; Reward-aware Preference Optimization (RPO) for alignment, RL with Verifiable Rewards (RLVR) for step-wise reasoning, and iterative DPO to refine tool-use behavior. A distillation-driven Neural Arc

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron Nano 9B V2 (free)

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron Nano 9B V2

nvidia

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

...more
AgentLLM Model
1 dir

NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Accep

...more
AgentLLM Model
1 dir

nvidia-nat-ragas

NVIDIA Corporation

Subpackage for RAGAS evaluators in NVIDIA NeMo Agent Toolkit

Agentai-agents
1 dir

nvidia-nat-rag

NVIDIA Corporation

Subpackage for NVIDIA RAG in NeMo Agent Toolkit

Agentai-agents
1 dir

nvidia-nat-fastmcp

NVIDIA Corporation

Subpackage for FastMCP server integration in NeMo Agent Toolkit

MCP Servermcp
1 dir

nvidia-nat-crewai

NVIDIA Corporation

Subpackage for CrewAI integration in NeMo Agent Toolkit

Agentai-agents
1 dir

nvidia-nat-langchain

NVIDIA Corporation

Subpackage for LangChain/LangGraph integration in NeMo Agent Toolkit

Agentai-agents
1 dir

nvidia-nat-mcp

NVIDIA Corporation

Subpackage for MCP client integration in NeMo Agent Toolkit

MCP Servermcp
1 dir

NVIDIA: Nemotron 3 Super

nvidia

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50% higher token generation compared to leading open models. The model features a 1M token context window for long-term agent coherence, cross-document reasoning, and multi-step task planning. Latent

...more
AgentLLM Model
1 dir

multi-storage-client

NVIDIA Multi-Storage Client Team

Unified high-performance Python client for object and file stores.

Skillai-ml
1 dir

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural Architecture Search (NAS), resulting in enhanced efficiency, reduced memory usage, and improved inference latency. The model supports a context length of up to 128K tokens and can operate efficiently on an 8x NVIDIA H100

...more
AgentLLM Model
1 dir

pylibwholegraph-cu13

NVIDIA Corporation

pylibwholegraph - GPU Graph Storage for GNN feature and graph structure

Skillai-ml
1 dir

pylibwholegraph-cu12

NVIDIA Corporation

pylibwholegraph - GPU Graph Storage for GNN feature and graph structure

Skillai-ml
1 dir

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

...more
AgentLLM Model
1 dir

nvd-claude-nim

NVIDIA

Anthropic Messages → NVIDIA NIM Proxy for Claude Code

Skillai-ml
1 dir

aiperf-nightly

"NVIDIA Inc." <[email protected]>

AIPerf is a package for performance testing of AI models

Skilluncategorised
1 dir

nvidia-ml-py

NVIDIA Corporation

Python Bindings for the NVIDIA Management Library

Skilluncategorised
1 dir

nemo-evaluator

NVIDIA

NeMo Evaluator — benchmark environments, pluggable solvers, interceptor proxy, and decision-grade scoring for LLMs

Skillai-ml
1 dir

NVIDIA: Nemotron 3.5 Content Safety (free)

nvidia

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron 3 Ultra (free)

nvidia

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

...more
AgentLLM Model
1 dir

NVIDIA: Nemotron 3 Ultra

nvidia

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

...more
AgentLLM Model
1 dir