Red Hat AI

company

Verified

https://www.redhat.com/en/products/ai

AI & ML interests

OpenSource and AI

Recent Activity

krishnateja95 published a dataset about 5 hours ago

RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-responses

robgreenberg3 updated a collection about 5 hours ago

Intel Xeon-compatible Models

robgreenberg3 updated a model about 5 hours ago

RedHatAI/Qwen3-8B-W8A8-INT8

Papers

SNLP: Layer-Parallel Inference via Structured Newton Corrections

S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation

RedHatAI's models (666)

RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8

Text Generation • 14B • Updated Oct 9, 2024 • 21 • 2

RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a16

Text Generation • 14B • Updated Oct 9, 2024 • 13 • 2

RedHatAI/Phi-3-medium-128k-instruct-FP8

Text Generation • 14B • Updated Oct 9, 2024 • 204 • 5

RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a16

33B • Updated Oct 9, 2024 • 6

RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a16

8B • Updated Oct 9, 2024 • 417

RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a16

0.6B • Updated Oct 9, 2024 • 15

RedHatAI/Qwen2.5-72B-Instruct-quantized.w8a8

73B • Updated Oct 9, 2024 • 263

RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a8

33B • Updated Oct 9, 2024 • 36

RedHatAI/Qwen2.5-32B-quantized.w8a8

33B • Updated Oct 9, 2024 • 7

RedHatAI/Meta-Llama-3.1-405B-Instruct-FP8

Text Generation • 406B • Updated Oct 9, 2024 • 337 • 31

RedHatAI/Qwen2.5-3B-Instruct-quantized.w8a8

3B • Updated Oct 9, 2024 • 31

RedHatAI/Qwen2.5-1.5B-Instruct-quantized.w8a8

2B • Updated Oct 9, 2024 • 11

RedHatAI/SparseLlama-3-8B-pruned_50.2of4

Text Generation • 8B • Updated Oct 4, 2024 • 30

RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic

Text Generation • 89B • Updated Oct 2, 2024 • 1.49k • 11

RedHatAI/Phi-3.5-mini-instruct-FP8-KV

Text Generation • 4B • Updated Oct 1, 2024 • 112 • 2

RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16

Text Generation • 71B • Updated Aug 29, 2024 • 24 • 2

RedHatAI/SmolLM-135M-q

Updated Aug 23, 2024

RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8

Text Generation • 141B • Updated Aug 12, 2024 • 36 • 3

RedHatAI/DeepSeek-Coder-V2-Base-FP8

Text Generation • 236B • Updated Jul 22, 2024 • 58

RedHatAI/DeepSeek-Coder-V2-Instruct-FP8

Text Generation • 236B • Updated Jul 22, 2024 • 592 • 7

RedHatAI/Mistral-Nemo-Instruct-2407-FP8

Text Generation • 12B • Updated Jul 19, 2024 • 19k • 18

RedHatAI/Qwen2-57B-A14B-Instruct-FP8

Text Generation • 57B • Updated Jul 18, 2024 • 889 • 1

RedHatAI/Llama-2-7b-chat-hf-FP8

Text Generation • 7B • Updated Jul 18, 2024 • 104

RedHatAI/Mistral-7B-Instruct-v0.3-FP8

Text Generation • 7B • Updated Jul 18, 2024 • 1.4k • 3

RedHatAI/Qwen2-0.5B-Instruct-FP8

Text Generation • 0.5B • Updated Jul 18, 2024 • 830 • 4

RedHatAI/Qwen2-1.5B-Instruct-FP8

Text Generation • 2B • Updated Jul 18, 2024 • 57k

RedHatAI/Qwen2-7B-Instruct-FP8

Text Generation • 8B • Updated Jul 18, 2024 • 2.6k • 2

RedHatAI/Qwen2-72B-Instruct-FP8

Text Generation • 73B • Updated Jul 18, 2024 • 1.19k • 15

RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8

Text Generation • 47B • Updated Jul 18, 2024 • 105 • 3

RedHatAI/Meta-Llama-3-70B-Instruct-FP8

Text Generation • 71B • Updated Jul 18, 2024 • 1.53k • 13