Modelnvidia/

Llama-3.3-70B-Instruct-FP8

importedhuggingfaceminiosafetensors

2 downloads0 likesUpdated 6/23/2026

No model card yet.

Hardware-aware snippets

Runtime-specific quick starts for nvidia/Llama-3.3-70B-Instruct-FP8. Detected hardware: No explicit hardware metadata.

Task: text-generationTransformers

NVIDIA CUDA path

Best default for NVIDIA GPUs and hosted accelerator nodes.

pip install torch transformers accelerate

python - <<'PY'
from transformers import pipeline

model_id = "nvidia/Llama-3.3-70B-Instruct-FP8"
pipe = pipeline(
    task="text-generation",
    model=model_id,
    device_map="auto",
    model_kwargs={"torch_dtype": "auto"},
)
print(pipe("Hello from Inferix"))
PY

Model lineage

1 repos

Base model

meta-llama/Llama-3.3-70B-Instruct

Finetuned

Llama-3.3-70B-Instruct-FP8selected

Quantizations

No quantization variants detected.

Model info

Namespace

nvidia

Visibility

public

Downloads

Likes

Evaluation results

Entries

Tasks

Datasets

No evaluation results published yet.

Metadata links

Base model

meta-llama/Llama-3.3-70B-Instruct

Trained on datasets

No linked datasets

Linked papers

No linked papers

Inference providers

0/0 fit

Not available for inference yet.

Spaces using this model

No spaces linked yet.

Part of collections

No public collections include this model.