No model card yet.
License
gemma
Tasks
image-text-to-text
Frameworks
transformers
Hardware-aware snippets
Runtime-specific quick starts for RedHatAI/gemma-3-12b-it-quantized.w4a16. Detected hardware: No explicit hardware metadata.
Task: image-text-to-textTransformers
NVIDIA CUDA path
Best default for NVIDIA GPUs and hosted accelerator nodes.
pip install torch transformers accelerate
python - <<'PY'
from transformers import pipeline
model_id = "RedHatAI/gemma-3-12b-it-quantized.w4a16"
pipe = pipeline(
task="image-text-to-text",
model=model_id,
device_map="auto",
model_kwargs={"torch_dtype": "auto"},
)
print(pipe("Hello from Inferix"))
PYModel lineage
1 reposBase model
google/gemma-3-12b-itFinetuned
1gemma-3-12b-it-quantized.w4a16selected
Quantizations
0No quantization variants detected.
Model info
Namespace
RedHatAI
Visibility
public
Downloads
0
Likes
0
License: gemma
Evaluation results
Entries
0
Tasks
0
Datasets
0
No evaluation results published yet.
Metadata links
Inference providers
0/0 fitNot available for inference yet.
Spaces using this model
No spaces linked yet.
Part of collections
No public collections include this model.
Tags
importedhuggingfaceminiotransformerssafetensorsgemma3image-text-to-textvllmvisionw4a16conversationalbase_model:google/gemma-3-12b-itbase_model:quantized:google/gemma-3-12b-itlicense:gemmatext-generation-inferenceendpoints_compatiblecompressed-tensorsregion:us