Skip to main content
Inferix
Modelmlx-community/

GLM-4.7-Flash-8bit

text-generationmlximportedhuggingfaceminiomlxLicense: mit
0 downloads0 likesUpdated 6/23/2026

No model card yet.

License

mit

Tasks

text-generation

Frameworks

mlx

Hardware-aware snippets

Runtime-specific quick starts for mlx-community/GLM-4.7-Flash-8bit. Detected hardware: No explicit hardware metadata.

Task: text-generationTransformers

NVIDIA CUDA path

Best default for NVIDIA GPUs and hosted accelerator nodes.

pip install torch transformers accelerate

python - <<'PY'
from transformers import pipeline

model_id = "mlx-community/GLM-4.7-Flash-8bit"
pipe = pipeline(
    task="text-generation",
    model=model_id,
    device_map="auto",
    model_kwargs={"torch_dtype": "auto"},
)
print(pipe("Hello from Inferix"))
PY

Model lineage

1 repos

Finetuned

1
GLM-4.7-Flash-8bitselected

Quantizations

0

No quantization variants detected.

Model info

Namespace

mlx-community

Visibility

public

Downloads

0

Likes

0

License: mit

Evaluation results

Entries

0

Tasks

0

Datasets

0

No evaluation results published yet.

Metadata links

Trained on datasets

No linked datasets

Linked papers

No linked papers

Inference providers

0/0 fit

Not available for inference yet.

Spaces using this model

No spaces linked yet.

Part of collections

No public collections include this model.

Tags

importedhuggingfaceminiomlxsafetensorsglm4_moe_litetext-generationconversationalenzhbase_model:zai-org/GLM-4.7-Flashbase_model:quantized:zai-org/GLM-4.7-Flashlicense:mit8-bitregion:us