No model card yet.
License
mit
Tasks
text-generation
Frameworks
mlx
Hardware-aware snippets
Runtime-specific quick starts for mlx-community/GLM-4.7-Flash-8bit. Detected hardware: No explicit hardware metadata.
Task: text-generationTransformers
NVIDIA CUDA path
Best default for NVIDIA GPUs and hosted accelerator nodes.
pip install torch transformers accelerate
python - <<'PY'
from transformers import pipeline
model_id = "mlx-community/GLM-4.7-Flash-8bit"
pipe = pipeline(
task="text-generation",
model=model_id,
device_map="auto",
model_kwargs={"torch_dtype": "auto"},
)
print(pipe("Hello from Inferix"))
PYModel lineage
1 reposBase model
zai-org/GLM-4.7-FlashFinetuned
1GLM-4.7-Flash-8bitselected
Quantizations
0No quantization variants detected.
Model info
Namespace
mlx-community
Visibility
public
Downloads
0
Likes
0
License: mit
Evaluation results
Entries
0
Tasks
0
Datasets
0
No evaluation results published yet.
Metadata links
Inference providers
0/0 fitNot available for inference yet.
Spaces using this model
No spaces linked yet.
Part of collections
No public collections include this model.
Tags
importedhuggingfaceminiomlxsafetensorsglm4_moe_litetext-generationconversationalenzhbase_model:zai-org/GLM-4.7-Flashbase_model:quantized:zai-org/GLM-4.7-Flashlicense:mit8-bitregion:us