mlx-vlm v0.6.3 发布，首发支持 DiffusionGemma 与 North Mini Code 1.0

OmniTools 6月11日消息，mlx-vlm v0.6.3 正式发布，首次实现对 Google DeepMind DiffusionGemma 和 Cohere North Mini Code 1.0 的原生支持。

DiffusionGemma 采用新型架构：以 256 token 块为单位并行生成，支持双向注意力与迭代自纠错；其 26B MoE 模型仅激活 3.8B 参数，量化后可在 18GB 显存下运行。North Mini Code 1.0 为 30B MoE 模型，仅激活 3B 参数，在 BF16 精度下推理速度约 66 tok/s。

两款模型均通过深度协作实现 Day-0 MLX 支持，可在 Apple Silicon Mac 设备本地运行。用户可通过命令 uv pip install -U mlx-vlm 安装最新版本。