Mirror of https://github.com/ollama/ollama.git (synced 2026-01-29 07:12:03 +03:00)
Add --quantize fp4 support to ollama create for image generation models (flux2, z-image-turbo), using MLX's affine 4-bit quantization.

Changes:
- Add fp4 to validation in CreateImageGenModel
- Add FP4 case to quantizeTensor (group_size=32, bits=4, affine mode)
- Add GetQuantization() to WeightSource interface for dynamic quantization params
- Update LoadLinearLayer to use quantization params from model metadata
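To illustrate what the FP4 path computes, here is a minimal, self-contained sketch of affine 4-bit quantization in Go. It is not the actual quantizeTensor implementation; the function name `quantizeAffine4Bit` and its layout (flat weights, per-group scale and bias slices) are assumptions for illustration. The idea matches the parameters above: groups of 32 values, 4 bits per value, and an affine mapping (scale plus zero point per group).

```go
package main

import (
	"fmt"
	"math"
)

// quantizeAffine4Bit maps each group of groupSize float32 weights to
// integer levels in [0, 15] using a per-group affine transform:
//   q = round((w - bias) / scale), with bias = group min and
//   scale = (max - min) / 15 (15 = 2^4 - 1 quantization levels).
// Dequantization is the inverse: w ≈ q*scale + bias.
func quantizeAffine4Bit(w []float32, groupSize int) (q []uint8, scales, biases []float32) {
	for start := 0; start < len(w); start += groupSize {
		end := start + groupSize
		if end > len(w) {
			end = len(w)
		}
		group := w[start:end]

		// Per-group min/max define the affine range.
		lo, hi := group[0], group[0]
		for _, v := range group {
			if v < lo {
				lo = v
			}
			if v > hi {
				hi = v
			}
		}
		scale := (hi - lo) / 15
		if scale == 0 {
			scale = 1 // constant group: any scale works, avoid div-by-zero
		}
		scales = append(scales, scale)
		biases = append(biases, lo)

		for _, v := range group {
			level := math.Round(float64((v - lo) / scale))
			q = append(q, uint8(level))
		}
	}
	return q, scales, biases
}

func main() {
	// 64 weights -> 2 groups of 32 -> 2 scale/bias pairs.
	w := make([]float32, 64)
	for i := range w {
		w[i] = float32(i) * 0.1
	}
	q, scales, biases := quantizeAffine4Bit(w, 32)
	fmt.Println(len(q), len(scales), len(biases)) // 64 2 2

	// Round-trip the first weight: q*scale + bias.
	fmt.Printf("%.2f\n", float32(q[0])*scales[0]+biases[0])
}
```

In a real pipeline the 4-bit levels would additionally be packed two per byte, and GetQuantization() would let LoadLinearLayer pick these group_size/bits values from model metadata instead of hard-coding them.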