ollama/convert at b8e8ef8929629ad91c774415b53cbec233fb54c8 - ollama

gsrlabs/ollama

mirror of https://github.com/ollama/ollama.git synced 2026-01-29 07:12:03 +03:00

Jeffrey Morgan 64737330a4 Re-apply "model: add MLA absorption for glm4moelite" with fix (#13870)

The nvidia_fp32 config for (576, 512) head sizes had nbatch_fa=32,
which caused zero-sized arrays when computing array dimensions:
  nbatch_fa / (np * warp_size) = 32 / (2 * 32) = 0

This resulted in CUDA compilation failures on CUDA 12 (Windows and
Linux arm64):
- "static assertion failed with nbatch_fa % (np*warp_size) != 0"
- "the size of an array must be greater than zero"

Fix by changing nbatch_fa from 32 to 64 for all (576, 512) configs
in the nvidia_fp32 function, matching the nvidia_fp16 and AMD configs.
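
The arithmetic in the message above can be checked with a minimal sketch. It assumes only the values quoted in the commit message (np = 2, warp_size = 32) and truncating integer division; the function name rowsPerThread is hypothetical and does not come from the CUDA sources.

```go
package main

import "fmt"

// rowsPerThread evaluates the expression quoted in the commit message,
// nbatch_fa / (np * warp_size), with truncating integer division.
// The name is hypothetical; it is not taken from the CUDA kernels.
func rowsPerThread(nbatchFA, np, warpSize int) int {
	return nbatchFA / (np * warpSize)
}

func main() {
	// Values quoted in the commit message.
	const np, warpSize = 2, 32

	for _, nbatchFA := range []int{32, 64} {
		size := rowsPerThread(nbatchFA, np, warpSize)
		rem := nbatchFA % (np * warpSize)
		// A size of 0 corresponds to the "array must be greater than zero"
		// error; a nonzero remainder corresponds to the static assertion.
		fmt.Printf("nbatch_fa=%d: array size %d, remainder %d\n", nbatchFA, size, rem)
	}
}
```

With nbatch_fa = 32 the quotient truncates to 0 and the remainder is nonzero, which lines up with both reported compiler errors; with nbatch_fa = 64 the quotient is 1 and the remainder is 0.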

2026-01-23 18:40:28 -08:00

chore(all): replace instances of interface with any (#10067)

2025-04-02 09:44:27 -07:00

convert: import support for command-r models from safetensors (#6063)

2025-01-15 16:31:22 -08:00

convert_bert.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_commandr.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_deepseek2.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_deepseekocr.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_gemma2_adapter.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_gemma2.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_gemma3.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_gemma3n.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_gemma.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_glm4moelite.go

Re-apply "model: add MLA absorption for glm4moelite" with fix (#13870)

2026-01-23 18:40:28 -08:00

convert_gptoss.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_lfm2.go

model: add lfm2 architecture and LFM2.5-1.2B-Thinking support (#13792)

2026-01-20 12:20:53 -08:00

convert_llama4.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_llama_adapter.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_llama.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_mistral_causal.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_mistral.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_mixtral.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_mllama.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_nomicbert.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_olmo.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_phi3.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_qwen2.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_qwen3.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_qwen3vl.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_qwen25vl.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert_test.go

Add experimental MLX backend and engine with imagegen support (#13648)

2026-01-08 16:18:59 -08:00

convert.go

model: add lfm2 architecture and LFM2.5-1.2B-Thinking support (#13792)

2026-01-20 12:20:53 -08:00

reader_safetensors.go

deepseekocr

2025-11-18 16:11:37 -08:00

reader_test.go

convert: convert bf16 vision weights to fp16 (#12324)

2025-09-17 17:43:17 -07:00

reader_torch.go

llama4

2025-04-25 16:59:20 -07:00

reader.go

model: add lfm2 architecture and LFM2.5-1.2B-Thinking support (#13792)

2026-01-20 12:20:53 -08:00

sentencepiece_model.proto

all: fix typos in documentation, code, and comments (#7021)

2024-12-10 12:58:06 -08:00

tensor_test.go

fix tensor merge (#13053)

2025-11-13 15:32:34 -08:00

tensor.go

fix tensor merge (#13053)

2025-11-13 15:32:34 -08:00

tokenizer_spm.go

parsers/renderers: functiongemma (#13521)

2025-12-18 07:55:37 -08:00

tokenizer_test.go

model: handle multiple eos tokens (#10577)

2025-05-16 13:40:23 -07:00

tokenizer.go

s#x/exp/maps#maps# (#11506)

2025-07-23 13:23:32 -07:00