Mirror of https://github.com/ollama/ollama.git (synced 2026-01-29 23:32:02 +03:00)
Compare commits
base: gsrlabs:jmorganca/cloud-errors
...
compare: gsrlabs:mattw/quantcontext
gsrlabs:v0.1.16
gsrlabs:v0.1.15
gsrlabs:v0.1.14
gsrlabs:v0.1.13
gsrlabs:v0.1.12
gsrlabs:v0.1.11
gsrlabs:v0.1.10
gsrlabs:v0.1.9
gsrlabs:v0.1.8
gsrlabs:v0.1.7
gsrlabs:v0.1.6
gsrlabs:v0.1.5
gsrlabs:v0.1.4
gsrlabs:v0.1.3
gsrlabs:v0.1.2
gsrlabs:v0.1.1
gsrlabs:v0.1.0
gsrlabs:v0.0.21
gsrlabs:v0.0.20
gsrlabs:v0.0.19
gsrlabs:v0.0.18
gsrlabs:v0.0.17
gsrlabs:v0.0.16
gsrlabs:v0.0.15
gsrlabs:v0.0.14
gsrlabs:v0.0.13
gsrlabs:v0.0.12
gsrlabs:v0.0.11
gsrlabs:v0.0.10
gsrlabs:v0.0.9
gsrlabs:v0.0.8
gsrlabs:v0.0.7
gsrlabs:v0.0.6
gsrlabs:v0.0.5
gsrlabs:v0.0.4
gsrlabs:v0.0.3
gsrlabs:v0.0.2
gsrlabs:v0.0.1
2 Commits
jmorganca/
...
mattw/quan
| Author | SHA1 | Message | Date |
|---|---|---|---|
| | fed3843be2 | update to resolve jmorganca comments Signed-off-by: Matt Williams <m@technovangelist.com> | |
| | 01d4047ed3 | add faq about quant and context Signed-off-by: Matt Williams <m@technovangelist.com> | |
1 changed files with 23 additions and 0 deletions
23
docs/faq.md
@@ -112,3 +112,26 @@ This can impact both installing Ollama, as well as downloading models.
Open `Control Panel > Network and Internet > View network status and tasks` and click on `Change adapter settings` on the left panel. Find the `vEthernet (WSL)` adapter, right click and select `Properties`.
Click on `Configure` and open the `Advanced` tab. Search through each of the properties until you find `Large Send Offload Version 2 (IPv4)` and `Large Send Offload Version 2 (IPv6)`. *Disable* both of these properties.

## What does the q in the model tag mean? What is quantization?

Whenever you pull a model without a tag, Ollama actually pulls the q4_0 quantization of the model. You can verify this on the tags page: on https://ollama.ai/library/llama2/tags, the hash for the `latest` tag matches the hash for the 7b model.

Looking at that page for any model, you can see several quantization options available. Quantization is a method of compression that allows the model to fit in less space and thus use less RAM and VRAM on your machine.

At a high level, a model is made of an enormous collection of nodes that determine how to generate text. These nodes are connected at different levels with weights. The training process adjusts these weights so that the model produces appropriate text.

Most of the source models that we use start with weights that are 32-bit floating-point numbers. Those weights, and another concept called biases, add up to be the parameters. So a source model with 7 billion parameters has 7 billion 32-bit floating-point numbers, plus a description of all the nodes and more. At 4 bytes per parameter, that adds up to needing at least 28 gigabytes of memory to load, if you choose to load one of those source models.
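
The arithmetic behind that figure can be sketched in a few lines. This is only a rough estimate of the raw weights, assuming 1 GB = 10^9 bytes; the helper name `weights_size_gb` is made up for this illustration and is not part of Ollama.

```python
def weights_size_gb(num_params: int, bits_per_weight: float) -> float:
    """Approximate size of the raw weights in gigabytes (1 GB = 10**9 bytes).

    Hypothetical helper for illustration only. It ignores everything a real
    model file also stores, such as metadata and the tokenizer.
    """
    total_bytes = num_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / 1e9


# 7 billion parameters stored as 32-bit floats
print(weights_size_gb(7_000_000_000, 32))
```

The same function works for any bit width, so it can also be used to compare the source weights against smaller representations.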

Quantization turns those 32-bit floating-point weights into much smaller integers. The number next to the q indicates the bit size of the weights. So a q4 model has had those 32-bit floats converted into 4-bit integers. A 4-bit quantization takes up the space for 7 billion 4-bit integers, plus a little overhead. That comes out to almost 4 gigabytes. Obviously, there is some loss of information in going from 28GB to 4GB, but it turns out that in most cases it isn't really noticeable. In fact, even the 2-bit quantization, which fits in less than 3GB, can be very useful.
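
To make the idea of "floats into small integers" concrete, here is a toy sketch. It is not the method used to build the models in the library: it assumes the simplest possible scheme, where one shared scale maps every weight to a signed 4-bit integer in the range -8 to 7. The function names and sample weights are hypothetical.

```python
def quantize_4bit(weights):
    """Map floats to signed 4-bit integers in [-8, 7] using one shared scale.

    Toy scheme for illustration; real quantization formats are more involved.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 7 if max_abs else 1.0  # largest magnitude maps to 7
    quantized = [max(-8, min(7, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate floats from the integers and the shared scale."""
    return [q * scale for q in quantized]


# Hypothetical weights, chosen only for the example
weights = [0.12, -0.53, 0.98, -0.05, 0.33]
quantized, scale = quantize_4bit(weights)
restored = dequantize(quantized, scale)

# The round trip is lossy: each value is off by at most half a step (scale / 2)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(quantized, scale, max_error)
```

Each integer fits in 4 bits instead of 32, and only the single scale has to be kept as a float. The small round-trip error is the "loss of information" described above.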

There are three major sets of quantizations you will see in the Ollama Library of models: **fp16**, models with just a q and a number, like **q4_0**, and models with a **K** in the tag. The **fp16** model is one that has been converted from the source 32-bit weights to 16-bit. It is about half the size of the 32-bit source model and is the largest quantization we deliver in the library. The **q4_0**, **q4_1**, **q5_0**, etc. models use the two original quantization methods.

The models with a **K** are often referred to as K Quants. This method produces models of similar quality to the original methods at a smaller size. Essentially, it finds clusters of weights and quantizes those together, allowing for higher precision while using the same bit sizes as the regular quantization options. But this requires a set of maps for the model to figure out the original values, which has a computational cost. You may see some impact on the speed of models with K quants compared to the regular quantizations.
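
The benefit of quantizing groups of weights together can be illustrated with a small sketch. This is not the actual K quant algorithm. It only assumes that weights are split into fixed-size blocks and that each block gets its own scale, then compares the result with using a single scale for everything. All names and numbers here are hypothetical.

```python
def quantize_roundtrip(values, bits=4):
    """Quantize values with one shared scale, then reconstruct the floats."""
    qmax = 2 ** (bits - 1) - 1  # 7 for 4-bit signed integers
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs else 1.0
    return [max(-qmax - 1, min(qmax, round(v / scale))) * scale for v in values]


def blockwise_roundtrip(values, block_size, bits=4):
    """Same as above, but every block of `block_size` values gets its own scale."""
    restored = []
    for start in range(0, len(values), block_size):
        restored.extend(quantize_roundtrip(values[start:start + block_size], bits))
    return restored


def mean_abs_error(original, restored):
    """Average absolute difference between original and reconstructed values."""
    return sum(abs(a - b) for a, b in zip(original, restored)) / len(original)


# Hypothetical weights: a group of small values followed by a group of large ones
weights = [0.01, -0.02, 0.03, -0.015, 0.9, -1.2, 0.7, -0.4]

global_err = mean_abs_error(weights, quantize_roundtrip(weights))
block_err = mean_abs_error(weights, blockwise_roundtrip(weights, block_size=4))
print(global_err, block_err)
```

With a single scale, the small weights all collapse to zero because the large weights dictate the step size. Per-block scales keep them distinct at the same bit width, at the price of storing and applying one extra scale per block, which is one way to picture the extra "maps" and the computational cost mentioned above.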

## What is context, can I increase it, and why doesn't every model support a huge context?

Context refers to the size of the input you can send to a model and still get sensible output back. Many models have a context size of 2048 tokens. It's sometimes possible to give a model more using the **num_ctx** parameter, but the answers start to degrade. This is because, when the context fills up, half of it is "freed" to make room for more. Newer models have been able to increase that context size using different methods. This increase in context size results in a corresponding increase in memory required, sometimes by orders of magnitude.

> [!WARNING]
> Currently, over-allocating context size may result in model quality or stability issues.
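
As a minimal sketch of setting **num_ctx** per request, the snippet below builds a request for Ollama's `/api/generate` endpoint and passes the value under `options`. It assumes a server listening on the default local address, and the model name, prompt, and the value 4096 are placeholders chosen for the example, not recommendations.

```python
import json
import urllib.request

# Assumed default local address; change it if your server listens elsewhere
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_payload(model: str, prompt: str, num_ctx: int) -> dict:
    """Build the JSON body for a non-streaming generate request."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": {"num_ctx": num_ctx},  # context size in tokens
    }


def generate(model: str, prompt: str, num_ctx: int) -> str:
    """Send the request to a running Ollama server and return the reply text."""
    payload = build_generate_payload(model, prompt, num_ctx)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `generate("llama2", "Why is the sky blue?", num_ctx=4096)` requires the server to be running with that model pulled. Keep the warning above in mind when choosing a value larger than the model was built for.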