
Request for quantization and Ollama support #206

Open
jkfnc opened this issue Jun 13, 2024 · 1 comment

Comments

@jkfnc

jkfnc commented Jun 13, 2024

It would be great to have Ollama support (it now supports JSON mode) and quantized 4-bit models.
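For context, Ollama's JSON mode is exposed through its REST chat endpoint. A minimal sketch, assuming a local Ollama server on the default port and a model already imported under the hypothetical tag "functionary":

```python
import json
import requests

# Minimal sketch of Ollama's JSON mode via its local REST API.
# Assumptions: an Ollama server on the default port, and a model
# imported under the hypothetical tag "functionary".
resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "functionary",  # hypothetical model tag, not an official one
        "messages": [{"role": "user", "content": "List three EU capitals as JSON."}],
        "format": "json",        # constrains the reply to valid JSON
        "stream": False,
    },
    timeout=120,
)
print(json.loads(resp.json()["message"]["content"]))
```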

@jeffreymeetkai
Collaborator

Hi, we currently have neither plans nor the bandwidth to implement Ollama support, though we may consider it in the (near?) future. Nevertheless, feel free to start on this if you are interested in contributing!

Regarding quantized models, all our models are listed here and available here. They all come with GGUF and AWQ variants.
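If you want to try a quantized variant before any Ollama integration lands, a minimal sketch using llama-cpp-python to load a GGUF file follows; the repo id and filename pattern are assumptions, so check the model card for the exact names.

```python
from llama_cpp import Llama

# Minimal sketch of loading a 4-bit GGUF variant with llama-cpp-python.
# The repo id and filename pattern below are assumptions; check the
# Hugging Face model card for the exact names.
llm = Llama.from_pretrained(
    repo_id="meetkai/functionary-small-v2.4-GGUF",  # assumed repo id
    filename="*Q4_0.gguf",                          # pick a 4-bit quantization
    n_ctx=4096,
)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```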
