on koboldcpp 6k model doesn't respond correctly at all.

by kex243 - opened Jul 23, 2023

Jul 23, 2023

•

edited Jul 23, 2023

Sad, it's size and 8k context sounds good. But it goes into one token loops every time from first mesage. like "AsAsA"... Maybe there is a solution? I tried q6 k and q4 k m, both same error.
seems my errors with ram memory.

kex243 changed discussion status to closed Jul 23, 2023

Jutopia

Aug 13, 2023

You are not alone; I am experiencing the same problem with the 30b q4_k_s as well.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment