Bartowski PRO
bartowski
AI & ML interests
None yet
Organizations
bartowski's activity
NotebookLLM podcast discovery
1
#9 opened 2 days ago
by
clem
Loading with AutoModelForCausalLM.from_pretrained
1
#2 opened 3 days ago
by
AKQuestSage
Is the adapter merged into the weights?
1
#1 opened 5 days ago
by
bartowski
How did you convert it?
8
#1 opened 7 days ago
by
win10
🚩 Report: Not working
7
#14 opened 8 days ago
by
frank0071
GGUF Quants
2
#1 opened 7 days ago
by
lemon07r
compiled llama.cpp from main on 2024-09-26 and error when loading model
5
#3 opened 9 days ago
by
LaferriereJC
one part
9
#1 opened 19 days ago
by
goodasdgood
Quant request
6
#1 opened 23 days ago
by
EloyOn
Which model is recommended for 4080super 16G video memory?
2
#6 opened 9 days ago
by
decem
Hessian is not invertible
1
#1 opened 15 days ago
by
denru
90b base model + example outputs
1
#1 opened 10 days ago
by
karan4d
Llama-3.2-1B-Instruct To Android
5
#12 opened 10 days ago
by
Heigke
Original model deleted
1
#1 opened 12 days ago
by
SilverFan
This is the fixed version
4
#1 opened 12 days ago
by
bartowski
FIM mode does not work properly, due to missing stop token
2
#3 opened 13 days ago
by
qwp4w3hyb
Promising looking results on 24GB VRAM folks!
5
#3 opened 15 days ago
by
ubergarm
please include "-imat" in the repository title
3
#2 opened 15 days ago
by
AaronFeng753
Could EXL2 quantization hurt multilinguality?
8
#1 opened 17 days ago
by
Handgun1773
BOS Token
2
#1 opened 17 days ago
by
1AH
[Update Readme (Instruct Format)]
2
#1 opened 17 days ago
by
Darkknight535
Q4_0_4_4
13
#2 opened about 1 month ago
by
Yuma42
Possibly the provided prompt format is wrong.
12
#1 opened 18 days ago
by
vevi33
i am trying hf to gguf but there is no config
3
#15 opened 21 days ago
by
Batubatu
split
3
#4 opened 20 days ago
by
goodasdgood
vllm: ....does not appear to have a file named config.json
2
#1 opened 20 days ago
by
paolovic
GGUF for ARM inference?
4
#4 opened 21 days ago
by
AaronFeng753
multi-part model
7
#2 opened 23 days ago
by
goodasdgood
Re-quantize and re-upload model
2
#1 opened 22 days ago
by
mtasic85
LM Studio Says "unknown model architecture"
6
#1 opened 22 days ago
by
alexcardo
vram usage of each?
3
#1 opened 23 days ago
by
jasonden
Aren't you supposed to be on vacation? Hehe.
2
#1 opened 23 days ago
by
neoopus
Fix prompt template
#1 opened about 2 months ago
by
rombodawg
Are the Quants updated?
2
#8 opened 24 days ago
by
RealisticDream
How to know what context window size(n_ctx) one can use on each model ?
2
#13 opened 25 days ago
by
MrktWzrd
Using the model in ctransformers
2
#1 opened 26 days ago
by
PatrickSchwabl
Model Request
2
#1 opened 27 days ago
by
isr431
Having bad results, how should i use this model?
5
#5 opened 29 days ago
by
RamoreRemora
Interview request: genAI evaluation & documentation
3
#14 opened about 1 month ago
by
meggymuggy
Low quants don't seem to work (no <reflection> tags)
6
#3 opened 29 days ago
by
MrHillsss
So, is this based on OG Llama 3 or Llama 3.1?
2
#2 opened 29 days ago
by
XelotX
The original model was updated to fix a bug. Is this repo using the updated version?
2
#4 opened 29 days ago
by
RamoreRemora
does metadata has proper prompt ?
1
#1 opened 29 days ago
by
gopi87
GGUF quantized versions?
8
#4 opened about 1 month ago
by
markne
Update config.json
16
#6 opened about 1 month ago
by
bullerwins
Bartowski! Let's see how your imatrix differs from mine. 😋
5
#2 opened about 1 month ago
by
Joseph717171
Missing <|im_start|> from tokenizer_config.json
3
#3 opened about 1 month ago
by
bartowski
Add <|im_start|> as a special token to tokenizer_config.json
3
#4 opened about 1 month ago
by
bartowski
llama.cpp CPU backend crashes
1
#1 opened about 1 month ago
by
mtasic85
Tiger-Gemma-9B-v2
1
#2 opened about 1 month ago
by
EloyOn
ARM speedup.
2
#1 opened about 1 month ago
by
Midgardsormr
Quantization help?
2
#4 opened about 1 month ago
by
Daemontatox
GGUF Generation Script
3
#3 opened about 1 month ago
by
RonanMcGovern
Q2_K_L vs IQ3_M shows favorable results for Q2_K_L
1
#1 opened about 1 month ago
by
anxcat
The refusal rate is very high, it kind of look like the original censored model
4
#1 opened about 1 month ago
by
sneedingface
How do I GGUF?
2
#1 opened about 1 month ago
by
TheDrummer
Requant request
1
#1 opened about 1 month ago
by
TheDrummer
Lookin for quants
5
#1 opened about 1 month ago
by
lemon07r