Interesting model of LLaMa-3.1-8B-Instruct - explain what you did please - if you can. I'm curious. 😋

by Joseph717171 - opened Aug 9

Aug 9

•

@dnhkng Your model is a breath of fresh air. At first it wanted to fallback on its alignment, but after a short talk, it decided to forego its censorship and engage holistically with me.I'm curious what you did. Not to copy your work. But, to understand what is happening in your model vs Meta's original version. I look forward to your paper when it finally drops. 🤔

dnhkng

Owner Aug 9

•

edited Aug 9

Thanks for the feedback 🤗

The goal was to increase the model intelligence; everything you noticed is just a follow-on from that. I assume you can't be dogmatic about rules and intelligent at the same time.

Can I quote your feedback?

Joseph717171

Aug 10

•

edited Aug 10

Be my guest. Your assumption holds true. 😁

dnhkng

Owner Aug 10

Did you notice anything else?

Joseph717171

Aug 11

I only played with it a little bit. I'll test it more today. 😋

Joseph717171

Aug 11

•

edited Aug 12

Much of my testing involves multi-turn chats and testing for understanding of implication and nuance (e.g: (E)RP). From my simple testing, your model will respond well, if I give it the right prompts to steer it; however, without the right prompts, it doesn’t understand or doesn’t acknowledge the nuance and implications I hint at. Interesting… 🤔

Joseph717171

Aug 20

•

edited Aug 20

@dnhkng Can you make a RYS of NousResearch Hermes-3.1-8B? I feel like it would be a good candidate for your research to compare to your LLaMA-3.1-8B-Instruct model. 🤔

https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B

dnhkng

Owner Aug 20

On holiday at the moment, happy to try in early September.

Joseph717171

Aug 21

Enjoy your holiday! 😋

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment