abdullah (Abdullah Abdelrhim)

upvoted a collection 2 days ago

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 6 items • Updated 3 days ago • 8

upvoted a collection 5 days ago

Llama 3.2 3B & 1B GGUF Quants

Collection

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated 10 days ago • 40

upvoted an article 8 days ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

By

•

8 days ago

• 31

upvoted a collection 10 days ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 10 days ago • 218

upvoted an article 12 days ago

Article

Document Similarity Search with ColPali

By

•

14 days ago

• 36

upvoted a paper 13 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 17 days ago • 121

upvoted a paper 15 days ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 16 days ago • 128

upvoted a paper 17 days ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published 19 days ago • 37

upvoted a paper 18 days ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published 19 days ago • 33

upvoted a collection 18 days ago

MagpieLM

Collection

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 13 days ago • 13

upvoted 3 papers about 1 month ago

upvoted a collection about 1 month ago

ArabianLLM Series | Native Arabic Large Language Models

Collection

This collection is related to native Arabic Large Language Models.. It represent different sizes of GPT trained Model for Test Generative • 8 items • Updated Aug 26 • 2

upvoted an article about 1 month ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 81

upvoted 3 papers about 2 months ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13 • 65

Better Alignment with Instruction Back-and-Forth Translation

Paper • 2408.04614 • Published Aug 8 • 14

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 115

upvoted a paper 2 months ago

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Paper • 2407.18961 • Published Jul 18 • 38

upvoted an article 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29

• 212

upvoted an article 3 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 244

upvoted a collection 3 months ago

InternLM2.5

Collection

14 items • Updated 21 days ago • 68

upvoted a collection 4 months ago

MatMulfree LM

Collection

Pre-trined models for Matmulfree LM. • 4 items • Updated Jun 10 • 24

upvoted a paper 4 months ago

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29 • 10

upvoted a collection 4 months ago

sentence-transformers-from-synthetic-data

Collection

Example of using distilabel to generate synthetic triplets data for fine-tuning a Sentence Transformer model • 4 items • Updated Jun 21 • 21

upvoted a paper 5 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20 • 33

upvoted a collection 5 months ago

Wikimedia Datasets

Collection

Wikimedia datasets, across languages and modalities, from different Wikimedia projects, on the hub. Not all tested. • 19 items • Updated May 16 • 9

upvoted an article 5 months ago

Article

Introducing the Open Arabic LLM Leaderboard

May 14

• 64

upvoted 4 papers 5 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 13

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30 • 46

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30 • 73

upvoted an article 5 months ago

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

By

•

Apr 29

• 28

upvoted a paper 5 months ago

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21 • 28

upvoted a collection 5 months ago

Text-to-text Generation Models (LLMs, Llama, GPT, ...)

Collection

5130 items • Updated Aug 23 • 12

upvoted an article 5 months ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

By

•

Jun 4

• 69

upvoted 2 papers 6 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 251

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 53

upvoted an article 6 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 221

upvoted a collection 6 months ago

Multilingual LLMs Chat Spaces

Collection

Here you find Chat spaces to interact and test multilingual models but the goal here is to test on Arabic • 3 items • Updated May 24 • 1

upvoted 5 papers 6 months ago

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Paper • 2404.04167 • Published Apr 5 • 12

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 24

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 104

Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition

Paper • 2404.00565 • Published Mar 31 • 6

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2 • 56

upvoted a collection 6 months ago

A little guide to building Large Language Models in 2024

Collection

Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 19 items • Updated Apr 1 • 14

upvoted 2 papers 6 months ago

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29 • 34

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26 • 28

upvoted 2 collections 7 months ago

🔮 Mixture of Experts

Collection

MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y • 13 items • Updated Aug 16 • 22

Preference Datasets for KTO

Collection

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Jul 30 • 14

upvoted 4 papers 7 months ago

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 45

Algorithmic progress in language models

Paper • 2403.05812 • Published Mar 9 • 18

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 60

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11 • 53

upvoted a collection 7 months ago

Awesome Document AI

Collection

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11 • 70

upvoted 5 papers 7 months ago

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 61

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 63

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 182

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

Paper • 2403.02775 • Published Mar 5 • 11

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

Abdullah Abdelrhim

AI & ML interests

Organizations

abdullah's activity

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Document Similarity Search with ColPali

The 5 Most Under-Rated Tools on Hugging Face

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

SmolLM - blazingly fast and remarkably powerful

Introducing the Open Arabic LLM Leaderboard

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Fine-tune Llama 3 with ORPO