You Don't Need Data-Augmentation in Self-Supervised Learning Paper • 2406.09294 • Published Jun 13 • 1
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations Paper • 2408.08459 • Published Aug 15 • 44
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Paper • 2406.09415 • Published Jun 13 • 50
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8 • 154
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Paper • 2407.15811 • Published Jul 22 • 1
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper • 2407.12327 • Published Jul 17 • 76
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper • 2407.08296 • Published Jul 11 • 31
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling Paper • 2407.02880 • Published Jul 3 • 11
An Image is Worth 32 Tokens for Reconstruction and Generation Paper • 2406.07550 • Published Jun 11 • 55
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models Paper • 2401.07957 • Published Jan 15 • 1