REPOEXEC: Evaluate Code Generation with a Repository-Level Executable Benchmark Paper • 2406.11927 • Published Jun 17 • 11
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs Paper • 2410.01999 • Published 11 days ago • 8
Dopamin Collection Transformer-based Comment Classifiers through Domain Post-training and Multi-level layer aggregation • 21 items • Updated Dec 11, 2023 • 1