CommitBench: A Benchmark for Commit Message Generation
CoRR(2024)
摘要
Writing commit messages is a tedious daily task for many software developers,
and often remains neglected. Automating this task has the potential to save
time while ensuring that messages are informative. A high-quality dataset and
an objective benchmark are vital preconditions for solid research and
evaluation towards this goal. We show that existing datasets exhibit various
problems, such as the quality of the commit selection, small sample sizes,
duplicates, privacy issues, and missing licenses for redistribution. This can
lead to unusable models and skewed evaluations, where inferior models achieve
higher evaluation scores due to biases in the data. We compile a new
large-scale dataset, CommitBench, adopting best practices for dataset creation.
We sample commits from diverse projects with licenses that permit
redistribution and apply our filtering and dataset enhancements to improve the
quality of generated commit messages. We use CommitBench to compare existing
models and show that other approaches are outperformed by a Transformer model
pretrained on source code. We hope to accelerate future research by publishing
the source code( https://github.com/Maxscha/commitbench ).
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要