Which coding agent delivers the strongest real-world performance on your codebase, accounting for
accuracy, consistency, and speed?
Using statistically rigorous benchmarking on your selected repository, this report identifies which coding agents
are best-in-class for your specific codebase, rather than relying on generic or one-size-fits-all benchmarks.
1.2 Key takeaways
Best overall agent
Cursor CLI + Composer 1 is the clear Tier 1 performer by Sigmascore, making it the most well-rounded agent on the
democorp/admin-portal repository.
Accuracy leader
Codex CLI + GPT-5.1 Codex Max achieves the highest median Accuracy overall.
Cursor CLI + Composer 1 is not far behind, though it is very likely to be less accurate on average.
Speed dominance
Cursor CLI is dramatically faster than every alternative, completing tasks roughly 6x faster than the next-best agent by Sigmascore (Codex CLI + GPT-5.1 Codex Max) on your codebase. This speed advantage substantially boosts productivity in interactive
workflows; for asynchronous or automated workflows it is less critical.
Steep performance/speed tradeoff
Codex CLI + GPT-5.1 Codex Max offers the best Accuracy and exceptional Consistency on your codebase, but it
is also by far the slowest agent in this evaluation.
Better performance than the global leaderboard
The democorp/admin-portal repository appears particularly well-suited to agentic coding; we've
observed stronger performance from virtually all agents across all metrics compared to the global Sigmabench
leaderboard.
1.3 Why this benchmark matters over time
This benchmark should be viewed as a point-in-time measurement, not a static verdict.
Agentic coding performance is sensitive to:
model and agent releases,
repository evolution,
and—critically—how agents are configured, instructed, and contextualized within your organization.
Practices such as improved context engineering, higher-quality agent instructions, better task decomposition, and
refined workflows can materially change agent behavior and outcomes over time. In many organizations, these changes
have a larger impact on real-world performance than model upgrades alone.
Because this benchmark is run directly against your codebase using statistically rigorous methods, it can be repeated over time to:
evaluate new model or agent versions as they are released,
measure the impact of internal agentic best practices,
and track whether your organization's use of coding agents is improving, stagnating, or regressing.
In this sense, the benchmark functions not just as a comparison tool, but as an operational feedback loop for agent
adoption and maturity.
1.4 Recommended actions
Primary recommendation
Cursor CLI + Composer 1
Best-in-class overall performance, unmatched speed, and consistent Tier 1 results make this the default recommendation
for most use cases.
Secondary recommendation (accuracy-first option)
Codex CLI + GPT-5.1 Codex Max
Recommended when maximum accuracy is prioritized over speed and iteration latency, for example:
Asynchronous workflows: Codex CLI + GPT-5.1 Codex Max
Accuracy-sensitive work: Codex CLI + GPT-5.1 Codex Max
1.5 How to use this report
Begin with Section 3: Benchmark Results to identify Tier 1 agents using confidence-aware tiers.
Make decisions based on tiers, not raw scores.
Use commit- and run-level analyses (available in the archive that accompanies this report) to audit results, understand failure modes, or support internal decision-making.
2. Benchmark Overview
This section describes what was benchmarked, how it was selected, and which agent-model combinations were evaluated, providing the necessary context to interpret the results in later sections.
This benchmark and document were run and prepared in accordance with the Sigmabench benchmark methodology v1
(December 2025). All versions of our benchmark methodology are specified in detail on our website; see
https://sigmabench.com/methodology.
2.1 Repository & Commits
This benchmark evaluated coding agents on the democorp/admin-portal repository.
Repository Stats
Size: ~60k LOC, ~30k commits
Primary Language: JavaScript
Description: Web-based administrative interface for internal DemoCorp teams.
Commit Selection
We selected the following:
15 representative feature-development commits
Focused on meaningful, non-trivial changes that reflect day-to-day engineering work
Each commit was then executed 10 times per agent-model combination, yielding 150 runs per combination (600 runs across the four combinations tested).
This repetition enables statistically meaningful estimates of performance and variability.
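As an illustration of how repeated runs translate into a point estimate and an uncertainty range, the sketch below computes a per-commit median score and a percentile-bootstrap 95% confidence interval. This is a generic statistical illustration, not necessarily the exact estimator Sigmabench uses, and the per-run scores are placeholder values.

```python
# Illustrative only: median + percentile-bootstrap 95% CI over repeated runs.
# The per-run scores below are placeholders, not benchmark data.
import random
import statistics

run_scores = [0.8, 1.0, 0.6, 1.0, 0.9, 1.0, 0.7, 1.0, 0.8, 1.0]  # 10 runs of one commit

def bootstrap_ci(scores, n_resamples=10_000, alpha=0.05):
    """Percentile bootstrap confidence interval for the median."""
    medians = []
    for _ in range(n_resamples):
        resample = random.choices(scores, k=len(scores))  # resample with replacement
        medians.append(statistics.median(resample))
    medians.sort()
    lo = medians[int((alpha / 2) * n_resamples)]
    hi = medians[int((1 - alpha / 2) * n_resamples) - 1]
    return lo, hi

point = statistics.median(run_scores)
low, high = bootstrap_ci(run_scores)
print(f"median = {point:.2f}, 95% CI = [{low:.2f}, {high:.2f}]")
```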
Selection Criteria
Commits were chosen using a combination of automated filtering and human review, with the
following constraints:
Each selected commit represents a task that a human developer might plausibly ask an AI coding agent to implement.
2.2 Agents & Models Tested
The benchmark evaluated four agent-model combinations, selected to reflect a cross-section of widely used
modern coding agents.
Agent       | Version            | Model
Cursor CLI  | 2025.11.25-d5b3271 | Composer 1
Gemini CLI  | 0.21.1             | Gemini 3 Pro (Preview)
Codex CLI   | 0.63.0             | GPT-5.1 Codex Max
Claude Code | 2.0.56             | Opus 4.5
Configuration notes
All agents were run using their default configurations
Reasoning / thinking modes (where applicable) were left unchanged from agent defaults
No prompt engineering, custom system instructions, or manual tuning was applied
This choice is intentional: the goal of Sigmabench is to evaluate out-of-the-box developer experience, not best-case
performance under heavy customization.
As a result, observed differences reflect:
Agent design choices
Default reasoning strategies
Model-agent integration quality
2.3 Benchmark metrics (high-level overview)
Sigmabench evaluates coding agents across three orthogonal performance dimensions, designed to jointly capture
real-world usefulness rather than optimizing for a single metric.
Accuracy
Measures how often an agent successfully completes the requested change.
A run is considered successful if the generated code meaningfully matches the reference implementation.
Consistency
Computed only over failed runs.
Rewards partial correctness and relevant intermediate work.
Distinguishes agents that fail “usefully” from those that produce irrelevant or misleading output.
Speed
Measures how quickly an agent produces results.
Based on the proportion of the maximum allowed execution time used.
Faster agents score higher, all else equal.
Captures iteration speed and developer feedback latency.
Log-scaled, because speed differences are better interpreted as multipliers (2x, 10x) than as absolute deltas.
Sigmascore (Composite)
Unweighted geometric mean of Accuracy, Consistency, and Speed (see the illustrative sketch after this list).
This design ensures that:
Strong performance requires balance across all dimensions.
Severe weakness in any one area materially lowers the overall score.
No single metric can dominate the final ranking.
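As a concrete illustration of the composite, here is a minimal sketch in Python of an unweighted geometric mean over the three per-run metric scores, each expressed as a fraction of 1. Plugging in the published per-metric medians for Codex CLI lands close to its reported 44.5% Sigmascore, though the leaderboard reports the median of per-run composites, so this is only an approximation.

```python
# Minimal sketch of the Sigmascore composite: an unweighted geometric mean
# of Accuracy, Consistency, and Speed, each expressed as a fraction of 1.
import math

def sigmascore(accuracy: float, consistency: float, speed: float) -> float:
    """Unweighted geometric mean of three scores in [0, 1]."""
    scores = (accuracy, consistency, speed)
    if any(s <= 0 for s in scores):
        return 0.0  # a zero in any dimension zeroes the composite
    return math.prod(scores) ** (1 / 3)

# Published per-metric medians for Codex CLI / GPT-5.1 Codex Max:
print(f"{sigmascore(0.800, 1.000, 0.110):.3f}")  # ~0.445, i.e. ~44.5%
```

Because the mean is geometric rather than arithmetic, a severe weakness in any single dimension (for example a very low Speed score) drags the composite down sharply, which is exactly the balancing behavior described above.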
3. Benchmark Results
3.1 Sigmascore Leaderboard
The chart and table below rank coding agents by median Sigmascore, our composite measure of
accuracy, consistency, and speed.
Tier | Agent           | Model                  | Sigmascore | 95% CI
1    | Cursor CLI      | Composer 1             | 58.9%      | ±4.5
2    | Codex CLI       | GPT-5.1 Codex Max      | 44.5%      | ±3.7
2    | Claude Code CLI | Claude Opus 4.5        | 41.9%      | ±3.2
3    | Gemini CLI      | Gemini 3 Pro (Preview) | 41.2%      | ±4.6
Key takeaways
Cursor CLI / Composer 1 is the most well-rounded agent/model combo on your codebase.
Codex CLI / GPT-5.1 Codex Max technically shares Tier 2 status with Claude Code / Opus 4.5, but is very likely to be superior on average on your codebase.
Gemini CLI ranks dead last in this evaluation. Despite strong performance on global leaderboards, it is not the best choice for your codebase.
All tested agents perform significantly better on your repository than the Sigmabench global leaderboard average. This points to your codebase being particularly well-suited to agentic coding.
3.2 Accuracy Leaderboard
The chart and table below rank coding agents by median Accuracy score, indicating each
agent's ability to complete coding tasks successfully end-to-end.
Tier | Agent           | Model                  | Accuracy | 95% CI
1    | Codex CLI       | GPT-5.1 Codex Max      | 80.0%    | ±6.3
2    | Cursor CLI      | Composer 1             | 73.3%    | ±7.0
3    | Claude Code CLI | Claude Opus 4.5        | 60.0%    | ±8.0
3    | Gemini CLI      | Gemini 3 Pro (Preview) | 53.3%    | ±8.0
Key takeaways
Codex CLI / GPT-5.1 Codex Max is the clear accuracy leader on your codebase.
Cursor CLI / Composer 1 holds up surprisingly well: it is very likely behind Codex on average, yet it outperforms agents that are much slower.
Cursor CLI / Composer 1 performs significantly better on your codebase than on the Sigmabench global leaderboard. This is great news for DemoCorp: the fastest agent/model combination is also quite accurate on your codebase.
Gemini CLI and Claude Code both significantly underperform in this category. Important note: timeouts (capped at 20 minutes) are classified as unsuccessful runs, which lowers the Accuracy score.
3.3 Consistency Leaderboard
The chart and table below rank coding agents by median Consistency score, indicating how useful each
agent's output remains on runs that do not fully succeed.
Tier | Agent           | Model                  | Consistency | 95% CI
1    | Codex CLI       | GPT-5.1 Codex Max      | 100%        | ±0.0
2    | Claude Code CLI | Claude Opus 4.5        | 83.3%       | ±9.4
2    | Cursor CLI      | Composer 1             | 75.0%       | ±13.5
3    | Gemini CLI      | Gemini 3 Pro (Preview) | 66.7%       | ±12.0
Key takeaways
Codex CLI / GPT-5.1 Codex Max gets a perfect score in this category. This means that this agent/model combination produced an output that was at least partially useful on every task it was given, which is exceptional.
Cursor CLI / Composer 1 delivers average performance here, occupying a Tier 2 spot but very likely behind both Claude Code and Codex.
Gemini CLI / Gemini 3 Pro (Preview) again ranks dead last, confirming that this agent/model combination significantly underperforms on your codebase.
Once again, consistency scores across all agents are markedly better than on the global Sigmabench leaderboard.
3.4 Speed Leaderboard
The chart and table below rank coding agents by median Speed score, indicating how quickly
each agent completes a task. Note that the Speed score is log-scaled. To convert the speed score into a
runtime in seconds, use the formula below.
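A plausible form of this conversion, inferred from the reported medians and the 1,200-second timeout described in Section 4, is sketched below; treat it as an approximation rather than the official definition. It assumes Speed score = 1 - ln(runtime) / ln(1200), which reproduces the medians in the table below to within rounding.

```python
# Reconstructed (not official) mapping between Speed score and runtime,
# assuming the 1,200 s benchmark timeout as the reference point:
#   speed_score = 1 - ln(runtime) / ln(1200)
#   runtime     = 1200 ** (1 - speed_score)
import math

TIMEOUT_S = 1200  # 20-minute benchmark timeout

def speed_score(runtime_s: float) -> float:
    """Convert a runtime in seconds to a log-scaled Speed score in [0, 1]."""
    return 1 - math.log(runtime_s) / math.log(TIMEOUT_S)

def runtime_from_score(score: float) -> float:
    """Invert a Speed score back into a runtime in seconds."""
    return TIMEOUT_S ** (1 - score)

print(f"{speed_score(86):.3f}")             # ~0.372, close to Cursor CLI's 37.1%
print(f"{runtime_from_score(0.110):.0f} s") # ~550 s, matching Codex CLI's median
```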
Tier | Agent           | Model                  | Median Runtime    | Speed Score | 95% CI
1    | Cursor CLI      | Composer 1             | 86s               | 37.1%       | ±1.9
2    | Gemini CLI      | Gemini 3 Pro (Preview) | 297s (~3x Cursor) | 19.7%       | ±3.5
3    | Claude Code CLI | Claude Opus 4.5        | 423s (~5x Cursor) | 14.7%       | ±1.1
4    | Codex CLI       | GPT-5.1 Codex Max      | 550s (~6x Cursor) | 11.0%       | ±2.0
Key takeaways
Cursor CLI / Composer 1 runs circles around its competitors in terms of speed. Its median execution time was only 86s, about 6x faster than Codex CLI / GPT-5.1 Codex Max at 550s.
Both Cursor CLI / Composer 1 and Codex CLI / GPT-5.1 Codex Max appear slightly slower on your codebase than on the global Sigmabench leaderboard, but the difference is within a single margin of error, so it is minor.
The remaining agents span a range of speeds on your codebase: Codex CLI / GPT-5.1 Codex Max is the slowest, Claude Code CLI / Claude Opus 4.5 sits in the middle, and Gemini CLI / Gemini 3 Pro (Preview) is the fastest of the three.
4. Timeouts & Errors
The table below contains a summary of the timeouts and runtime errors encountered when running the Sigmabench
benchmark on your codebase. A comprehensive list of errors and timeouts is available in the archive that
accompanies this report.
Agent       | Model                  | Successes   | Timeouts  | Errors
Cursor CLI  | Composer 1             | 150 (100%)  | 0         | 0
Codex CLI   | GPT-5.1 Codex Max      | 140 (93.3%) | 10 (6.7%) | 0
Claude Code | Claude Opus 4.5        | 150 (100%)  | 0         | 0
Gemini CLI  | Gemini 3 Pro (Preview) | 140 (93.3%) | 10 (6.7%) | 0
Important notes
The benchmark timeout is fixed at 1,200 seconds (20 minutes).
An error can mean one of the following (see the illustrative sketch after this list):
the agent returned with a non-zero exit status;
the agent produced no source-code changes.
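For illustration, the sketch below applies these classification rules to a single run, given its exit status, elapsed time, and whether it produced any source-code changes. The field names and structure are hypothetical; this is a simplified approximation of the rules above, not the actual Sigmabench harness.

```python
# Illustrative classification of a single benchmark run using the rules above.
# Field names and structure are hypothetical, not the Sigmabench harness.
from dataclasses import dataclass

TIMEOUT_S = 1200  # fixed 20-minute limit

@dataclass
class RunResult:
    exit_code: int       # process exit status of the agent
    elapsed_s: float     # wall-clock runtime in seconds
    produced_diff: bool  # did the agent change any source files?

def classify(run: RunResult) -> str:
    if run.elapsed_s >= TIMEOUT_S:
        return "timeout"  # counted as an unsuccessful run for Accuracy
    if run.exit_code != 0 or not run.produced_diff:
        return "error"    # non-zero exit status or no source-code changes
    return "success"      # completed run, scored for Accuracy and Speed

print(classify(RunResult(exit_code=0, elapsed_s=86, produced_diff=True)))  # success
```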
Key takeaways
Timeouts count against Accuracy. The fact that Codex CLI / GPT-5.1 Codex Max is the clear Accuracy winner despite timing out in ~7% of runs is a testament to its excellent performance on your codebase.
No runtime errors occurred during this benchmark run (any that did occur were deemed retryable and ultimately succeeded). This means the results in this document are not affected by execution issues and reflect genuine differences in agent performance rather than artifacts of incidental failures.