I have attempted to reproduce Qwen/Qwen3-Coder-30B-A3B-Instruct 's performance on webarena and webarena-verified. However, using the Generic agent prompt, I achieved extremely low performance. Has anyone benchmarked Qwen/Qwen3-Coder-30B-A3B-Instruct on any of the benchmarks integrated here?
I have attempted to reproduce Qwen/Qwen3-Coder-30B-A3B-Instruct 's performance on webarena and webarena-verified. However, using the Generic agent prompt, I achieved extremely low performance. Has anyone benchmarked Qwen/Qwen3-Coder-30B-A3B-Instruct on any of the benchmarks integrated here?