Abstract: Trustworthy evaluation methods for code snippets play a crucial role in neural code generation. Traditional methods, which either rely on reference solutions or require executable test cases ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results