You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
reflexion-human-eval/programming_runs/run_testacc.sh

11 lines
250 B
Bash

python main.py \
--run_name "testacc_humaneval_py3" \
--root_dir "root" \
--dataset_path ./benchmarks/humaneval-py.jsonl.gz \
--strategy "test-acc" \
--language "py" \
--model "gpt-4" \
--pass_at_k "1" \
--max_iters "1" \
--verbose