A mind-blowing paper from last year: We also evaluate on other tasks including parity, multiplication and subtraction, and find similar results. On the longest subset of questions, we achieve an error reduction of approximately 10x, 5x and 2x respectively compared to the best available baselines.
Links for 2023-03-26
Links for 2023-03-26
Links for 2023-03-26
A mind-blowing paper from last year: We also evaluate on other tasks including parity, multiplication and subtraction, and find similar results. On the longest subset of questions, we achieve an error reduction of approximately 10x, 5x and 2x respectively compared to the best available baselines.