Until now, prototyping agents meant juggling workflow tools, prompt sandboxes, and spreadsheets, none of which was built for evaluation.