Show HN: A new benchmark for testing LLMs for deterministic outputs

· Hacker News