Show HN: A new benchmark for testing LLMs for deterministic outputs April 29, 2026 · Hacker News Read full story at source