Would you board a plane safety-tested by GenAI?
Ben and Ryan are joined by Robin Gupta for a conversation about benchmarking and testing AI systems. They talk through the lack of trust and confidence in AI, the inherent challenges of nondeterministic systems, the role of human verification, and whether we can (or should) expect an AI to be reliable.
Robin is the author of a practical handbook for Selenium test automation.
Connect with Robin on LinkedIn, Twitter, or via his website.
Shoutout to user2651084, who earned a Great Question badge by asking How do I reset the Jupyter/IPython input prompt numbering?.