What's up with absolutely tiny amounts of tests per day? 200 tests per day is tiny. My FE team only has two devs and the product is fairly small, but we still managed to amass a couple of hundreds of E2E tests in several years. With tests run in CI on every PR or commit, this won't scale at all.
Presumably, this is backed by some sort of LLM to browser MCP integration. If true, how do you ensure tests don't randomly fail because of inherit unpredictability?
What's up with absolutely tiny amounts of tests per day? 200 tests per day is tiny. My FE team only has two devs and the product is fairly small, but we still managed to amass a couple of hundreds of E2E tests in several years. With tests run in CI on every PR or commit, this won't scale at all.
Presumably, this is backed by some sort of LLM to browser MCP integration. If true, how do you ensure tests don't randomly fail because of inherit unpredictability?