Discussion about this post

User's avatar
Drew Harry's avatar

Lots of thoughts about this. But I'll share one insight that I think gets at the core of why this is hard.

The hard part of UXR work is not generating evidence, it's convincing people to act on it. I've seen many teams ship experiences over the strenuous objections of their UXR partners. A heuristic evaluation, like you discuss here as taking "days to do," is (in my experience) routinely dismissed by teams as "not sufficient evidence." It's an "N of 1" study. Similarly with survey results. If the data confirms the team's prior, great. If it does not? They tend to undermine it and de-scope it.

What IS easier to do is to convince teams not to ship something with a negative A/B test result.

Where would evidence of this sort land?

My expectation is it lands firmly on the UXR-side of the spectrum. If it confirms people's priors, great. They cite it as a reason to ship. If it questions their work, they work past it. "Oh it's just a hallucination." If teams are happy to overrule a human, they will even more happily reject a post-commit bot that has thoughts about the UX.

On top of that, we have a problem of insight coherence. Part of what research (and data) teams offer is a single, evidence-backed POV. It is well-known that you don't want separate data teams because you get a phenomenon where leaders are forced to reconcile disparate views on the question from different data teams that lob methods critiques at each other. An LLM-driven insights world is even more pluralistic and even harder to drill down. Why does my agent say this UI is great and your agent says it's bad? Is it just noise? Did the minor change in the instructions for the task to complete flip it from shippable to "needs revision?" We run head-long into LLM-explainability for moment-to-moment product decision-making.

So, I am not bullish about this sort of automation. I AM bullish about tools that enable humans to collect data faster. But the matter of turning data into insights is one that I think should remain the domain of humans because they have organizational power and can make sure the "truth" (such as it is) gets heard and acted on.

Hugo Alves's avatar

Hey David, building in this space since B.C. (Before ChatGPT)

check out https://www.syntheticusers.com/

No posts

Ready for more?