Pankaj Gupta built a product that let 1.3 million people vote on which AI model gave the best answer. Jeff Dean invested. Biz Stone invested. The CEO of Perplexity invested. a16z crypto's Chris Dixon led a $33 million seed round. On Tuesday, Gupta announced Yupp.ai is winding down, less than ten months after launch. Platform access ends April 15.

The stated reason is the one every failed startup reaches for: product-market fit. "The AI model capability landscape has changed dramatically in the last year alone," Gupta wrote. Which is a polite way of saying the product was a leaderboard for a race where the runners kept swapping positions between refreshes.

Yupp's premise made a kind of sense when it launched in June 2025. Back then, picking between Claude and GPT and Gemini and whatever Mistral was calling itself that week felt consequential. You'd paste a prompt into three chat windows, squint at the results, and develop superstitions about which one "got you." Yupp crowdsourced that process across 800 models. Millions of preference signals a month, all feeding into a ranking system that was supposed to help ordinary people navigate the model landscape.

The problem is that ordinary people stopped caring. Not because the models got worse, but because they got interchangeably good enough. When the gap between first place and eighth place on a benchmark is statistical noise, a consumer taste-test platform becomes a thermometer for a room that's already at temperature.

There's a crueller reading. AI labs figured out that crowdsourced preferences from casual users are a blunt instrument. The shift toward agentic workflows meant models needed to impress other models, not people scrolling on their phones. For the kind of reinforcement learning that matters now, labs hire domain experts and run evaluations against PhD-level feedback. The crowd was never going to be precise enough.

Forty-five angel investors. DeepMind's chief scientist. A $33 million cheque from one of the most connected funds in Silicon Valley. And the thing it bought was ten months of server time and a blog post titled "winddown." The economics of wrapping someone else's API haven't changed since Anthropic started enforcing its terms of service. If anything, the lesson has sharpened. The thinner your layer, the faster the substrate makes you irrelevant.

Some of Yupp's employees are reportedly joining a "well-known" AI company. Which sounds like a soft landing until you consider that it's the same trajectory the product followed: absorbed back into the infrastructure it was built to evaluate.

Sources: