8 Comments
User's avatar
Ganedh's avatar

You mention "Under the token model, the equation flips. You’re now selling output vs raw compute time. A token that helps an agent close a sales deal is worth a lot more than a token that summarizes a Wikipedia page, even though they cost roughly the same to generate. That’s the whole point of Jensen’s tiered token pricing vision - different points on the Pareto curve command different price points." How do you differentiate between "Wikipedia summarization" and "Closing a Sales Deal"

Jordan Berger's avatar

Would love to attend a town hall on the topic. Elena Verna at Lovable led a similar discussion in December which was great.

Roxane Googin's avatar

It seems like pricing could be open to judgement. How do you quantify "better" reasoning or more "reliable" agents well enough to price against that? Reputation perhaps?

DOOM METAL's avatar

More powerful models use longer context windows, and agentic workflows multiply token consumption. More tokens are being consumed, and the labs are not finished training the models. Why are you bullish on this?

sticko's avatar

Definitely would love to be part of a townhall etc to learn more and discuss

Zach Cohen's avatar

But what about the implementation details?

Fabrizio Amadini's avatar

Please, count me in if you plan to organize a virtual event/panel discussion on AI pricing & packaging. Would love to attend and gain further insights. Thx for putting together this awesome content, Jamin!

---'s avatar

love this! wanted to double click: "but the pricing power lives at the premium end.". Are you saying that there's less value for folks like Modal trying to solve for batched inference?