AI Generated Image

Tokenmaxxing Is Over. It's All About Modelmaxxing Now.

Business Insider • July 4, 2026 • general

Key Points:

AI startup Bold Metrics' CTO Morgan Linton directs his engineering teams to use different AI models based on task complexity, optimizing efficiency without strict token limits.
The AI industry is moving away from "tokenmaxxing," where companies encouraged maximum AI usage, towards strategic "model switching" to balance cost and performance by assigning simpler tasks to cheaper models.
Users like UX designer Tanvi Pisal and engineer Alejandra Thomas adopt cost-saving tactics by testing models for specific strengths and combining tools, reducing wasted tokens on less critical tasks.
Model routing startups, such as Rayline and OpenRouter, are gaining popularity by automating task assignment to appropriate AI models, helping companies manage API costs and improve token efficiency.
Despite the trend toward cost-conscious AI use, some companies still default to the latest, most expensive models due to convenience or reluctance to invest effort in understanding model capabilities.

Trending Business