Source: OpenAI engineers earlier this month told some colleagues they had figured out a way to more than halve the cost of inference (Stephanie Palazzolo/The Information)

Techmeme ·

OpenAI reportedly found a way to cut inference costs by more than half, potentially improving margins and pricing power.

Categories: Money & Moves

Excerpt

<a href="https://www.theinformation.com/articles/openai-discovers-new-way-cut-inference-costs-half"><img align="RIGHT" border="0" hspace="4" src="http://www.techmeme.com/260630/i27.jpg" vspace="4" /></a> <p><a href="https://www.techmeme.com/260630/p27#a260630p27" title="Techmeme permalink"><img height="12" src="http://www.techmeme.com/img/pml.png" style="border: none; padding: 0; margin: 0;" width="11" /></a> Stephanie Palazzolo / <a href="https://www.theinformation.com/">The Information</a>:<br /> <span style="font-size: 1.3em;"><b><a href="https://www.theinformation.com/articles/openai-discovers-new-way-cut-inference-costs-half">Source: OpenAI engineers earlier this month told some colleagues they had figured out a way to more than halve the cost of inference</a></b></span>&nbsp; &mdash;&nbsp; We closely track efforts by Anthropic, Google and OpenAI to get access to more server chips to run their models.&nbsp; But we don't talk enough about the work &hellip; </p>