Source: OpenAI engineers earlier this month told some colleagues they had figured out a way to more than halve the cost of inference (Stephanie Palazzolo/The Information)
OpenAI reportedly found a way to cut inference costs by more than half, potentially improving margins and pricing power.
Excerpt
<a href="https://www.theinformation.com/articles/openai-discovers-new-way-cut-inference-costs-half"><img align="RIGHT" border="0" hspace="4" src="http://www.techmeme.com/260630/i27.jpg" vspace="4" /></a>
<p><a href="https://www.techmeme.com/260630/p27#a260630p27" title="Techmeme permalink"><img height="12" src="http://www.techmeme.com/img/pml.png" style="border: none; padding: 0; margin: 0;" width="11" /></a> Stephanie Palazzolo / <a href="https://www.theinformation.com/">The Information</a>:<br />
<span style="font-size: 1.3em;"><b><a href="https://www.theinformation.com/articles/openai-discovers-new-way-cut-inference-costs-half">Source: OpenAI engineers earlier this month told some colleagues they had figured out a way to more than halve the cost of inference</a></b></span> — We closely track efforts by Anthropic, Google and OpenAI to get access to more server chips to run their models. But we don't talk enough about the work … </p>
Read at source: https://www.techmeme.com/260630/p27#a260630p27