News
Hosted on MSN9mon
AI researchers run AI chatbots at a lightbulb-esque 13 watts with no performance loss — stripping matrix multiplication from LLMs yields massive gainsThe work was done using custom FGPA hardware, but the researchers clarify that (most) of their efficiency gains can be applied through open-source software and tweaking of existing setups.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results