Steve Omohundro
...Google and others are using Mixture-of-Experts to avoid some of that cost: https://arxiv.org/abs/1701.06538
Matrix multiply is a pretty inefficient primitive and alternatives are being explored: https://arxiv.org/abs/2106.10860
These stand out for me as causes for alarm. Anything that makes ML significantly more efficient as an AI paradigm seems like it shortens timelines. Can anyone say why they aren’t cause for alarm? (See also)