Yet another tech startup wants to topple Nvidia with ‘orders of magnitude’ better energy efficiency; Sagence AI bets on analog in-memory compute to deliver 666K tokens/s on Llama2-70B
Sagence brings analog in-memory compute to redefine AI inference
10 times lower energy use and 20 times lower price
Also offers integration with ...