SaladCloud Blog


A New Price-Performance Standard for BERT Transformers.

Salad Technologies

Engineers from Numenta used Salad Container Engine (SCE) to benchmark a first-of-its-kind intelligent computing platform that optimizes BERT transformer networks. Learn how Numenta attained 10x more inferences per dollar on SCE.


Optimizing AI Systems

Deploying practical artificial intelligence applications at scale requires the distribution of large data sets to complex networks of specialized hardware. Though deep neural networks have facilitated significant advancements, their fundamental reliance on highly available processing resources and their tendency toward rapid expansion make it costly and inefficient to run transformers in the public cloud.

Price-Performance Comparison


Optimizing AI Systems

Leveraging insights from 20 years of neuroscience research, Numenta has developed breakthrough advances in AI that deliver dramatic performance improvements across broad use cases.

Grounded in the sensorimotor framework of intelligence elaborated by co-founder Jeff Hawkins in A Thousand Brains, Numenta’s innovative technology turns the principles of human learning into new architectures, data structures, and algorithms that deliver disruptive performance improvements.

Case Study

10x Price Performance

In a side-by-side comparison, Numenta’s optimized BERT technologies improved the throughput of a standard transformer network by up to 6.5x.

When deployed on SCE, Numenta attained 10x more inferences per dollar than possible with on-demand offerings from AWS—and managed to beat the cost efficiency of the nearest spot-basis instance by 2.39x.

About Numenta

Numenta has developed new artificial intelligence technologies that deliver breakthrough performance in AI/ML applications such as natural language processing and computer vision. Backed by two decades of neuroscience research, Numenta’s novel architectures, data structures, and algorithms deliver disruptive performance improvements. Numenta is currently engaged in a private beta with several Global 100 companies and startups to apply its platform technology across the full spectrum of AI, from model development to deployment—and ultimately enable novel hardware architectures and whole new categories of applications.

Have questions about SaladCloud for your workload?

Book a 15 min call with our team. Get $50 in testing credits.

Related Blog Posts

Speech to text inference benchmark - Distil Whisper Large v2

Inference Benchmark on Salad: Distil-Whisper Large V2 vs. Whisper Large V3 for Speech-to-text

Hugging Face Distil-Whisper Large V2 is a distilled version of the OpenAI Whisper model that is 6 times faster, 49% smaller and performs within 1%  WER (word error rates) on...
Read More
Openvoice text to speech gpu benchmark on SaladCloud

OpenVoice Text-to-Speech (TTS) Benchmark: 6 Million+ Words/$ Using Salad

What is OpenVoice? OpenVoice is an open-source, instant voice cloning technology that enables the creation of realistic and customizable speech from just a short audio clip of a reference speaker....
Read More
Whisper large v3 - Automatic speech - recognition - gpu benchmark

Whisper Large V3 Speech Recognition Benchmark: 1 Million hours of audio transcription for just $5110

Save over 99.8% on audio transcription using Whisper Large V3 and consumer GPUs A 99.8% cost-savings for automatic speech recognition sounds unreal. But with the right choice of GPUs and...
Read More

Don’t miss anything!

Subscribe To SaladCloud Newsletter & Stay Updated.