Meet the world's first transformer supercomputer

Transformers etched into silicon

By burning the transformer architecture into our chips, we’re creating the world’s most powerful servers for transformer inference.

[Chart: tokens per second for NVIDIA 8xA100, NVIDIA 8xH100, and Etched 8xSohu]

Build products that are impossible with GPUs

Real-time voice agents: ingest thousands of words in milliseconds
Better coding with tree search: compare hundreds of responses in parallel
Multicast speculative decoding: generate new content in real time (see the sketch below)
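
For readers unfamiliar with the technique, here is a minimal, illustrative sketch of standard speculative decoding in Python. The `draft_probs` and `target_probs` functions are toy stand-ins for a small draft model and a large target model, and the "multicast" variant named above is not shown; nothing here reflects Etched's actual software or hardware interfaces.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB = 50  # toy vocabulary size


def _toy_dist(prefix, salt):
    """Deterministic toy next-token distribution for a given prefix."""
    seed = abs(hash((salt, tuple(prefix)))) % (2**32)
    logits = np.random.default_rng(seed).normal(size=VOCAB)
    return np.exp(logits) / np.exp(logits).sum()


def draft_probs(prefix):
    return _toy_dist(prefix, "draft")   # stand-in for a small, fast draft model


def target_probs(prefix):
    return _toy_dist(prefix, "target")  # stand-in for the large target model


def speculative_step(context, k=4):
    """Propose k draft tokens, then accept or reject them against the target model."""
    # 1. The draft model proposes k tokens autoregressively (cheap).
    proposals, q_dists, ctx = [], [], list(context)
    for _ in range(k):
        q = draft_probs(ctx)
        tok = int(rng.choice(VOCAB, p=q))
        proposals.append(tok)
        q_dists.append(q)
        ctx.append(tok)

    # 2. The target model scores every prefix; on real hardware this is one batched pass.
    p_dists = [target_probs(list(context) + proposals[:i]) for i in range(k + 1)]

    # 3. Accept proposal i with probability min(1, p/q); on the first rejection,
    #    resample from the residual distribution max(0, p - q) and stop. This
    #    keeps the output distribution identical to sampling from the target model.
    accepted = []
    for i, tok in enumerate(proposals):
        p, q = p_dists[i], q_dists[i]
        if rng.random() < min(1.0, p[tok] / q[tok]):
            accepted.append(tok)
        else:
            residual = np.maximum(p - q, 0.0)
            accepted.append(int(rng.choice(VOCAB, p=residual / residual.sum())))
            return accepted

    # 4. Every proposal was accepted: take one bonus token from the target model.
    accepted.append(int(rng.choice(VOCAB, p=p_dists[k])))
    return accepted


print(speculative_step(context=[1, 2, 3], k=4))
```

The payoff is throughput: several tokens can be committed for the cost of one target-model pass whenever the draft model guesses well, which is why the card above pairs the technique with real-time generation.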

Only one core
Fully open-source software stack
Scales to 100T param models
Beam search and MCTS decoding (beam search sketched below)
144 GB HBM3E per chip
MoE and transformer variants
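
As a reference point for the "Beam search and MCTS decoding" item above, the following is a minimal beam-search decoding loop. The `next_token_logprobs` scorer is a deterministic toy stand-in, not Etched's software stack or any real model API.

```python
import numpy as np

VOCAB, EOS = 20, 0  # toy vocabulary; token 0 ends a sequence


def next_token_logprobs(prefix):
    """Toy stand-in for a model: a deterministic function of the prefix."""
    seed = abs(hash(tuple(prefix))) % (2**32)
    logits = np.random.default_rng(seed).normal(size=VOCAB)
    return logits - np.log(np.exp(logits).sum())


def beam_search(prefix, beam_width=4, max_len=8):
    """Keep the beam_width highest-scoring partial sequences at each step."""
    beams = [(0.0, list(prefix), False)]  # (cumulative log-prob, tokens, finished)
    for _ in range(max_len):
        candidates = []
        for score, toks, done in beams:
            if done:  # finished beams carry over unchanged
                candidates.append((score, toks, True))
                continue
            logp = next_token_logprobs(toks)
            # Expand each live beam with its beam_width best next tokens.
            for tok in np.argsort(logp)[-beam_width:]:
                candidates.append((score + logp[tok], toks + [int(tok)], int(tok) == EOS))
        # Prune back down to the best beam_width candidates.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:beam_width]
        if all(done for _, _, done in beams):
            break
    return beams


for score, toks, _ in beam_search(prefix=[3, 7]):
    print(f"{score:7.2f}  {toks}")
```

Beam search is the simplest form of the tree search referenced in the cards above: many candidate continuations are expanded and compared in parallel, with the weakest pruned at each step; MCTS-style decoding replaces the fixed-width pruning with sampled rollouts to decide which branches to explore.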