Cerebras Systems, traditionally focused on selling AI computers for training neural networks, is pivoting to offer inference services. The company is using its wafer-scale engine (WSE), a computer chip the size of a dinner plate, to integrate Meta’s open-source LLaMA 3.1 AI model directly onto the chip – a configuration…