While today’s chatbots are some of the most advanced ever created, they all suffer from high latency. Depending on the query, response times can range from a second to several seconds. Some companies, like Apple, want to resolve this with on-device AI processing. OpenAI took a different approach with Omni.