Engineering
Dec 12, 2025
The Pursuit of < 500ms Latency
When we started Vocalis, a typical GPT-4 response took 3-4 seconds. In a voice conversation, that silence feels like an eternity. It breaks the illusion of presence.
We rebuilt our entire ingestion pipeline using Edge WebSockets and optimistic audio buffering. Here is how we got the "Time to First Byte" (TTFB) of audio below 500ms...
Read Full Post ->
Product
Nov 28, 2025
Why We Built "The Shark"
Most AI assistants are designed to be servile and polite. But growth doesn't happen in a comfort zone. We realized our users weren't improving their negotiation skills because the AI was too nice.
"The Shark" is designed to interrupt you. It challenges your premises. It simulates the stress of a real salary negotiation with a difficult hiring manager. It's not nice, but it works.
Read Full Post ->