Seems like this one is Windows-only (even though it's Tauri?)
And it's not local (uses a cloud-based transcription API)
It also doesn't seem to be realtime streaming. For the most connected typing experience, try showing results within a second of the first word spoken, not after the utterance is complete.
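To make that concrete, here's a rough sketch of the display side of streaming, assuming the transcription backend emits partial ("interim") guesses followed by a final result. The `TranscriptEvent` shape and `StreamingTranscript` class are made up for illustration and not taken from any particular app or API:

```typescript
// Hypothetical event shape (an assumption): a partial guess that may still
// change, or a final result for a finished segment.
interface TranscriptEvent {
  text: string;
  isFinal: boolean;
}

class StreamingTranscript {
  private committed: string[] = []; // finalized segments, never rewritten
  private interim = "";             // latest partial guess, replaced on every update

  // Apply one event and return the text to show right now.
  push(event: TranscriptEvent): string {
    if (event.isFinal) {
      this.committed.push(event.text);
      this.interim = "";
    } else {
      // A new interim guess replaces the previous one; it is not appended.
      this.interim = event.text;
    }
    return this.render();
  }

  render(): string {
    return [...this.committed, this.interim]
      .filter((segment) => segment.length > 0)
      .join(" ");
  }
}

// Simulated event stream: text is rendered on every partial, not only on the final.
const transcript = new StreamingTranscript();
const frames: string[] = [
  { text: "hel", isFinal: false },
  { text: "hello wor", isFinal: false },
  { text: "hello world", isFinal: true },
  { text: "next", isFinal: false },
].map((event) => transcript.push(event));

console.log(frames);
```

The point is that the UI updates on every partial, so the user sees text while still speaking; the final result only locks in a segment.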
This HN comment captures why realtime streaming is important: https://hw.leftium.com/#/item/47149479
I've also been prototyping realtime streaming transcription with multimodal input: https://rift-transcription.vercel.app