Show HN: Scriber Pro β Offline AI transcription for macOS
Hey HN! Built this because I was tired of waiting hours for transcription services and didn't want to upload sensitive recordings to the cloud.
Real metrics from my M1 Max: 4.5hr video file transcribed in 3 minutes 32
seconds. Works completely offline.
First 5 HN users who click the button on the page get it free. Literally promo code straight to the app sore
Key differences vs Rev/Otter:
- No 2-hour file limits (handles any length)
- Timecodes stay accurate on long files (no drift from chunking)
- Supports MP3, WAV, MP4, MOV, M4A, FLAC
- Exports to SRT, VTT, JSON, PDF, DOCX, CSV, Markdown
Built for macOS. Happy to answer questions!
I've been using MacWhisper for this, with a huge variety of transcription options and things like speaker detection. It works great for all the 1 hour and shorter videos I've fed it, but does this have more to offer?
I haven't tried a 4+ hour video with MacWhisper but I presume that would work the same.
As a side project, I just launched a privacy-first web-based meeting transcriber (https://basilai.app/app). Everything runs entirely in your browser β both the transcription and AI summarization β so no audio or text ever leaves your device.
I'm using the browser built in transcription service plus downloading a model and running it via webgpu. No login. At the end of your meeting, you get a zip file with the audio, transcript and summary.
You can also run Whisper locally in your browser for free: https://ggml.ai/whisper.cpp/
What languages does this support? Does it support switching between multiple languages in one video?
For example, could it support a video that included spoken Latin, ancient Greek, German, and Italian?
Does it support speaker diarization?
You use the word "transcribe" but the page doesn't appear to support that claim? This looks like straightforward STT? Or does it actually support transcription (diarization, etc.)?
(Also, the text is completely illegible on your site.)
One thing that Rev and other online services have as well as MacWhisper is a good interface for editing the text to correct inevitable errors. Being able to click on the text and have it sync to the correct place in the audio is a must for my use case of transcribing interviews. Also speaker diarization.
Is it only for English? is CLI available? There are thousands of files on my local and I'd like to save results to local db. Thanks!
Seconding/thirding the request for diarization! I would use this as my main transcription app if it had that.
Does it do separate speaker identification (diarization)?
What's the stack, if I may ask? (I believe Whisper-X does the diarization thing)
Question: can it discern (and label) different speakers? If so, could you kindly share the limit on speakers per video?
Timecode drift is an interesting issue, think I faced this recently while translating a Google Meet transcript into an incident report timeline.
The elapsed-time timestamps didn't correlate well with other data sources. I figured it was a mistake on my end, and just brushed it off.
Will it transcribe audio in Czech (in future versions)?
Actually I would be happy if it could just identify occurrences (timestamps) of a specific word or a small set of words.
App Store link no longer works. Willing to try/purchase but it's nowhere available. AppStore search doesn't return "Scriber Pro" either.
Thanks.
How does it compare to MacWhisper?
I vibecoded a similar app. Hereβs the open source link, if folks want to build their own:
Nice work. What model did you use and do you ship the model with a base distribution or is it downloaded with the app?
What is your tech stack to make this? Is it end to end swift?
What libraries/models is this built on?
My eyes, my eyes! What is this red colour?
Cool project, I am using ChatGPT for recording/summarising meetings but the limit there is 2 hours
undefined
Is there a reason it requires macOS 26?
Any way to access this with python so I can use it programmatically?
Too bad it requires that unspeakable abomination macOS 26. No can do.
Word level timestamps?
I sort of use SuperWhisper, it is sort of good. https://superwhisper.com/
undefined
[stub for offtopicness]