More

Tsarp · 2025-06-30T03:22:10 1751253730

Locally running wispr flow equivalent without any tracking, signup, analytics or subscriptions.

Dictate into any text window on your Mac. Works really well with technical language specifically when using with claude code, cursor, windsurf.

Very fast since the underlying whisper.cpp lib is very well optimized for Metal and CoreML usage on Apple Silicon machines.

Tsarp · 2025-06-30T03:18:45 1751253525

https://voicebraindump.com

Low friction Markdown based voice journaling. Locally transcribed voice memos with whisper and write as markdown files (to any folder or obsidian vault).

czarofvan · 2025-06-30T03:27:11 1751254031

Is this opensource or just open eco system?

Tsarp · 2025-06-30T03:16:56 1751253416

https://github.com/srv1n/kurpod

Lets you create encrypted containers disguised as normal files. 1000s of images, pdfs, videos, secrets, keys all stuffed into an innocent look "Vacation_Summer_2024.mp4".

I've almost got true steganography working i.e to get the carrier file to actually open in any file system(currently with mp4, pdf, png and jpeg).

Things like this have existed in the past, but nothing with a simple UI,recent encryption standards.

czarofvan · 2025-06-30T03:25:44 1751253944

Damn how is the docker image only 4Mb. Even with the docker slim images they typically are atleast double digit. Nice!

Tsarp · 2025-06-30T13:03:37 1751288617

Im just stuffing the binary into a scratch container. I had to port over openssl certs, but works like a charm after!

Tsarp · 2025-06-21T13:25:34 1750512334

Why not something like https://github.com/nanobrowser/nanobrowser.

Its kinda built really well without exposing webdriver etc and can comfortably run js and communicate with LLMs.Has full agentic capabilites.

Why a new browser instead of a robust extension?

yencabulator · 2025-06-21T15:49:49 1750520989

Why a new browser extension for Chrome instead of an MCP operating Chrome over Chrome DevTools Protocol?

https://chromedevtools.github.io/devtools-protocol/

Not vouching for this project, but just an example of the category existing: https://github.com/AgentDeskAI/browser-tools-mcp

Tsarp · 2025-06-21T16:55:02 1750524902

CDP is great for testing. But one of the most basic checks for bot detection is checking for CDP(webdriver). Its always going to be a cat and mouse game. You'll see a bunch of solutions captch solvers etc, But they usually are only good for a few weeks.

yencabulator · 2025-06-21T17:06:38 1750525598

There's no reason why the same cat and mouse game wouldn't apply to this browser as a whole.

Tsarp · 2025-06-23T11:49:03 1750679343

True, but its orders of magnitude lesser when webdriver flag is an extremely basic bot check that is now considered 101.

yencabulator · 2025-06-23T21:38:35 1750714715

It sounds like you're thinking of window.navigator.webdriver, which is a WebDriver thing not part of Chrome DevTools Protocol. With CDP, as far as I can tell the detection mechanisms are more about the heuristics of e.g. how fast a form is filled -- which this AI stuff will trigger immediately too.

(And even if CDP had an explicit marker somewhere, surely patching that out is easier than piling up enough patches to "make a new browser".)

Tsarp · 2025-06-24T04:41:01 1750740061

Dont you need to navigator.webdriver === true for CDP to drive automation? Maybe I need to update my understanding on this. THis is usually a dead giveaway

yencabulator · 2025-06-24T14:57:31 1750777051

I see mentions that (unpatched) webdriver is easy to detect but detecting CDP only works by heuristics on timing etc.

Tsarp · 2025-06-21T16:55:13 1750524913

With stuff like https://www.cloudflare.com/en-in/application-services/produc... and https://blog.cloudflare.com/ai-labyrinth/ big money going on both sides last thing you want is to shadow detected as a bot. Its all ok if you are scraping to top rated SEO slop which is usually static sites but for anything beyond it wont work well eventually. Quite a few issues on browerbase, crawl4ai and similar repos around being detected as a bot.

Tsarp · 2025-06-21T04:22:40 1750479760

I was initially impressed. But then I tested a bunch, it wasn't catching some really basic things. Mostly hit or miss.

Tsarp · 2025-06-09T16:58:43 1749488323

I'd love for you to try https://carelesswhisper.app

- Locally running, wrapper around whisper.cpp

- I've done a lot of work on noise profiling, stitching the segments. So when you are speaking for anything >2-3mins, its actually faster than cloud transcriptions. (Accuracy is a few WER off since they are quantized models).

- You can try without paying or putting in CC. After that ~19$ one time. No need to sign up or login.

- BYOK to use your groq, gemini free daily credits to rewrite. Support for thinking models too. can also plug into any locally running LLM.

- Works on my 1st gen M1 without a sweat.

onemoresoop · 2025-06-09T19:43:23 1749498203

How much do you pay on average for an hour of transcription?

Tsarp · 2025-06-10T02:43:22 1749523402

Runs locally on device. So no server costs.

meepmorp · 2025-06-09T17:32:51 1749490371

simultaneously related and off topic:

https://arxiv.org/abs/2402.08021

Tsarp · 2025-06-10T02:44:07 1749523447

huh! nice!

Tsarp · 2025-06-04T01:46:08 1749001568

Maybe worth considering speech to text. Dictation has come a long way and if they are using a Mac any of the locally running whisper wrappers will work.

1. https://goodsnooze.gumroad.com/l/macwhisper (dictation + transcription)

2. https://carelesswhisper.app (does dictation only, and does it really well; cheapest)

3. https://superwhisper.com (both local and hosted models + lots of bells and whistles, but much higher pricing)

Tsarp · 2025-05-30T14:54:32 1748616872

Wow. This looks awesome.

Can we build our own python sandbox using the sandboxfile spec? This is if I want to add my own packages. Would this be just having my own requirements file here - https://github.com/microsandbox/microsandbox/blob/main/MSB_V...

appcypher · 2025-05-30T15:17:57 1748618277

Thank you!

> Can we build our own python sandbox using the sandboxfile spec?

Yes and I plan to make that work with the SDK.

PS: Multi-stage build is WIP.

Tsarp · 2025-05-30T17:21:22 1748625682

Great will join the discord. Is this embeddable? Will it work with a cross platform desktop app(Tauri)?

apitman · 2025-05-31T03:54:26 1748663666

An embeddable library that lets you launch Linux VMs that works across Windows, MacOS, and Linux hosts would be incredible.

appcypher · 2025-05-31T08:49:43 1748681383

If by embeddable, you mean having the vm run in the same process, then no. The vm aborts its process when it's done so it has to run as separate process.

Tsarp · 2025-05-26T09:51:29 1748253089

A local dictation app for Mac to use when coding. I spend a lot of time talking to Cursor, Chatgpt and needed to get rust and swift library names correctly.

Spent a lot of time on low level hardware libs to roll out my own version of VAD, grammar correction and stitching segments.

Faster than the hosted dictations tools thought it runs locally and a lot more control in terms of custom vocabulary.

https://carelesswhisper.app

Tsarp · 2025-05-26T09:43:42 1748252622

Building a small framework for securely connecting desktop apps/clis directly to your existing browser using Native Messaging i.e no headless browsers or cloud sandboxes/proxies involved.

Inspired by secure password managers like Bitwarden, goal is to reduce detectability, avoid CAPTCHAs, and mitigate common fingerprinting pitfalls.

The idea is simple: leverage the trust your browser already has.

https://github.com/srv1n/rzn-browser-native