
What are you working on? Any new ideas that you're thinking about?
An PWA primarily for my wife and my daughter. They can order their hot chocolate and their coffee as if they were going to grab something at a fancy café downtown, but instead it's at home and I'm the barista. It is quite nice to have for when my wife comes back from work and want something specific, or when we are waiting for the visit of a few friend, they can order exactly the available beverages and everything is ready when they're here.
It was also a good playground for me to implement Web Push notifications (to never miss new orders).
It's a basic Nuxt 3 app with Appwrite as the backend with rough edges, but much enough for our household use !
If you want to spam my phone with notifications, please visit my café : https://mytinycafe.com/alix
LOVE this. Are there any plans to open-source? I'd love to run my own instance.
Also some feedback: the ordering buttons are inexplicably in french despite everything else being in English. Choice of language or defaulting to English would be expected...
Also - multi-select and nullable options. So that I can create options like Taco / Steak / Pasta, and add side options that are relevant only when one of those is selected.
If there's a demand for it to become open-source, why not? But I'll have to improve code quality first. As the presence of french labels indicates it, i18n is not properly implemented for this project.
No, you really don't have to. Partially complete software can have a lot of value too.
Anyone can fork it and quickly add the i18n (or just translate into a different language) for their own purposes. People will likely want to contribute i18n. People may fix or improve things for you.
Of course, it's entirely up to you - but I've appreciated half-complete software countless times before.
> But I'll have to improve code quality first.
Don't fall into this trap, strike while the HN iron is hot, all these people +1'ing will never come back when you're eventually happy the code quality is "improved"
If it's truly that bad you'll benefit from the feedback since it's an internet exposed service, although considering you're a professional freelancer, I'm sure it's fine.
It is far from fine. It was my little dirty project where I knew there was only 3 users and where I could be finally escape the usual process from my professional life.
But heh, now is the Time for me to patch all the hole I left. Lots of coffees depend on me now.
Si c'est open source je peux aider avec les traductions !
And yes, I feel like, working on it before open sourcing it is like cleaning before the cleaner. It's ok if the code is messy and there are bugs, that's why OS exist.
Very cool idea imo congrats
There's demand for it, add me to the list who wants this and would use it
I'm definitely interested! My family would love this.
Me too
+1 interest!
+1!
Me too
It's open-source now : https://github.com/Karalix/micro-cafe
have fun :)
Any plans to open source?
PLUS, to sweeten the deal. here’s a bunch of tech support!
Do not worry about cleaning up the code before open sourcing it. Get it on GitHub and I am sure plenty of us would open PRs. I know I would!
It's live on Github now !
This is lovely,I thought about doing something similar but as a 'Dads taxi' app similar to Uber where my family can request rides. Partly because I sometimes struggle to remember where I supposed to be and when, but also because it might just be fun
I really love the idea of an ecosystem of Apps Powered by Parents (APPs). Please reach out if you ever plan to implement it!
Also helpful if you have teenagers and want to make it easy for them to not drink and drive
Bug report: I tried to create a new cafe with a name that contained a space character in it. The form told me the ID was invalid but seems to have created my user account anyway. When I log in now, the CafeID I get is 'undefined'; as in, I am now the proud owner of https://mytinycafe.com/undefined/barista. I'm assuming this is not intended :)
Just had a look at the control code. I must have been drunk because it is totally idiotic. I am surprised it took that long for it to be reported on HN
Hey, that's MY cafe! (same bug happened to me)
I love this concept and the execution.
So cozy. I love this.
One nit: the contrast in dark mode at least on the marketing site is a bit off (But I'd love to fix it myself if it was open source :) )
Awesome! I like when imagination fills the gaps of technology, maybe because I played on old computers like spectrum, we had few pixels and had to imagine the rest.
The ordering could’ve been “solved” with a WhatsApp message, ( or shouting ? :D ) but that would have been so boring!
This much better life UX !
This app is a reminder of being playful and imaginative in life can bring joy, congrats!
Very cool concept - thank you for sharing it! I think would be a great solution to the near-daily "what should we eat" problem.
If I could make a (not-important) suggestion, I think being able to re-arrange / categorize menu items would be useful. Something that lets you group together drinks apart from snacks as an example.
The "what should we eat" problem was a big source of tension in our household since the arrival of our first child. Too much time-consuming, too much planning effort. What changed our life (and I really mean it) is the app https://jow.com: it suggests you a list of meals for the week suited to your family and equipment, and it creates a shopping list for your preferred delivery provider. I only have good things to say about it and could go on for hours.
We're actually in a unique situation where the planning+buying isn't the hard part, but the deciding is. We're within walking distance of a super-cheap grocery store and I'm able to cook a wide variety of dishes - many of which I can make quickly. The hard part is my wife doesn't do well with open-ended questions like "what would you like to eat"? Seeing a discrete list of things I can make and her just picking/submitting the options would solve the problem.
Though that only holds while we have free time. If we have a kid, then I can see a great amount of value in that app.
I posted this in another comment but couldn't help but notice this discussion since it seemed relevant. I've been working on https://mealsyoulove.com, which is a meal planning app that also integrates with Kroger and Instacart for ordering groceries. Jow looks similar (not sure what their pricing model is?), but I'm leveraging AI to build highly tailored recipes and meal plans while allowing you to also import your own recipes to incorporate.
Super cute! Might try it out soon (my partner and I are both working from home most of the time).
FYI: The features section of the website doesn't render correctly in dark-mode.
You should add food and prices too. Obviously you don’t need to implement an actual payment system because it’s for fun, but if it kept track of the money, your kid could charge you 0.50 per drink or something.
That's a nice idea ! Will definetly add it soon.
And for the food, one can already add anything, it's just a text field. A friend of mine only has alcoholic drinks and snacks on his menu page.
This made me ovulate, and I'm not even a woman.
Ah this is one of my favorite projects I've seen in a long while. Will open my home cafe.
This is the ultimate benefit of democratising building of software with AI. Any personal and limited use-case can become software. Then people get software that suits them, not what suits the average user.
I never expected to open this and find something that would put a big grin on my face.
Thank you for giving me some joy.
Oh man this would kinda help me a lot right now. (As a struggling home owner living w/mom and dog)
FYI this is a blank page on Firefox :(
works for me Firefox 145.0.2
The testimonials section is adorable
Lovely idea! One Latte for me :)
This is awesome. Perfectly solving a problem, showing off tech, and adorable.
so it's a point-of-snail sort of system, perfect for taking share in a teeny tiny market, and in the growth-share matrix something of a Cashless Cow?
Yes, the url is public :)
this is awesome! my wife and I host coffee for neighbors on saturdays and this would be perfect for that!! thank you
I love it! Such a cute application :D
This is adorable. Nice work!
It's really super cool!
This is so delightful!
Slightly disappointed to realize there is not some automated drink machine behind this, as that's more my interest, but cool nonetheless and you handmade drinks are probably better.
> Slightly disappointed to realize there is not some automated drink machine behind this
There is and it's called "dad"
I'm working on porting KiCad to the browser. It's a lot of sweat and tears, multithreading issues and some more sweat. I've updated a port of WxWidgets and now I support all the features KiCad needs with ~200 tests.
Right now I have a build that loads in the browser, but I really want to have "multithreading" which means workers in the web. One can use asyncify with emscripten to translate blocking C++ to WASM, but that transition is not perfect, right now I'm debugging a bug where there's a race condition that halts all execution and the main thread runs in an infinite loop waiting for the workers to stand up. I guess I'll have a few of those ahead.
The main goal is to 1. just have fun 2. use yjs as a collab backend so multiple people can edit the same PCB. This will probably work with pcbnew, KiCad's layout editor, since it has a plugin system and AFAIK I can do the sync layer there. For the rest ( schematic, component editor etc. ) I'll have to figure out something.. KiCad does not sync automatically if you modify a file, I'll have to do some lifting there.
Anyway, it's a lot of fun, I really want this thing to exist, I'm hoping that I won't run into a "wellll, this is just not going to work" kind of issue in the end.
Excellent. kicad is cool; zero install should be a good gateway drug.
That's super cool! Not needing to install is great for accessibility for new users, and I think this would also be cool if it can be embedded into other websites as a viewer (like imagine opening up a Git repo with KiCAD files in it and being able to explore them in an integrated viewer)
That's a neat idea! To be honest my brain is overflowing with ideas too, right now I want to just bring all the apps one. Or... Actually just bring the layout editor up first :) I'm trying hard to live in the present...
There's already some work in that direction, too: https://github.com/theacodes/kicanvas
I just finished a dsn parser and now I am planning to write a pcb router. All in rust. The plan is to have a wasm/wasi version as well so routing is possible in the browser.
dsn parser is open source, https://github.com/dilawar/dsn-parser (WIP). Some part of PCB routing will be open-source (MIT).
That is very cool. I can't wait to try it out!
Heyythanks! Feel free to bug me at viktor.vaczi(at)emergence-engineering.com I'd love to chat about it :)
This is a great project! Thanks for tackling it!
https://play.google.com/store/apps/details?id=com.blazingban...
Completely free, no ads, no in-app purchases and no accounts / network required offline voice transcription.
I have also built the macOS/Windows/Linux versions which I'll also make free to download and available on my site soon (https://blazingbanana.com/).
iOS version is built and works (extremely well), just waiting for the Apple Developer signup process to complete.
Big shout out to https://github.com/mybigday/whisper.rn and https://huggingface.co/ggerganov/whisper.cpp/tree/main for making this even possible.
Any suggestions are welcome.
On the subject of whisper being great... A few weeks ago a co-worker commented about the difficulty he'd had editing a work demo, I pointed at various jump-cutting tools that had automated what he did in the past (editing out silences). But I'd also wanted to play with whisper for a while...
So a couple of hours later I'd written a script that does transcription based editing: on the first pass it grabs a timestamped transcript and a plain text transcript for editing; you edit the words into any order you like and a second pass reassembles the video (it's just a couple of hundred lines of python wrapping whisper and ffmpeg). It also speeds up 4x any silences detected that sit within retained sequences in the video.
Matching up transcripts turns out to be not that hard; I normalise the text, split it, and then compare to the sequence of normalised words from the timestamped transcript. I find the longest common sequence, keep that, then recurse on the before/after sections (there's a little more detail, but not much). I also sent the transcription to ffmpeg to burn in as captions, because sometimes it makes the audio choppy and the captions make it easier to follow.
I know, tools have been doing this for years now. I just didn't have one to hand, and now I do, and I couldn't have done this without whisper.
That is absolutely awesome and I love hearing about the tools that people build themselves!
Honestly, the capabilities of whisper is insane, the fact that it's free and open source is really a gift. Some of the things it can do feels almost sci-fi.
If you ever decide to release it publicly please let me know, sounds like a very useful tool.
"release" is maybe too strong a word, it's not a lot of code and I don't plan to put any more effort into the nonexistent interface since it was just built for personal use. But the code:
https://gist.github.com/bazzargh/e1d2e2718af575a03206114a291...
This is very kind of you, thanks.
I spent a few hours editing a video in Davicni resolve to do this by hand. Then i found out this is a built in feature.
Couldn't find it on the Play store by searching for the name and the developer's name: if it is not just me then your app is very hard to discover.
So I am installing it through the link you provided, which directed me to a "install success" page saying "your purchase is successful" even if your app is free. Another obstacle to adoption :-)
Last, I was not informed on the page of the app' size. Seeing what it does and the time it takes to download I am afraid it could be huge? Third obstacle :-)
Thank you for the feedback, I really do appreciate you taking the time to check it out and write out the comment! I'll look at adding a note about total app size in the description, it won't hurt.
As for discoverability / the "your purchase is successful" message, I'm not sure what else I can do, I've set it to free, no ads etc in Google Play. Maybe I need to hit a few more keywords for transcription so it surfaces it more.
The iOS Appstore also treats/words app installs as ‘Purchases’. Always confused my…
For me, searching for "whistle" on play store, I get the app as the third result (ignoring sponsored crap). Searching for "blazingbanana" gets me the app as the first result".
App info shows 218MB size, which I suppose is about what I'd expect for a model+app code :shrug:
Good to know, it's hard to know what real users would see in the play store and not Google just showing you what you want. Thank you for checking it out
Pretty cool. I've downloaded and lightly tested. Works great.
I love the "free forever, no ads part..." But it obscures what the app is for. Maybe start with the "Speech to text transcription" to make it clearer.
Either way, that's just semantics. Great job
Thank you, really appreciate the kind words. I'll take a look at giving the description a bit of a once over for the next release coming soon.
It'd be nice to keep the voice recording too, as I noticed at least one thing that it transcribed wrong.
This way one can listen to the recording again, and correct such issues.
Great idea and an option I'm looking at implementing soon with the ability to reprocess with a different model if needed. Cheers for taking a look.
By the way, how does this handle conversations between two or more people?
Currently, terribly. It is on the roadmap after I've released it on all platforms.
That's very cool, I've been looking for a fully offline transcription app for quite a while. Thanks for building this! And thanks so much for providing an "import audio file" function, not just "record from mic" -- transcribing voice notes from various messenger apps is my main use case here.
Do you have an idea about supporting languages other than English?
Thank you, glad you like it!
The average model and upwards should support all languages from the whisper models by default.
I haven't tested them all so I'm unsure of the quality, however it should in theory support the following:
---
Albanian
Amharic
Arabic
Armenian
Assamese
Azerbaijani
Bashkir
Basque
Belarusian
Bengali
Bosnian
Breton
Bulgarian
Cantonese
Catalan
Chinese
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finnish
French
Galician
Georgian
German
Greek
Gujarati
Haitian creole
Hausa
Hawaiian
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Italian
Japanese
Javanese
Kannada
Kazakh
Khmer
Korean
Lao
Latin
Latvian
Lingala
Lithuanian
Luxembourgish
Macedonian
Malagasy
Malay
Malayalam
Maltese
Maori
Marathi
Mongolian
Myanmar
Nepali
Norwegian
Nynorsk
Occitan
Pashto
Persian
Polish
Portuguese
Punjabi
Romanian
Russian
Sanskrit
Serbian
Shona
Sindhi
Sinhala
Slovak
Slovenian
Somali
Spanish
Sundanese
Swahili
Swedish
Tagalog
Tajik
Tamil
Tatar
Telugu
Thai
Tibetan
Turkish
Turkmen
Ukrainian
Urdu
Uzbek
Vietnamese
Welsh
Yiddish
Yoruba
---
Apologies for the formatting, not sure how to make it look nice in the comment.
A new bugfix update for the "Translate to English" toggle (which was functionally always set to on) should be available soon, it's just awaiting Play Store approval.
Whoa cool, so all the transcription is done locally? Have you done any perf monitoring around that on iOS?
I have been using the iOS built in speechTranscriber and it is... not great, was gonna use a whisper API but running it on device would be amazing if it isn't too heavy.
Hey, just spotted your comment.
Yes, everything is done locally, stick your phone in airplane mode if you want to be sure!
I'm not an Apple fan, but I have to say I've been testing it on an iPhone 15 and my god, the performance is insanely good, I was seriously blown away. I haven't dug into how much it impacts battery, but the transcription literally takes seconds for a minute of audio so it's not holding up your device.
The iOS version is built, ready to go, there's just some bug with my Apple account and it won't let me pay the £80 fee to signup (support ticket raised and waiting). As soon as that's sorted it'll be out on App Store for free as well.
Very surprised to hear the built in transcription is not great, anything specifically bad about it? The hardware is there.
You can download the desktop version from here (https://blazingbanana.com/apps/whistle/) if you want, still very much a WIP.
I really liked wisprflow on my mac but my daily driver is Manjaro KDE. I have stitched together a bash script that copies the transcription (right now I am using the Parakeet TDT 0.6B) to my clipboard. I would give this a try on linux when it becomes available.
Would you be open to sharing your script? I run whisper.cpp in Linux through some stitched together scripts (https://news.ycombinator.com/item?id=44949314), but would be very curious to try Parakeet. I don't believe I can run it through whisper.cpp?
I looked at your script. I am doing basically the same thing just using onnx-asr (installed as a uv tool) with the parakeet model instead of the whisper-cli. Look Here: https://github.com/istupakov/onnx-asr
Just checked out whisprflow, I must say that looks really nice, kudos to those devs. Shame there isn't a Linux / Android version.
I have added the auto-copy to clipboard functionality that will come with the next Android release and be included in all others. Adding a hotkey / quickbar button is on the roadmap for the desktop versions.
If you want to give the Linux version a shot, you can download it from here - https://downloads.formait.app/whistle/linux/WhistleDesktop-l... - I've just stuck it in the same R2 bucket as another app, as I've not sorted the proper pipeline out yet.
I just tried running this on a 30 minute meeting with some 10 people in. It got to the end, then just bailed without transcribing. I also did not get any errors or anything.
Really sorry about that, longer running audio (~10m+) is something I'm working on along with handling multiple speakers.
I've been focused on getting functional parity across all OS's since the Android release. This is very close to being done and I just need to reach the milestone of it being available on all platforms before I move forward.
Hopefully you will take another look when the next update is out.
Are you piping the audio buffers straight to the transcriber as they come in? Or capturing the entire recording and then processing it at the end?
I'm capturing the audio first, which was fine for short audio but ended up killing the process when it got too big. The updated fix is out now.
I did a complete overhaul to the pipeline so that it splits and processes at the end, this seems to have sorted it. I'm thinking about doing each transcription segment as it's coming in (with a bit of a buffer / overlap to keep context) much like the live transcription does, but for now performance is ok. Something I'll keep in mind once I've crossed some other things off the list.
@blazingbanana
We have a similar product in the construction space. Would love to talk to you about some of our challenges and possibly work together. Interested?
Very interesting, happy to discuss this privately.
Would you consider adding it F-Droid?
Yes absolutely! I'm a GrapheneOS user myself so understand not wanting to have to go through the play store if you can help it.
I believe you have to make the source code public (please correct me if I'm wrong). I'm more than happy to do so, I've used a whole bunch of open source stuff to build the app so it only seems fair, I just need to make it a bit less messy and something I don't mind being public.
Yes, not just public, but also licensed under a license that permits free redistribution, modification, etc. This is awesome!
nice app!
if I am talking in german the text is translating it to english. Didn't expect that
Thank you!
There was a bug causing the "translate to english" to be always enabled. This should work correctly and translate to your native language.
Will be in the next update (in a day or two).