555 points by rzzzzru 1 day ago | 156 comments
rzzzzru 1 day ago |
Gormo about 23 hours ago |
How come this is trying to install its own vendored dependencies, including executable binaries, instead of checking for what's already installed? That approach can lead to both security and performance issues.
Edit: the Python download isn't failing, but rather the application itself is looking for the executable interpreter in `lib` rather than `bin` once the download completes. I built the release tarball in the git repo, and I'm pretty amazed that such a basic error could make it into release code.
Further edit: I tried using the build script in the tarball rather than just doing a `cargo build -r`, and it started trying to install Docker containers! Docker to build a desktop application! What is going on here?
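For illustration, the `lib` versus `bin` mix-up described above can be avoided by resolving the interpreter from one place and checking it before use. This is only a minimal sketch assuming a standard virtual-environment layout (`bin/python` on Linux and macOS, `Scripts/python.exe` on Windows); the layout this app actually uses is not shown in the thread, and `interpreter_path` and `find_interpreter` are hypothetical names, not the app's code.

```python
import os
import sys
from pathlib import Path


def interpreter_path(env_root: Path, platform: str = sys.platform) -> Path:
    """Return where a standard venv layout keeps its interpreter.

    Assumption: the environment follows the usual venv layout.
    `lib` holds the standard library and site-packages, not the executable.
    """
    if platform.startswith("win"):
        return env_root / "Scripts" / "python.exe"
    return env_root / "bin" / "python"


def find_interpreter(env_root: Path) -> Path:
    """Resolve the interpreter and fail loudly if it is missing or not executable."""
    candidate = interpreter_path(env_root)
    if not (candidate.is_file() and os.access(candidate, os.X_OK)):
        raise FileNotFoundError(f"no executable interpreter at {candidate}")
    return candidate
```

Failing with the full path in the error message makes this kind of bug much easier to spot than a generic "download failed".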
notdefio about 22 hours ago |
The homepage still exists, but it looks like many of the other pages like the blog and wiki are long gone. It hasn't been active in probably over a decade.
ezst about 1 hour ago |
Eckter2 about 24 hours ago |
I've worked on a small toy project with a similar purpose in the past [1], though it's not nearly as polished as yours, and I've made some questionable decisions here and there.
I have questions about pitch tracking. It seems you do track the pitch for scoring, and there's a line at the top of the screen that seems related but that I can't figure out. For my use case, an important feature of karaoke apps is displaying how "high" the next note should be sung, or at least some hints. Is it something your app can do and I just haven't figured it out? Or would it be a feature request?
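To make the "how high is the next note" idea concrete, here is a rough sketch of how a detected frequency could be compared against a target note. It uses the standard equal-temperament mapping (A4 = 440 Hz = MIDI note 69); the function names and the half-semitone tolerance are my own assumptions, not anything taken from this app.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def frequency_to_midi(freq_hz: float) -> float:
    """Convert a frequency in Hz to a (fractional) MIDI note number."""
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    return 69.0 + 12.0 * math.log2(freq_hz / 440.0)


def note_name(midi: int) -> str:
    """Name a MIDI note in scientific pitch notation, e.g. 69 -> 'A4'."""
    return f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"


def pitch_hint(sung_hz: float, target_midi: int, tolerance: float = 0.5) -> str:
    """Tell the singer whether to go higher or lower to reach the target note.

    `tolerance` is in semitones and is an arbitrary choice for this sketch.
    """
    diff = target_midi - frequency_to_midi(sung_hz)
    if abs(diff) <= tolerance:
        return "on pitch"
    return "sing higher" if diff > 0 else "sing lower"
```

A display could then draw each upcoming note at a height proportional to its MIDI number and overlay the singer's current `frequency_to_midi` value.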
10729287 1 day ago |
rgilliotte about 17 hours ago |
AI is making whole categories of projects viable that simply weren't before. Not because they were technically impossible, but because they were too time-consuming for a niche audience to justify the effort.
Thanks for the cool project! (testing now)
mynameajeff 1 day ago |
dmd 1 day ago |
upsuper about 24 hours ago |
Would it be possible to process songs on one device, and then use the result in another, or even multiple? Or would it be possible to run as separate server / client?
I ask mainly because the device I connect to my TV is definitely not the most powerful one, so it would be nice if I can preprocess the songs elsewhere.
danvoell about 23 hours ago |
rjh29 about 13 hours ago |
In the end he's gone back to karaoke videos on YouTube, but it seems promising.
mft_ about 23 hours ago |
A couple of immediate small pieces of feedback:
* The colour scheme on the queue/nn% buttons is really low contrast - white on pale yellow is very hard to read.
* The 'models' button (bottom left) - I assumed this would give me details about which models are available and their sizes, but instead it deleted the downloaded models without warning. Maybe add an 'are you sure you want to...' check?
reddalo 1 day ago |
primenum about 6 hours ago |
integralid 1 day ago |
Impressive, very nice. Now let's see my death metal collection.
Just joking! Very nice, thanks for open-sourcing it.
caipira 1 day ago |
skwb about 12 hours ago |
arvinsim about 24 hours ago |
HaloZero about 18 hours ago |
harvey9 about 20 hours ago |
melonpan7 about 15 hours ago |
ccozan about 23 hours ago |
manbash 1 day ago |
mentalgear about 22 hours ago |
skatedbear about 23 hours ago |
ETlol 1 day ago |
rjh29 1 day ago |
levysoft about 19 hours ago |
hasbot 1 day ago |
bmn__ about 17 hours ago |
nytrox about 16 hours ago |
OpenDQV about 22 hours ago |
nephihaha about 20 hours ago |
EGreg about 23 hours ago |
We already do this for ingesting podcasts and cutting their clips with text being highlighted as people speak. AssemblyAI also supports speaker diarization.
For videos recorded using our own livestreaming studio, we can bypass all this by using Web STT and TTS APIs resulting in perfect timing and diarization without the need for server side models.
MediaMonitorWD about 22 hours ago |
Babkock about 21 hours ago |
JohnnyLarue about 17 hours ago |
Iamkkdasari74 about 23 hours ago |
Heer_J about 21 hours ago |
inkianj82 about 17 hours ago |
spacesxbt27 1 day ago |
jamesvzb 1 day ago |
lucideer about 21 hours ago |
ML has come a long way but I have yet to encounter anything that does this reliably with speech, never mind song lyrics.
> works with any song on your computer
I'd be shocked if this is true.
Everything runs locally on your machine, nothing gets uploaded. No accounts, no subscriptions, no telemetry.
It ships as a single binary for Linux, macOS, and Windows. On first launch it sets up its own isolated Python environment and downloads the ML models it needs - no manual installation of dependencies required.
My two biggest drivers for the creation of this were:
Some highlights: The whole stack is open source. No premium tier, no "open core" - just the app. Feedback and contributions welcome.