More

mlboss · 2025-12-02T14:38:23 1764686303

I bet all are Y Combinator companies.

andsoitis · 2025-12-02T14:57:49 1764687469

Which makes it even more damning. Vibe coding platforms are software solutions themselves, which means they are subject to the same logic.

mlboss · 2025-12-02T14:37:00 1764686220

Most probably all VCs will go out of business as the cost of creating software companies approaches zero. The need to create an army of software developers is no longer needed.

alephnerd · 2025-12-02T14:56:02 1764687362

Not really.

In such a hypothetical world, it would actually be much easier for us to fund companies simply because now the only thing we are funding is just sales, demand gen, and projected compute.

We provide funding so businesses that are the right fit can scale out the functions that they need. In some cases it's expanding engineering, in other cases it's expanding sales and demand gen, and in other cases is to subsidize a major purchase such as cloud credits or GPUs.

mlboss · 2025-11-19T14:58:20 1763564300

I don’t think it is about resources. Growing up in 3rd world country I have zero resources. It is the drive of student and higher standards from parents and teachers that matters. Everybody is just getting soft.

Aurornis · 2025-11-19T15:11:55 1763565115

> I don’t think it is about resources. Growing up in 3rd world country I have zero resources.

In America many parents do have resources, though, and they will spend those on private schools, tutoring, or home schooling.

> It is the drive of student and higher standards from parents and teachers that matters.

These proposals restrict the teachers and disallow teaching advanced subjects to students with drive to learn them.

You can’t say it’s up to the students and teachers while also holding back the students and restricting the teachers.

mlboss · 2025-08-06T02:00:59 1754445659

Reddit post with generated audio sample: https://www.reddit.com/r/LocalLLaMA/comments/1mhyzp7/kitten_...

seligman99 · 2025-08-06T16:40:17 1754498417

And a quick video with all of the different voices:

https://www.youtube.com/watch?v=60Dy3zKBGQg

a96 · 2025-08-08T09:17:21 1754644641

Thanks. I really would not want to listen to any of these regularly.

tracker1 · 2025-08-06T23:23:18 1754522598

Cool, thanks... aside: the last male voice sounds high/drunk.

Eduard · 2025-08-06T20:06:28 1754510788

thank you!

smusamashah · 2025-08-06T08:22:50 1754468570

The reddit video is awesome. I don't understand how people are calling it an OK model. Under 25MB and cpu only for this quality is amazing.

soasme · 2025-08-20T07:41:03 1755675663

Just made a TTS tool based on Kitten TTS, fully browser based, no Python server backend: https://quickeditvideo.com/tts/ A tts model of this size should be industry standard!

Retr0id · 2025-08-06T11:25:30 1754479530

The people calling it "OK" probably tried it for themselves. Whatever model is being demoed in that video is not the same as the 25MB model they released.

darkwater · 2025-08-06T14:41:05 1754491265

Nope, looks like the default voice is the worst and it's not in the demo. A Reddit user generated these as well https://limewire.com/d/28CRw#UPuRLynIi7

bouchard · 2025-08-06T15:47:22 1754495242

Never thought I'd see the name LimeWire again, wow

divamgupta · 2025-08-06T21:10:18 1754514618

Haha interesting pivot!

fortyseven · 2025-08-06T19:16:21 1754507781

It did say this was a preview release, so I'll reserve judgement until that's out the door.

iab · 2025-08-06T16:11:24 1754496684

Local quality is very bad

sergiotapia · 2025-08-06T15:33:10 1754494390

https://vocaroo.com/1njz1UwwVHCF

It doesn't sound so good. Excellent technical achievement and it may just improve more and more! But for now I can't use it for consumer facing applications.

divamgupta · 2025-08-06T20:59:22 1754513962

We are still training the model. We expect the quality to go up in the next release. This is just a preview release :)

Zardoz84 · 2025-08-06T10:17:56 1754475476

Sounds very clear. For a non native english speaker like me, it's easy to understand.

tapper · 2025-08-06T07:58:11 1754467091

Sounds slow and like something from an anine

ricardobeat · 2025-08-06T08:11:24 1754467884

Speech speed is always a tunable parameter and not something intrinsic to the model.

The comparison to make is expressiveness and correct intonation for long sentences vs something like espeak. It actually sounds amazing for the size. The closest thing is probably KokoroTTS at 82M params and ~300MB.

dvh · 2025-08-06T09:32:16 1754472736

I think he meant overacting typical for English dubs.

Telemakhos · 2025-08-06T10:16:44 1754475404

The voices sound artificial and a bit grating. The male voices especially are lacking, especially in depth: only the ultimate voice has any depth at all, while the others sound like teenagers who haven't finished puberty. None of the voices sound quite human, but they're all very annoying, and part of that is that they sound like they're acting.

avisser · 2025-08-06T23:56:46 1754524606

I heard a little DVa from Overwatch.

numpad0 · 2025-08-06T11:23:04 1754479384

The only real questions are which Chinese gacha game they ripped data from and whether they used Claude Code or Gemini CLI for Python code. I bet one can get a formant match from output this much overfit to whatever data. This isn't going to stay up for long.

KaiserPro · 2025-08-06T09:15:05 1754471705

was it cross trained on futurama voices?

junon · 2025-08-06T09:26:37 1754472397

That would be a feature!

archon810 · 2025-08-06T18:46:42 1754506002

Sounds like Mort from Family Guy.

divamgupta · 2025-08-06T21:10:41 1754514641

divamgupta · 2025-08-06T20:59:53 1754513993

It was not

Aachen · 2025-08-06T11:19:23 1754479163

Impressive technical achievement, but in terms of whether I'd use it: oof, that male voice is like one of these fake-excited newsreaders. Like they're always at the edge of their breath. The female one is better but still someone reading out an advertisement for a product they were told they must act extra excited for. I assume this is what the majority of training data was like and not an intentional setting for the demo. Unsure whether I could get used to that

I use TTS on my phone regularly and recently also tried this new project on F-Droid called SherpaTTS, which grabs some models from Huggingface. They're super heavy (the phone suspends other apps to disk while this runs) and sound good, but in the first news article there were already one or two mispronunciations because it's guessing how to say uncommon or new words and it's not based on logical rules anymore to turn text into speech

Google and Samsung have each a TTS engine pre-installed on my device and those sound and work fine. A tad monotonous but it seems to always pronounce things the same way so you can always work out what the text said

Espeak (or -ng) is the absolute worst, but after 30 seconds of listening closely you get used to it and can understand everything fine. I don't know if it's the best open source option (probably there are others that I should be trying) but it's at least the most reliable where you'll always get what is happening and you can install it on any device without licensing issues

willwade · 2025-08-06T12:40:09 1754484009

anyone else wants to try sherpaOnnx you can try this.. https://github.com/willwade/tts-wrapper we recently added in the kokoro models which should sound a lot better. There are a LOT of models to choose from. I have a feeling the Droid app isnt handling cold starts very well.

spookie · 2025-08-06T21:39:03 1754516343

If anyone wants to test ready to install android apks: https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

divamgupta · 2025-08-06T21:15:04 1754514904

Thanks a lot for the detailed feedback. We are working on some models which do not use a phonemizer

bornfreddy · 2025-08-06T21:00:41 1754514041

RHvoice is pretty good, imho.

mlboss · 2025-08-04T23:18:15 1754349495

Somebody should create a AI interviewer for VC funding. VCs are swamped with so many funding requests. All the founders should first convince AI why they need funding.

mlboss · 2025-08-04T23:15:11 1754349311

Candidate will be still writing the prompt.

mlboss · 2025-08-04T23:14:39 1754349279

The real irony will be that both the services will be using OpenAI speech to speech model.

mlboss · 2025-08-04T23:10:01 1754349001

Next iteration: Send your bot to work while you enjoy beach

mlboss · 2025-08-04T23:02:36 1754348556

Developer interviews are already kinda AI interviews based on take home hackerrank interviews.

mlboss · 2025-08-04T20:56:02 1754340962

I don't think you will ever talk to a user if you work in a big company. There are so many layers of abstraction.

Aeolun · 2025-08-04T23:01:42 1754348502

And it’s not even because you don’t want to. It’s just because that’s how things work. I spent years talking directly to users and then I started working for a multinational, and I haven’t seen a user in 7 years…