
> trained from scratch

Not exactly. They mention starting from the VAE from Stable Diffusion XL and the Transformer from Phi-3.

Looks like these LLMs can really be used for anything



Pretty cool. ComfyUI and its community are too cumbersome for me, and still result in too much throwaway content.



