This is a really good implementation, but I don’t lean too heavily into skills, especially not other people’s. If I’m doing design, who’s to say I want instructions in there in the first place, like “pick an extreme” (an instruction in the design skill featured on the homepage)?
This company predicts software development is a dead occupation, yet it ships a mobile chat UI that appears to be perpetually full of bugs and has had a number of high-profile incidents.
"This company predicts software development is a dead occupation"
Citation needed?
Closest I've seen to that was Dario saying AI would write 90% of the code, but that's very different from declaring the death of software development as an occupation.
The disdain he has for the profession is evident in any interview he gives. Saying AI would write 90% of the code was not a signal to us; it was directed at his fellow execs: that they can soon get rid of 90% of the engineers, along with some related professions.
I think it's pretty clear that Anthropic was the main AI lab pushing code automation right from the start. Their blog posts, everything, just targeted code generation; even their headings for new models in articles would be "code". My view is that if they weren't around, even if it would have happened eventually, code would have been solved at the same cadence as other use cases (i.e., gradually, as general demand dictated).
AI engineers aren't actually SWEs per se; they use code, but they see it as tedious non-core work, IMO. They are happy to automate their complement and raise their status relative to SWEs, who, before all of this, typically had more employment opportunities and more practical ways to show value.
I dislike the idea of coupling my workflow to SaaS platforms like GitHub or CodeRabbit. The fact that you still have to create local tools is a selling point for just doing it all “locally”.
I’ve been doing game development, and it starts to hallucinate more rapidly when it doesn’t understand things like the direction it’s placing things in or which way the camera is oriented.
Gemini models are a little better at spatial reasoning, but we’re still not there yet, because these models were not designed for spatial reasoning; they were designed to process text.
In my development, I also use the ASCII matrix technique.
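For anyone unfamiliar with the technique: the idea is to serialize 2D world positions into a plain-text character grid, so the model reasons about layout in the medium it was trained on. A minimal sketch, assuming a top-down grid; the function name and symbols here are illustrative, not from any particular engine:

```python
def ascii_matrix(width, height, objects):
    """Render a top-down scene as an ASCII grid for an LLM prompt.

    objects: dict mapping (x, y) grid cells to one-character symbols.
    Row 0 is the top edge of the map; empty cells render as '.'.
    """
    rows = []
    for y in range(height):
        row = "".join(objects.get((x, y), ".") for x in range(width))
        rows.append(row)
    return "\n".join(rows)

# Example: player (P), enemy (E), and a wall (#) on a 6x4 map.
scene = {(1, 1): "P", (4, 2): "E", (3, 0): "#", (3, 1): "#"}
print(ascii_matrix(6, 4, scene))
# ...#..
# .P.#..
# ....E.
# ......
```

Pasting that grid into the prompt alongside the question ("which direction is the enemy from the player?") grounds the spatial relationships in text the model can actually parse.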
Spatial awareness was also a huge limitation for Claude playing Pokémon.
It really seems to me that the first AI company to implement "spatial awareness" vector tokens and integrate them neatly with the other conventional text, image, and sound tokens will reap huge rewards.
Some are already partnering with robotics companies; it's only a matter of time before one of them gets there.
I disagree. With Opus, I'll screenshot an app, draw all over it like a child with MS Paint, and paste it into the chat; it seems to reasonably understand what I'm asking from my chicken scratch and dimensions.
As far as 3D goes, I don't have experience; however, it could be quite awful at that.
Yeah, at least for 2D, Opus 4.5 seems decent. It can struggle with finer details, so sometimes I’ll grab a highlighter tool in Photoshop and mark the points of interest.
I wonder if they could integrate a secondary "world model" trained/fine-tuned on RollerCoaster Tycoon to do just the layout reasoning, and have the main agent offload tasks to it.
I expect that adding instructions that attempt to undo training produces worse results than not including the overbroad generalization in the training in the first place. I think the author isn't making a complaint; they're documenting a tradeoff.
I have been using an open-source program, “Handy”; it is a cross-platform Rust/Tauri app that does speech recognition and handles inputting text into programs. It works by piggybacking on the OS’s text-input or copy-and-paste features.
You could fork it and shell out to an LLM before finally pasting the response.
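The interposed step might look something like this sketch: pipe the raw transcript through an external command and paste its output instead. This is an assumption about how such a fork could work, not Handy's actual API, and the command here is a stand-in for whatever LLM CLI you have installed:

```python
import subprocess

def refine_transcript(text, cmd):
    """Run `cmd`, feed `text` on stdin, and return its stdout.

    In a real fork, `cmd` would invoke an LLM CLI with a cleanup
    prompt; here we treat it as any stdin-to-stdout filter.
    """
    result = subprocess.run(
        cmd, input=text, capture_output=True, text=True, check=True
    )
    return result.stdout

# Stand-in filter so the sketch runs without an LLM: strip commas.
cleaned = refine_transcript("um, open the file\n", ["tr", "-d", ","])
print(cleaned)  # um open the file
```

Speech recognition upstream and the OS paste downstream stay untouched; only the text passing between them gets rewritten.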