but why a new thing vs extending selenium? it's a little complicated, but neither selenium nor playwright were designed with ai in mind from day 1. with vibium, i'm optimizing for "vibe coding" and ai-driven workflows first.
This makes sense. I guess I wanted to understand why starting from scratch was better than "fixing" selenium, but perhaps "fixing" selenium isn't an option?
for the entire testing tools industry, in some ways, selenium was the "final boss" to beat. every new tool had to trash selenium in their marketing. eventually those "hit points" added up. "fixing selenium" is as much of a branding problem as it is a technical problem. "oh, there's a new version of selenium? i heard selenium sucks!" is actually a problem that has to be dealt with. an entire new generation of coders only know "playwright rules, selenium drools".
of course, i have a new host of problems by going all in with "vibium"... i'm making a huge bet that "vibe coding" is a trend, not a fad. (it could still be a fad! we'll see if this post ages well soon enough!)
Also, as someone on the periphery of Selenium (mostly via WebDriver), some of the challenge is that Selenium has a huge amount of test code already written for it — making radical API changes would break all of those existing tests, and at that point you're effectively a new library.
It’s gonna be very interesting to watch exactly how the adoption of WebDriver BiDi goes with Selenium, especially once WebDriver Classic starts to go away, and how API stability is balanced with exposing more and more async capabilities.
That makes a lot of sense. Sometimes it's easier to leave the baggage behind. It's too bad... Selenium is a masterpiece. Thanks for sharing it with the world.
If you're automating filling out the form, you aren't reading the instructions and you aren't checking what you're putting into it as much as you should be. And if you put in incorrect information, it tends to be considered fraud, even if it's downstream of a sloppy LLM rather than downstream of a particular fraudulent scheme.
You're right, but this is where the LLMs are especially useful. Our customers all prompt it to terminate if it doesn't have the right information / the pre submission confirmation doesn't match
How so? Your kid has a body that interacts with the physical world. An LLM is trained on terabytes of text, then modified by human feedback and rules to be a useful chatbot for all sorts of tasks. I don't see the similarity.
If you watch how agents attempt a task, fail, try to figure out what went wrong, try again, repeat a couple more times, then finally succeed -- you don't see the similarity?
LLMs don't do this. They can't think. If you use one for like five minutes it's obvious that just because the text on the screen says "Sorry, I made a mistake, there are actually 5 r's in strawberry", doesn't mean there's any thought behind it.
I mean, you can literally watch their thought process. They try to figure out reasons why something went wrong, and then identify solutions. Often in ways that require real deduction and creativity. And have quite a high success rate.
If that's not thinking, then I don't know what is.
You just haven't added the right tool together with the right system/developer prompt. Add an `add_memory` and `list_memory` tool (or automatically inject the right memories for the right prompts/LLM responses) and you have something that can learn.
You can also take it a step further and add automatic fine-tuning once you start gathering a ton of data, which will rewire the model somewhat.
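A minimal sketch of what those memory tools might look like (the `add_memory`/`list_memory` names come from the comment above; the class and everything else here is hypothetical, with persistence, embeddings, and relevance ranking all omitted):

```python
class MemoryStore:
    """Hypothetical in-memory store an agent harness exposes as tools."""

    def __init__(self):
        self._memories = []

    def add_memory(self, text):
        # Tool the LLM calls to record a lesson learned during a run.
        self._memories.append(text)
        return f"stored memory #{len(self._memories)}"

    def list_memory(self):
        # Tool the LLM calls (or the harness injects into the prompt)
        # before attempting a task, so past mistakes inform the next try.
        return list(self._memories)


store = MemoryStore()
store.add_memory("form X rejects dates not in YYYY-MM-DD format")

# On the next run, the harness would prepend list_memory() output to the
# system prompt, letting the model avoid repeating the earlier mistake.
print(store.list_memory())
```

The "automatic injection" variant mentioned above would skip the explicit `list_memory` call and instead have the harness select and paste relevant memories into each prompt.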
I guess it depends on what you understand "learn" to mean.
But in my mind, if I tell the LLM to do something, it does it wrong, and I ask it to fix it, then if I ask for the same thing in the future and it avoids the mistake it made the first time, I'd say it has learned to avoid that pitfall. I know very well it hasn't "learned" like a human would, I just added the information to the right place, but for all intents and purposes, it "learned" how to avoid the same mistake.
What? LLMs don't think or learn in the sense humans do. They have absolutely no resemblance to a human being. This must be the most ridiculous statement I've read this year.
What was the reason you went down this path instead of extending selenium with AI features?