That's a great question - right now it's only looking at the results from a battery of several dozen indicators that we compute upstream of the model itself (which saves massively on tokens).
As small models continue to improve, and edge hardware becomes more capable, we would really like to run larger models that could incorporate full page content and screengrab data, which would be more likely to catch these kinds of attacks.
But we also find that sites that do one shady thing usually do others, which is a big reason why a tiny model like this can work - and why we are betting on low latency being a differentiating factor in real-world impacts.
CLAVIER is amazing - the wire system alone is such a huge improvement over ORCA, and it's now feasible to make much larger patches and refactor safely. Kudos to River for all the hard work on the polish and quality-of-life.
Love 100r! There aren't a ton of examples online, but their livecoding music software/language, ORCA, is a remarkable instrument.
https://100r.co/site/orca.html
Love orca, and that's a really nice example. Messed with it a bit when it came out, and one toy project I'll share in the hopes that someone does it before me: an orca GUI that uses a larger grid with representative images in place of single-char glyphs. I found that writing orca is fairly straightforward: you look up the sheet, find a thing, and do it. It's reading that's the hurdle. An 8-char chunk that made perfect sense when I wrote it takes just as many lookups to read later. This probably gets easier over time, but I still think it's a cool design opportunity.
I work with data warehouses, but I'm really jealous of the way our Finance team uses some abysmal plugin to directly query our GL from inside Excel - building something like that, which can make the contents of a modern data warehouse available to Excel users, has always been a holy grail for me.
My hunch is that exposing free-form SQL in Excel doesn't work, but something more like structured metrics (something roughly like dbt metrics) could potentially work? And tooling like this is probably what I'd want to prototype with.
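To make that concrete - a rough sketch of what I mean, with made-up table and column names: the data team maintains a pre-aggregated metrics view, and the Excel side only ever filters and pivots it, never writes joins or aggregations.

  -- Hypothetical metrics view owned by the data team. Grain, filters, and
  -- definitions are fixed here, so the spreadsheet side stays simple.
  CREATE OR REPLACE VIEW metrics.weekly_revenue AS
  SELECT
      DATE_TRUNC('week', o.ordered_at) AS week,
      o.region,
      o.product_line,
      SUM(o.net_amount)                AS revenue,
      COUNT(DISTINCT o.customer_id)    AS paying_customers
  FROM warehouse.orders o
  WHERE o.status = 'complete'
  GROUP BY 1, 2, 3;

  -- The only "SQL" an Excel user (or the plugin on their behalf) ever issues:
  SELECT * FROM metrics.weekly_revenue WHERE region = 'EMEA';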
Airbnb wrote a few great blog posts about why they built a standardized metrics layer - I think this one gives better technical context on the "what" and "why" than this announcement does:
In practice - complicated time series metrics, especially on top of derived temporal-logic attributes like funnels, activation, etc., are phenomenally difficult for most analysts to write by hand in SQL. We are in the process of switching to dbt metrics, and it cuts the effort for this kind of thing down by 5x-10x; the SQL code dbt generates also runs significantly faster.
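For a sense of why that is - here's roughly what even a simple two-step signup-to-activation funnel looks like written by hand (invented event names and schema, Postgres-flavored), and this is before attribution windows, sessionization, or slowly changing dimensions enter the picture:

  -- Hand-rolled two-step funnel: users who signed up, and of those, the ones
  -- who activated within 7 days. Every extra step or window rule means
  -- another correlated join or CTE like the one below.
  WITH signups AS (
      SELECT user_id, MIN(event_at) AS signed_up_at
      FROM events
      WHERE event_name = 'signup'
      GROUP BY user_id
  ),
  activations AS (
      SELECT s.user_id, MIN(e.event_at) AS activated_at
      FROM signups s
      JOIN events e
        ON e.user_id = s.user_id
       AND e.event_name = 'first_project_created'
       AND e.event_at BETWEEN s.signed_up_at
                          AND s.signed_up_at + INTERVAL '7 days'
      GROUP BY s.user_id
  )
  SELECT
      DATE_TRUNC('week', s.signed_up_at)  AS cohort_week,
      COUNT(*)                            AS signups,
      COUNT(a.user_id)                    AS activated_within_7d,
      COUNT(a.user_id)::float / COUNT(*)  AS activation_rate
  FROM signups s
  LEFT JOIN activations a USING (user_id)
  GROUP BY 1
  ORDER BY 1;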
One of Transform's strong suits is their Tableau integration - they have really good tooling for pushing metrics out to Tableau, rather than relying on Tableau to pull them in. Apparently Airbnb was/is still a very dedicated Tableau shop.
Minerva actually had really poor Tableau support as of a year ago; that is something Transform improved on. Most Minerva consumption was done via Superset, Google Sheets, and email.
The biggest win for me: when your data pipelines are in SQL, changes and maintenance can be somebody else's problem.
I've had a ton of success asking our marketing and business teams to own changes and updates to their data - very often they know the data far better than any engineer would, and likewise they actually prefer to own the business logic and to be able to change it faster than engineering cycles would otherwise allow.
We do the same thing with our business logic & product configuration. I still haven't found anything I couldn't expose as a SQL configuration opportunity.
Even complex things where you have to evaluate a rule for multiple entities can be covered with some clever functions/properties.
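As a made-up illustration of the multi-entity case: the rules live in a plain table that the business team edits, and engineering owns only a generic query that applies them (names and rule shape are invented here):

  -- Business-owned configuration: one row per rule. NULL region means "any".
  CREATE TABLE config.discount_rules (
      rule_name          text,
      min_lifetime_spend numeric,
      region             text,
      discount_pct       numeric
  );

  -- Engineering owns only this generic evaluation; the business owns the rows.
  WITH matches AS (
      SELECT
          c.customer_id,
          r.rule_name,
          r.discount_pct,
          ROW_NUMBER() OVER (
              PARTITION BY c.customer_id
              ORDER BY r.discount_pct DESC  -- if several rules match, keep the most generous
          ) AS rn
      FROM customers c
      JOIN config.discount_rules r
        ON c.lifetime_spend >= r.min_lifetime_spend
       AND (r.region IS NULL OR r.region = c.region)
  )
  SELECT customer_id, rule_name, discount_pct
  FROM matches
  WHERE rn = 1;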
With some oversight this can be fine, but non-engineers can easily end up making the SQL infinitely slower, not to mention getting things wrong (most commonly using inner joins where they shouldn't, or having NULLs in WHERE ... IN clauses).
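The NULL one bites people constantly. The classic version, with made-up tables: a NOT IN against a nullable column silently returns zero rows.

  -- If any referrals.referrer_id is NULL, this returns NO rows at all, because
  -- "x NOT IN (1, 2, NULL)" evaluates to UNKNOWN rather than TRUE.
  SELECT *
  FROM customers c
  WHERE c.customer_id NOT IN (SELECT r.referrer_id FROM referrals r);

  -- Safer equivalent:
  SELECT *
  FROM customers c
  WHERE NOT EXISTS (
      SELECT 1 FROM referrals r WHERE r.referrer_id = c.customer_id
  );
  -- ...or filter the NULLs out explicitly in the subquery.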
I've seen plenty of devs and DAs do the same. The nice thing about SQL is that it's easy enough to create a query that gives you what you want, but at scale it falls flat or performs poorly. It's easy enough to work through most of the time, but too many people lack an understanding of when bottlenecks are likely to happen, and of whether they'll ever actually be an issue.
I consider more than a couple of basic joins in a query to be a code smell.
But in general, as an industry, we aren't there yet. All judgments about Meta's practices aside, it's unlikely that any large organization could comply with a law like this without years or decades of engineering work, and the law seems pretty clearly intended for selective enforcement.