jessmartin · 2026-03-11T15:59:46 1773244786

strong agree. I always have the LLM put an actual markdown doc in a docs/plans/ folder before starting work. I often, but not always, review it.

Aside: it also helps for code review! Review bots can point out the diff between plan and implementation.

Some examples for the curious: https://github.com/sociotechnica-org/symphony-ts/tree/main/d...

patchnull · 2026-03-11T16:06:51 1773245211

[flagged]

ElFitz · 2026-03-11T17:37:21 1773250641

It's one of the things that surprised me when I first started using the compound engineering plugin.

I've been considering adding a review gate with a reviewing model solely tasked with identifying gaps between the plan and the implementation.
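A minimal sketch of what such a gate might check, assuming the plan names the files it intends to touch in backticks and the list of changed paths comes from the diff. The function names, the plan layout, and the example paths are all hypothetical; a real gate would hand the plan and diff to a reviewing model rather than compare file names.

```python
import re


def planned_paths(plan_markdown: str) -> set[str]:
    """Collect file paths written in backticks inside the plan (assumed convention)."""
    return set(re.findall(r"`([^`\s]+\.[A-Za-z0-9]+)`", plan_markdown))


def review_gate(plan_markdown: str, changed_paths: set[str]) -> dict[str, list[str]]:
    """Report gaps in both directions between the plan and the implementation."""
    planned = planned_paths(plan_markdown)
    return {
        "planned_but_untouched": sorted(planned - changed_paths),
        "touched_but_unplanned": sorted(changed_paths - planned),
    }


# Hypothetical plan document and diff, for illustration only.
plan = """
# Plan: add retry to fetcher
- Update `src/fetcher.ts` to retry on 5xx
- Add tests in `tests/fetcher.test.ts`
"""
changed = {"src/fetcher.ts", "src/config.ts"}

report = review_gate(plan, changed)
gate_passed = not any(report.values())  # the gate passes only when there are no gaps
```

Here the gate fails: the planned test file was never touched, and a config file changed without being in the plan.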

jessmartin · 2026-03-10T19:54:45 1773172485

I got my own level 8 factory working in the last few days and it’s been exhilarating. Mine is based on OpenAI’s Symphony[1], ported to TypeScript.

Would be happy to swap war stories.

<myhnusername>@gmail.com

whattheheckheck · 2026-03-10T21:09:35 1773176975

How much money have you made with this approach?

ativzzz · 2026-03-10T21:41:47 1773178907

I think the opposite question is more prevalent: how much money have you spent?

jessmartin · 2026-03-11T15:57:25 1773244645

Not a small amount :)

I spend $140/mo on Anthropic + OpenAI subs and I use all my tokens all the time.

I've started spending about $100/week on API credits, but I'd like to increase that.

ativzzz · 2026-03-11T18:26:27 1773253587

Still waiting for these software factories to solve problems that aren't related to building software factories. I'm sure it'll happen sooner or later, but so far all the outputs of these "AI did this whole thing autonomously" projects are just tools to have AI build things autonomously. It's like a self-reinforcing pyramid.

AI agents haven't yet figured out a way to do sales, marketing or customer support in a way that people want to pay for.

Maybe that won't be necessary and instead the agent economy will be agents providing services for other agents.

jessmartin · 2026-03-11T15:57:31 1773244651

none yet!

quotemstr · 2026-03-10T21:51:20 1773179480

... is that the purpose of life? The sole reason for doing anything?

twelve40 · 2026-03-11T03:57:12 1773201432

With so much hype it's a valid question: "is this useful/practical, or just a fun rabbit hole/productivity porn?" Money is the most obvious metric. Feel free to ask the parent about other possible metrics that might be useful to others instead of asking rhetorical questions.

fragmede · 2026-03-11T08:36:53 1773218213

Unfortunately, it's hard to quantify "How much fun did you have?"

jessmartin · 2026-02-07T18:31:26 1770489086

I wonder if this is just a byproduct of factories being very early and very inefficient. Yegge and Huntley both acknowledge that their experiments in autonomous factories are extremely expensive and wasteful!

I would expect cost to come down over time, using approaches pioneered in the field of manufacturing.

jessmartin · 2026-02-07T18:27:52 1770488872

They have a Products page where they list a database and an identity system in addition to attractors: https://factory.strongdm.ai/products

For those of us working on building factories, this is pretty obvious, because you immediately need shared context across agents / sessions and an improved ID + permissions system to keep track of who is doing what.

jessmartin · 2026-02-07T18:17:29 1770488249

I’ve been building using a similar approach[1] and my intuition is that humans will be needed at some points in the factory line for specific tasks that require expertise/taste/quality. Have you found that to be the case? Where do you find that humans should be involved in the process for maximal leverage?

To name one probable area of involvement: how do you specify what needs to be built?

[1] https://sociotechnica.org/notebook/software-factory/

navanchauhan · 2026-02-07T19:36:20 1770492980

You're absolutely right ;)

Your intuition/thinking definitely lines up with how we're thinking about this problem. If you have a good definition of done and a good validation harness, these agents can hill climb their way to a solution.
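A toy sketch of that loop, assuming the "validation harness" is a scoring function and the "definition of done" is a target score. Everything here (`hill_climb`, `score`, `propose`, the integer stand-in for a code change) is hypothetical and only illustrates the accept-if-better pattern, not how any real factory is built.

```python
import random


def hill_climb(start, score, propose, done_score, max_steps=1000, seed=0):
    """Keep a candidate only if the harness scores it higher; stop once 'done'."""
    rng = random.Random(seed)
    best, best_score = start, score(start)
    for _ in range(max_steps):
        if best_score >= done_score:  # definition of done reached
            break
        candidate = propose(best, rng)  # stand-in for an agent's attempt
        candidate_score = score(candidate)  # stand-in for running the harness
        if candidate_score > best_score:  # accept strict improvements only
            best, best_score = candidate, candidate_score
    return best, best_score


# Toy problem: find x == 42. The harness scores closeness; 0 means done.
def score(x: int) -> int:
    return -abs(x - 42)


def propose(x: int, rng: random.Random) -> int:
    return x + rng.choice([-1, 1])


best, best_score = hill_climb(start=0, score=score, propose=propose, done_score=0)
```

Note that nothing in the loop decides what the target should be; `score` and `done_score` are supplied from outside, which is where the human judgment comes in.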

But you still need human taste/judgment to decide what you want to build (unless your solution is to just brute force the entire problem space).

For maximal leverage, you should follow the mantra "Why am I doing this?" If you ask this enough times, you'll come across the bottleneck that can only be solved by you for now. As a human, your job is to set the higher-level requirements for what you're trying to build. Coming up with these requirements and then using agents to shape them up is acceptable, but deciding what needs to be built is definitely where human judgment is required. At the same time, I never want to be doing something the models are better at. Until we crack the proactiveness part, we'll be required to figure out what to do next.

Also, it looks like you and Danvers are working in the same space, and we love trading notes with other teams working in this area. We'd love to connect. You can either find my personal email or shoot me an email at my work email: navan.chauhan [at] strongdm.com

jessmartin · 2026-01-26T04:28:29 1769401709

Wow. This is exactly my kitchen faucet. Which makes sense, because my house was built in 1961.

Works great. Amazing how durable the faucet is!

jessmartin · 2025-10-21T10:50:34 1761043834

This! It’s both-and. Literacy has been undeniably good, but we rarely consider the consequences of widespread literacy.

There’s a way of knowing something that can be recalled orally from memory that is different and valuable. But we even measure it using a yardstick for written knowledge (accuracy, breadth, etc).

I believe this overemphasis on written knowledge (really, it’s implicitly a denial that any other type exists) is part of what drives the hysteria about LLMs ending the world. LLM doomerism has to believe that written knowledge is at least the most important if not the only necessary form of knowledge.

jessmartin · 2025-10-21T10:04:16 1761041056

Can you say more? Link?

paulddraper · 2025-10-21T16:26:00 1761063960

https://www.google.com/search?q=claude+code+router

jessmartin · 2025-10-16T23:05:35 1760655935

Same. Was very excited about MCP but Claude Code + CLI tools is so much nicer.

jessmartin · 2025-08-18T10:14:12 1755512052

Has anyone put together a comparison table for Desktop apps?

I’ve started using conductor.build and it feels nice, but would happily evaluate others.