The downside is that a lot of those who argue try out some stuff in ChatGPT or another chat interface without digging any further: expecting "general AI" and asking general questions, which is exactly where LLMs are most prone to hallucination. Another part is cheap setups where multiple people share the same subscription and get their history polluted.
They don't have time to check further because they are busy with their lives.
And the people who did check it don't have the time to prove it to every skeptic in exactly the way that particular skeptic would find convincing.
Personally, about a year ago I was that person. I had tried ChatGPT a bit but didn't dig further, because all the hype was off-putting, and of course I found more important and interesting things to do with my life than chat with some silly bot that I could easily fool with trick questions, or dismiss as useless because it hallucinated something in a script I wanted.
I finally took the plunge and did a real deep dive into AI around April last year, and only seeing it with my own eyes convinced me: using the API to build my own agent loop, extracting details from images and PDF files, iterating on code, and turning unstructured "human" input into structured output I can handle in my programs.
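For what an agent loop like that can look like, here is a toy sketch. Everything in it is made up for illustration: `call_llm` stands in for a real chat-completion API call (stubbed here so the example runs offline), and `extract_total` is a hypothetical tool the model can ask for.

```python
import re

# Hypothetical stand-in for a real chat-completion API call. A real agent
# loop would send `messages` to a provider and parse the response; this
# stub pretends the model first requests a tool, then returns an answer.
def call_llm(messages):
    if not any(m["role"] == "tool" for m in messages):
        return {"tool": "extract_total", "args": {"text": messages[-1]["content"]}}
    return {"answer": messages[-1]["content"]}

def extract_total(text):
    # Toy "tool": pull the first number out of unstructured input.
    m = re.search(r"\d+(\.\d+)?", text)
    return m.group(0) if m else "unknown"

TOOLS = {"extract_total": extract_total}

def agent_loop(user_input, max_steps=5):
    messages = [{"role": "user", "content": user_input}]
    for _ in range(max_steps):
        reply = call_llm(messages)
        if "answer" in reply:
            return reply["answer"]
        # The model asked for a tool: run it, feed the result back in.
        result = TOOLS[reply["tool"]](**reply["args"])
        messages.append({"role": "tool", "content": result})
    raise RuntimeError("agent did not finish within max_steps")

print(agent_loop("Invoice says the total is 42 EUR"))  # -> 42
```

The point of the loop is just that: call the model, execute whatever tool it requests, append the result to the conversation, and repeat until it produces a final answer.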
*Data classification is easy for an LLM. Data transformation is a bit harder but still works great. Creating new data is hard: when it has to answer questions by generating material from thin air, it will hallucinate like mad.*
For a classification task like "is this a cat? Answer with yes or no", it is hard to get the latest models to hallucinate.
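One reason classification is so robust is that you can constrain the output to a closed label set and reject anything else, so a hallucination has nowhere to hide. A minimal sketch, with `ask_model` as a stand-in for a real API call:

```python
# Constrain the model to a closed label set {"yes", "no"} and validate.
# `ask_model` is a hypothetical callable wrapping a real LLM API.
def classify(text, ask_model):
    prompt = (
        "Does the following describe a cat? "
        "Answer with exactly one word: yes or no.\n\n" + text
    )
    answer = ask_model(prompt).strip().lower()
    if answer not in {"yes", "no"}:
        # Anything outside the label set violates the contract;
        # in practice you would retry rather than raise.
        raise ValueError(f"model broke the contract: {answer!r}")
    return answer == "yes"

# Stub model for demonstration; a real one would call an API.
print(classify("A small furry animal that purrs", lambda p: "Yes"))  # True
```

Generation tasks have no such contract to validate against, which is where the "from thin air" hallucinations come in.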
So I tried it, and it is worse than having a random dude from Fiverr write your code: it is actively malicious and goes out of its way to deceive and to subtly sabotage existing working code.
Do I now get the right to talk badly about all LLM coding, or is there another exercise I need to take?
It's like a piano that goes out of tune at random: even if I get through 1, 2, or even 10 songs without it happening, I'm not interested in playing that piano on stage.
This is a known sales trick called door-in-the-face. First you present your victim with an outrageous claim, then follow with a more modest, more reasonable-sounding one.
In truth, neither claim is reasonable, but because of the door in the face, the victim is more susceptible to the latter claim. Without the more outrageous claim, it is unlikely the victim would have believed the latter one.
In reality, "AGI", the "100x miracle", and the "10x miracle" are all outrageous claims, and I call bullshit on all of them.
I am more concerned about the bait and switch that is coming: people will get used to the convenience for $100 a year or $100 a month, and after 10 years the price goes up 5x. What are people going to do then?
I guess we all know and „love“ how every five minutes, some breathless hipster influencer posts „This changes everything!!!“ to every new x.y.1 AI bubble increment.
But honestly? This here really is something.
I can vividly imagine how in a not too far future, there will only be two types of product companies: those that work like this, and those that don’t — and vanish.
Edit: To provide a less breathless take myself:
What I can very realistically imagine is that just like today sane and level-headed startups go „let’s first set up some decent infrastructure-as-code, a continuous delivery pipeline, and a solid testing framework, and then start building the product for good“, in the future sane and level-headed startups will go „let’s first set up some decent infrastructure-as-code, a continuous delivery pipeline, a solid testing framework, and a Ramp-style background agent — and then start building the product for good“.
Yeah, I feel somewhat the same way. It looks like some serious engineering effort went into this, and like there should be a way to measure its impact on developer productivity and quality of output. I'm a bit hesitant, considering finance is not an industry where you want to introduce security problems, but it will nonetheless be a good test of these tools.
If it really does work I expect there will be many paid and open source variants that other companies can adopt into their workflows. So I'll patiently wait for the outcomes before trying something like this, but I'm glad someone is.
This is by far the best summary of the state of affairs, or rather, the most sensible perspective that one should have on the state of affairs, that I've read so far.
As a German, I can say I'm very happy for the intervention some decades ago, but it's of course just one example, potentially a bad one, and likely cannot be generalized; I just wanted to throw this into the ring as a positive example.
> Germany waged war on just about the whole world, the response to that was one of defense, not offense.
I find it useful to distinguish legality from morality of the move of capturing Maduro and his wife.
One way I approach it is to ask myself: if one could have Maduro returned to Venezuela today, would one? Perhaps the answer that most people would give is yes (i.e. everyone would be better off), but I'm not so sure.
Yeah, a pattern like „do the heavy lifting with cheap regexes, and every 100 line items, do one expensive LLM run comparing inputs, outputs, and existing regexes to fine-tune the regexes“.
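That pattern can be sketched roughly like this. Everything here is illustrative: `review_with_llm` is a hypothetical hook that would, in practice, send the batch of inputs, outputs, and the current regex to a model and parse back a refined pattern.

```python
import re

BATCH_SIZE = 100  # one expensive LLM call per this many line items

def process(lines, pattern, review_with_llm):
    """Cheap regex on every line; periodic LLM review of the regex itself."""
    results, batch = [], []
    for line in lines:
        m = re.search(pattern, line)
        results.append(m.group(0) if m else None)
        batch.append((line, results[-1]))
        if len(batch) == BATCH_SIZE:
            # Show the LLM the inputs, outputs, and current regex,
            # and let it suggest a fine-tuned replacement pattern.
            pattern = review_with_llm(pattern, batch)
            batch = []
    return results, pattern
```

The cost structure is the whole point: the regex runs on every item, and the LLM only sees one summary call per hundred.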
Can you explain what you mean by that? As of this writing, there is an „Iran protests“ block on cnn.com at position #2 or #3 depending on how you count, well within the first 20% of the endless-scroll homepage.