Since the article is largely about open weights models, I think the argument is that this is the "last gasp" and soon doing inference at home will be common.
The small models that I can run at home are becoming more capable, and I have replaced some API-based tasks with local inference as they improve, but large open weights models are still a lot stronger. The nice thing with larger open weights models is that competing providers serve them at modest margins and prices. I don't have the hardware to run the largest Qwen models, but I can get API access at low cost. Since there are only modest barriers to new commercial inference providers for these models I'm not worried that API access to them will become drastically more expensive at some future time.
The trend over the last decades has been towards more centralization, and I don't see that changing. Unless we radically change our economic system, the rent seekers will always win. There will probably be fewer of them, but they will be even bigger.
The work on self-driving cars would not be done if there were not a way to profit from it. If it isn’t expected to earn more than it costs, it wouldn’t be done.
Now maybe it’s all being done because of expectations of a monopoly (this is a free market consequence, right?), but …
Research has a large asymmetry to it: once someone shows something can be done, others can follow quite easily.
And more substantially, once someone shows something can be done, it's orders of magnitude easier for $ENGINEER at $CORP to get $CSUITE's attention to get a budget / justify the risk.
Your self-driving car example is the best example of this. Once Waymo and Tesla got popular, NVIDIA really started pushing its self-driving-cars-for-everyone tech.
> The work on self-driving cars would not be done if there were not a way to profit from it. If it isn’t expected to earn more than it costs, it wouldn’t be done.
I disagree. "Expected" is hiding a lot of the work there, and the reasons for that expectation could be rational or completely irrational.
Hypothetically, "work on self-driving cars" would be done if some crazy psychic-trusting billionaire's psychic told him it would be profitable, even if all rational analysis said it would be a bad business. A lot of major investments, especially in tech, are made based on hope or as a bet, not due to any real foresight.
Yeah. Lots of Discord-like free-software (as in freedom) chat apps are spawning. I think it's clear that whichever becomes the most popular will win not because it has better code but because it manages to build a stronger community around the project.
There are always quirks and edges. Take Bluesky itself: there are a number of viable apps for it (some better, some worse), and they're all slightly different. There was a large number of Reddit apps, and every single one was very different.
Completely disagree. The inability to handle specific math or CS is a matter of training and experience, not reasoning and intelligence. The barista is quite capable of reasoning and learning feats the LLMs aren't close to.
Yeah, there appears to be this idea that "being smart" is the same thing as "knowing facts", which I don't think is realistic.
I know plenty of people who are considerably smarter than me, but don't know nearly as much as I do about computer science or obscure 90's video game trivia. Just because I know more facts than they do (at least in this very limited scope) doesn't mean that they're less capable of learning than I am.
As you said, a barista is very likely able to reason about and learn new things, which is not something an LLM can really do.
It doesn't look anything like AGI and no one who knows what that means would be confused in any era.
Is it useful? Yes. Is it as smart as a person? Not even remotely. It can't even remember things it was told 5 minutes ago. Sometimes even if they are still in the context window, uncompacted!
No, the big thing with AGI was that it was general. The AI systems we made were extremely narrow: identifying things out of a set of classes, route planning, or something similarly specific. We couldn't just hand the systems a new kind of task, often not even extremely similar ones. We've been making superhuman-level narrow AI things for many years, but for a long time even extremely basic and restricted worlds were still beyond what more general systems could do.
If LLMs are your first foray into what AI means and you were used to the term ML for everything else I could see how you'd think that, but AI for decades has referred to even very simple systems.
If AGI doesn't mean human-level, then what does? As you say, every application of A* is in some way "AI", so we had this idea of "AGI" for something "actually intelligent". But maybe I'm wrong and AGI never meant that. What term does mean that?
They admit a raw LLM would be dangerous and then proceed to use RAG... How is this any better? You cannot allow an LLM to generate the final outbound message if you are liable for what it says.
An LLM to understand the question? Yes. Generating SQL, maybe with embeddings, to look up answers? Yes. Generating the final response? No.
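That split can be sketched in a few lines. This is a hypothetical illustration, not anyone's actual system: the two `llm_*` stubs stand in for real model calls, and all the names and templates are made up. The point is that the model classifies and extracts, while the outbound message comes only from fixed templates, so the model can never put arbitrary words in your mouth.

```python
# Sketch: LLM understands the question; the final response is templated.
# All names are illustrative; llm_* functions stand in for real model calls.

TEMPLATES = {
    "order_status": "Your order {order_id} is currently: {status}.",
}
FALLBACK = "Sorry, I can't help with that. A human agent will follow up."

def llm_classify_intent(question: str) -> str:
    """Stand-in for an LLM call that maps a question to a known intent."""
    return "order_status" if "order" in question.lower() else "unknown"

def llm_extract_params(question: str, intent: str) -> dict:
    """Stand-in for an LLM call that pulls structured fields from the text."""
    return {"order_id": "A1001"}

def run_lookup(intent: str, params: dict) -> dict:
    """Deterministic lookup (e.g. generated SQL against your DB), no free text."""
    return {"order_id": params["order_id"], "status": "shipped"}

def answer(question: str) -> str:
    intent = llm_classify_intent(question)        # LLM: understand the question
    if intent not in TEMPLATES:
        return FALLBACK                           # never improvise off-script
    params = llm_extract_params(question, intent)
    row = run_lookup(intent, params)
    return TEMPLATES[intent].format(**row)        # fixed wording -> auditable
```

Because every outbound sentence is one of your own templates, the set of things the bot can say is finite and reviewable, which is exactly what you want when you're liable for the output.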
4-bit is as low as I like to go. There are KLD and perplexity tests that compare quantizations, where you can see the curve of degradation, but perplexity and KLD numbers can be misleading compared to real-world use, where small errors compound over long sessions.
In my anecdotal experience I’ve been happier with Q6 and dealing with the tradeoffs that come with it over Q4 for Qwen3.5 27B.
Generally the perplexity charts indicate that quality drops significantly below 4-bit, so in that sense 4-bit is the sweet spot if you're resource constrained.
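For anyone unfamiliar with the KLD tests mentioned above, here's a minimal sketch of what they measure: per token position, how far the quantized model's next-token distribution drifts from the full-precision model's. The probability vectors below are made-up toy numbers; in practice you'd collect them from the logits of the two model variants (llama.cpp's perplexity tool has a `--kl-divergence` mode for this).

```python
import math

def kl_divergence(p, q):
    """KL(P || Q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Toy next-token distributions over a 3-token vocabulary (illustrative only).
full_precision = [0.70, 0.20, 0.10]   # reference model
q8_quantized   = [0.68, 0.21, 0.11]   # small divergence -> near-lossless
q2_quantized   = [0.40, 0.35, 0.25]   # large divergence -> visible degradation

print(kl_divergence(full_precision, q8_quantized))  # tiny
print(kl_divergence(full_precision, q2_quantized))  # much larger
```

Averaged over many tokens, this gives the degradation curve: it stays nearly flat down to around 4-bit and then bends sharply, which is why 4-bit is often called the sweet spot. But as noted above, a small per-token divergence can still compound over a long session.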