You can go below one byte per parameter. 4-bit quantization is fairly popular. It does affect quality - for some models more so than others - but, generally speaking, a 4-bit quantized model is still going to do significantly better than an 8-bit model with 1/2 parameters.
Interesting to see Grok making benchmark progress. I’m still waiting to see how it performs outside of controlled tests, especially in real-world use like coding, summarizing, or reasoning.
I’m still torn on this. On one hand, memory could make ChatGPT more useful, especially for people using it regularly for work or coding. But on the other hand, the idea that it “remembers” me just feels a little uncomfortable.
I’d want more control over what’s remembered and when. Curious if anyone here has used this yet — is it actually helpful in practice?
> I’d want more control over what’s remembered and when. Curious if anyone here has used this yet
I use the "memory" feature of ChatGPT, and taking a look right now, it seems to have about ~30 items saved from me, some of them are like "Is using egui for a UI task, particularly related to configuring smooth automatic scrolling in a scrollarea." which is useful for maybe the ~3 chats I had about it, and also other things like "Prefers more accuracy in terminology and is looking to represent LLMs in a detailed and structured way." that are more broadly applicable.
Then you can obviously remove any of them, and also manually add by telling it explicitly you want something added.
I'm not sure of its usefulness, I guess it's nice that it correctly "knows" I'm mostly on Arch Linux most of the time but have my servers with NixOS, so if I ask it to create new unix commands I usually get something that works on both, or two versions. But sometimes it also incorrectly infers something because I didn't specify otherwise in the prompt and didn't think of it, but it could see something from the memories.
I’ve been trying it out recently, mostly for writing and summarizing research. The memory feels subtle so far — it doesn’t jump in unless you really build on past prompts.
That said, I totally agree about control. I wish there was a more obvious way to “pause” or “reset” memory mid-session instead of diving into settings. It’s useful, but still a little opaque.
Totally agree, it's getting scary out there. The AMOS attack really shows how attackers are stepping up their game. If even legit websites can be weaponized like that, what’s actually safe anymore?
I only did it once (fb login for airbnb) and I deeply regret to this day. Airbnb cant even migrate my account to another type (plain old username & password is best for me).
The reason - over time I started renting an apartment there, the reputation and stream of guests is simply too big wall to climb over (I mean it can be done easily but it would hurt financially pretty bad).
Now I cant get rid of that cancer that facebook is. 10 years ago it didnt seems so, my mistake, should have seen it coming.
What? You’d rather have more, smaller rando services handling your passwords than a few big names that have the resources to put into at least paying lip service to security?
Would you also need a separate email address for each account? I think most mailboxes support a recovery email address and some require one for billing.
Is truly separate accounts for everything really doable?