Hacker Newsnew | past | comments | ask | show | jobs | submit | Nerd_Nest's commentslogin

These tips seem useful. Has anyone here tried them out yet?


Whoa, 120B? That’s huge.


120B MoE. The 20B is dense.

As far as dense models go, it’s larger than many but Mistral has released multiple 120B dense models, not to mention Llama3 405B.


for posterity, since shown that is it actually MoE

> 21B parameters with 3.6B active parameters


How much ram do you need to run this !!??


Probably about one byte per weight (parameter) plus a bit extra for the key-value cache (depends on the size of the context window).


You can go below one byte per parameter. 4-bit quantization is fairly popular. It does affect quality - for some models more so than others - but, generally speaking, a 4-bit quantized model is still going to do significantly better than an 8-bit model with 1/2 parameters.


These tools save time, sure, but I’ve seen junior devs blindly accept AI suggestions without really understanding the code.


Interesting to see Grok making benchmark progress. I’m still waiting to see how it performs outside of controlled tests, especially in real-world use like coding, summarizing, or reasoning.


I’m still torn on this. On one hand, memory could make ChatGPT more useful, especially for people using it regularly for work or coding. But on the other hand, the idea that it “remembers” me just feels a little uncomfortable.

I’d want more control over what’s remembered and when. Curious if anyone here has used this yet — is it actually helpful in practice?


> I’d want more control over what’s remembered and when. Curious if anyone here has used this yet

I use the "memory" feature of ChatGPT, and taking a look right now, it seems to have about ~30 items saved from me, some of them are like "Is using egui for a UI task, particularly related to configuring smooth automatic scrolling in a scrollarea." which is useful for maybe the ~3 chats I had about it, and also other things like "Prefers more accuracy in terminology and is looking to represent LLMs in a detailed and structured way." that are more broadly applicable.

Then you can obviously remove any of them, and also manually add by telling it explicitly you want something added.

I'm not sure of its usefulness, I guess it's nice that it correctly "knows" I'm mostly on Arch Linux most of the time but have my servers with NixOS, so if I ask it to create new unix commands I usually get something that works on both, or two versions. But sometimes it also incorrectly infers something because I didn't specify otherwise in the prompt and didn't think of it, but it could see something from the memories.


I’ve been trying it out recently, mostly for writing and summarizing research. The memory feels subtle so far — it doesn’t jump in unless you really build on past prompts.

That said, I totally agree about control. I wish there was a more obvious way to “pause” or “reset” memory mid-session instead of diving into settings. It’s useful, but still a little opaque.


Cool to see how far NotebookLM has come. For anyone interested, I came across this breakdown on the Android app launch and how it works with AI: https://www.squaredtech.co/google-launches-notebooklm-androi...


Grok in Telegram? That’s one bold move by xAI. I'm curious to see how users actually engage with it in day-to-day conversations.


Totally agree, it's getting scary out there. The AMOS attack really shows how attackers are stepping up their game. If even legit websites can be weaponized like that, what’s actually safe anymore?


Stop using federated login services and maintain unique accounts per service. Blacklist any service that doesn't provide their own account management.


I only did it once (fb login for airbnb) and I deeply regret to this day. Airbnb cant even migrate my account to another type (plain old username & password is best for me).

The reason - over time I started renting an apartment there, the reputation and stream of guests is simply too big wall to climb over (I mean it can be done easily but it would hurt financially pretty bad).

Now I cant get rid of that cancer that facebook is. 10 years ago it didnt seems so, my mistake, should have seen it coming.


What? You’d rather have more, smaller rando services handling your passwords than a few big names that have the resources to put into at least paying lip service to security?


Would you also need a separate email address for each account? I think most mailboxes support a recovery email address and some require one for billing.

Is truly separate accounts for everything really doable?


I don't know why you would need a separate email address. I would think the password is the important part.


I mean using google [for example] to log in everywhere vs using gmail.


Get a mail service that lets you use your own domain. Then you setup a wildcard to use any address you want with zero effort.


Ah, that clears things up. No wonder the website is hard to find if they hadn’t officially announced the company yet.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: