I don't think that GPT-5 Pro is much better (if better at all) than o3-pro. It's...

mikert89 · 2025-08-10T00:56:45 1754787405

I think it is, I've been using these models for 6 hours a day for almost a year. At any given time I have 2 of the max subscriptions (right now grok and openai).

I have a bug that was a complex interaction between backend and front end over websockets. The day before I was banging my head against the wall with o3 pro and grok heavy, gpt5 solved it first try.

I think its also true that most people arent pushing the limits on the models, and dont even really know how to correctly push the limits. Which is also why openai is not focussed on the best models

energy123 · 2025-08-10T10:02:45 1754820165

Has anyone noticed that OpenAI is cutting off submitted context in ChatGPT Pro?

If I send about 60k tokens, the model can't see the question (at the bottom of the text). I need to reduce it to 50k.

If I send two prompts with 40k tokens, the model can't see the beginning of the first prompt.

This seems quite unethical given they advertise 128k context, and I doubt it's an accident (since it runs in the direction of cost savings).

I_am_tiberius · 2025-08-10T02:14:33 1754792073

Similar usage as me, but I don't see a difference between o3-pro and 5-pro. Sounds odd, but my impression is that o1-pro was better at creating complex independent small functions than o3-pro/5-pro.

mikert89 · 2025-08-10T02:32:14 1754793134

Actually will agree that o1 pro was better than o3 at really deep bug finding/coding analysis. Which is also why i have the theory that they could just turn up the compute to show better results, but dont do to cost. O3 and GPT5 seem heavily quantized, o1 pro was more raw

Another thing I’ll add though, is o3 pro is better through the api than the chat website. They clearly constrain it unless you’re paying the absurd api cost

I_am_tiberius · 2025-08-10T04:45:38 1754801138

yes, I think that's right. o1-pro for sure was too expensive for them.

nojs · 2025-08-10T02:32:43 1754793163

How are you using GPT5 for coding (Cursor?)

mikert89 · 2025-08-10T02:39:01 1754793541

I use repo prompt to build the prompt manually with the code files. Then give it to ChatGPT in the console. Then I have cursor ingrate the results.

I actually think cursor alone is not that good

happycube · 2025-08-10T00:57:08 1754787428

Maybe it's the exposure to Chinese? I've heard that training models on code first helps, so I could see it.

I've also heard hearsay that R1 is quite clever in Chinese, too.

vintagedave · 2025-08-10T01:03:36 1754787816

> Kimi is actually on par with most published poets

Could you provide some examples, please? I find this really exciting. I’ve never yet encountered an AI with good literary writing style.

And poetry is really hard, even for humans.

DrewADesign · 2025-08-10T03:27:12 1754796432

To each their own, but I find the idea of ai-generated poetry sad as hell. I simply can’t see poetry as a collection of evocative words judged without context, in a vacuum— is poetry not both an activity and a relationship to most people? A person deftly portraying some difficult-to-express facet of the human experience and just maybe it viscerally strikes a chord with other people? I just don’t understand how people don’t value the fundamental humanity of that process. Even prose. James Baldwin stories, word for word, would land a hell of a lot differently if they were written and published by Hemingway.

vintagedave · 2025-08-10T09:37:03 1754818623

I 100% agree. I am inclined to think an AI may be able to develop a sense of what words carry in future -- they can analyse it -- but it still lacks real experience.

Plus their creative output in literary quality is dreary, dull, and dire. That's why I was so curious for the OP to share examples.

orbital-decay · 2025-08-10T05:09:34 1754802574

Stylized prose is where Claude 3 Opus particularly shines due to its character training and multilingual performance. It's plagued with claudeisms and has a ton of other shortcomings, but it's still better than any current model at this, including K2, R1, and especially Claude 4. Too bad Anthropic basically reversed their direction on creative writing, despite reporting some improvements each time (which don't seem to be true in practice).