I would have suspected it too, but I’ve been struggling with OpenAI returning syntactically invalid JSON when provided with a simple pydantic class (a list of strings), which shouldn’t be possible unless they have a glaring error in their grammar.
You might be using JSON mode, which doesn’t guarantee a schema will be followed, or structured outputs without strict mode. With strict structured outputs you do get the property that the response is either a valid instance of the schema or an explicit error (e.g. for a refusal).
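For reference, a rough sketch of the difference with the OpenAI Python SDK (the model name and prompt are placeholders, not from this thread):

```python
from openai import OpenAI
from pydantic import BaseModel

client = OpenAI()

class Items(BaseModel):
    items: list[str]

# JSON mode: guarantees syntactically valid JSON, but not your schema.
loose = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "List three fruits as JSON."}],
    response_format={"type": "json_object"},
)

# Structured outputs via a pydantic model: the SDK derives a strict JSON
# schema, so the result is either a valid Items instance or an exception
# (e.g. on refusal or truncation).
strict = client.beta.chat.completions.parse(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "List three fruits."}],
    response_format=Items,
)
print(strict.choices[0].message.parsed)
```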
So cool to see Anthropic support this feature.
I’m a heavy user of the OpenAI version, but they seem to have a bug where the model frequently returns a string that is not syntactically valid JSON, leading the OpenAI client to raise a ValidationError when trying to construct the pydantic model.
Curious if anyone else here has experienced this?
I would have expected the implementation to prevent this, maybe using a state machine to only allow the model to pick syntactically valid tokens.
Hopefully Anthropic took a different approach that doesn’t have this issue.
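For what it's worth, here's a toy illustration of the state-machine idea; it's not anyone's actual implementation, and the prefix check is a stand-in for a real grammar/schema automaton:

```python
import json

def is_valid_prefix(text: str) -> bool:
    # Toy stand-in: could `text` still be completed into valid JSON?
    # Real constrained decoders compile the grammar (or JSON schema) into a
    # state machine rather than probing with candidate suffixes like this.
    for suffix in ("", "]", "}", '"', '"}', '"]', "]}"):
        try:
            json.loads(text + suffix)
            return True
        except json.JSONDecodeError:
            continue
    return False

def allowed_tokens(generated: str, candidates: list[str]) -> list[str]:
    # Mask out any token that would make the output unrecoverably invalid,
    # so the sampler can only pick syntactically legal continuations.
    return [t for t in candidates if is_valid_prefix(generated + t)]

print(allowed_tokens('{"names": ["a"', [']}', ', "b"', 'oops}']))
# -> [']}', ', "b"']
```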
Brian on the OpenAI API team here. I would love to help you get to the bottom of the structured outputs issues you're seeing. Mind sending me some more details about your schema/prompt, or any request IDs you might have, to by[at]openai.com?
Yeah I have, but I think only when it gets stuck in a loop and outputs, for example, an array that goes on forever. A truncated array is obviously not valid JSON, but it'd be hard to miss that if you're looking at the outputs.
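One cheap way to catch that case before it ever hits json.loads is to check the finish reason first; a sketch assuming the OpenAI Python SDK, with the function name being my own:

```python
import json

def parse_or_flag_truncation(completion):
    # `completion` is assumed to be a chat.completions response object.
    choice = completion.choices[0]
    if choice.finish_reason == "length":
        # The model hit max_tokens mid-output (e.g. a runaway array), so the
        # text is cut off and will not parse as JSON.
        raise ValueError("Response truncated by max_tokens; JSON is likely invalid.")
    return json.loads(choice.message.content)
```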
Each year they pay me $1,000 (in the form of HSA deposits, which I can invest) to do basic things like get a checkup, get a flu shot, and get a blood test. I sync my wearable data and they pay me $1-2 each time I exercise or get enough sleep.
A reasonably common belief among people who have studied the issue is that Tether was at one point unbacked (see the NY AG report), and likely fudged their numbers a bit by holding commercial paper that was nominally worth $1 but in practice could be bought for less than $1, but they have since made enough through various investments that they could now plausibly be fully backed.
And to be fair, given basically all of the fraudulent companies that managed to pass audits, the fact that they won't even bother to get one is a pretty strong signal.
But hey, ultimately it's just gonna blow up the economy at some point, but we'll be fine right? Right?
1. Bloomberg Businessweek — “Anyone Seen Tether’s Billions?” (Cover Story, Oct 2021) — a deep investigation into Tether’s backing, counterparties, and leadership. www.bloomberg.com/news/features/2021-10-07/crypto-mystery-where-s-the-69-billion-backing-the-stablecoin-tether
Apologies, I meant about them printing the money Friday. Trying to understand how they're supposed to be stabilizing bitcoin prices, but also why everyone is worried about USDT and whether it's "real".
Tether's backing and solvency become more important when it's a major provider of crypto liquidity.
If liquidity is generated by many participants, the failure of one doesn't impact the underlying asset.
If liquidity is concentrated in one participant, it increases the potential volatility of the asset, as that participant's failure can drastically limit liquidity and leave the asset open to bigger price swings.
That said, even at $1B, Tether is a smaller portion of the BTC market than it was historically.
The US Government wanted companies to build fabs in the US so it offered them money to do it.
Intel, which was one of those companies, but not the only one, took them up on the offer and was paid to begin construction on a fab in the US.
Normally when we pay businesses to do things we don't demand equity stakes in the businesses afterwards.
Notably, the biggest shareholders in Intel appear to be retirement funds of Americans - so Trump has just pilfered some money from the retirement accounts of Americans.
Figure 5 is really quite remarkable. It seems to show that normal LLMs are better at tasks where the correct answer is likely to be the next token. For tasks that require a small number of intermediate steps, current reasoning models do much better, but break down as the number of intermediate steps grow.
This seems to indicate that the next generation of models should focus on recursively solving small parts of the problem before function-calling another model to solve another small part of the problem and working its answer into the reasoning loop.
Many seem to be citing this paper as an indication that LLMs are over - I think this indicates a clear path towards the next step function change in their abilities.
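A very rough sketch of what that recursive decomposition could look like; this is not from the paper, and call_model, the prompts, and the depth cutoff are all hypothetical:

```python
def call_model(prompt: str) -> str:
    # Placeholder for any chat-completion style LLM call.
    raise NotImplementedError

def solve(problem: str, depth: int = 0, max_depth: int = 3) -> str:
    # Past the depth cutoff, answer directly: these are the cases where the
    # correct answer is likely to be the next token and plain LLMs do well.
    if depth >= max_depth:
        return call_model(f"Answer directly: {problem}")

    # Ask the model to split the task into a small number of sub-problems.
    plan = call_model(f"List the sub-problems needed to solve: {problem}")
    sub_problems = [line.strip() for line in plan.splitlines() if line.strip()]

    # Recursively solve each piece (conceptually, function-calling another
    # model), then fold the partial answers back into the reasoning loop.
    partials = [solve(sub, depth + 1, max_depth) for sub in sub_problems]
    return call_model(
        f"Combine these partial results into a final answer for '{problem}':\n"
        + "\n".join(partials)
    )
```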
Respondology is working on a social media comment activation platform, building tools to automatically hide spam, abuse, and hate comments, understand audiences, and respond to users using GenAI/LLMs. We are a small team that recently raised a funding round and are growing rapidly. We are looking to hire a Senior Backend Engineer with significant Python experience; experience with OpenSearch/Elasticsearch is also a plus [120k-140k + equity + bonus]. Our core tech stack is: React, a core Ruby-on-Rails monolith with Python+FastAPI microservices, Postgres, AWS. We are based out of Boulder, CO, but the engineering team is distributed across the US. Learn more here: https://respondology.com/careers/
The last Starship test flight failed, so SpaceX is probably not going to make the Mars launch window next year that they were aiming to hit. It will also probably not meet NASA's goal of returning to the Moon in 2027.