That's just my age showing, I guess. For me, all 2.5D shooters are DOOMlikes.
I was actually first thinking about calling it a Quake-like, since IMHO that is much better known for its multiplayer, but then I never got around to implementing powerups and all the other stuff people might have expected.
DOOM was revolutionary because it switched from Wolfenstein-style raycasting to binary space partitioning (BSP) rendering, which allowed much more complex architecture.
Yea, I actually thought of the viability of SQL for games while working on DOOMQL. It's just so easy to express a lot of game logic in SQL queries. As an avid OSRS player I was thinking about doing a simple MUD/MMO next.
Thanks for the pointer to SpacetimeDB - haven't heard of it before!
The version is just a monotonically increasing atomic integer assigned to that btree node. Each writer increments the version when it releases the lock IF it has modified the node.
Wraparound is only a theoretical issue: there would have to be exactly 2^64 modifying writers between a reader first reading the version and later verifying it.
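Roughly like this, if it helps to see it in code (a simplified sketch, not the actual implementation: made-up field names, a single int standing in for the node contents, and a separate lock flag rather than packing the lock bit into the version word):

    #include <atomic>
    #include <cstdint>

    // Illustrative node layout, not the article's actual code.
    struct Node {
        std::atomic<uint64_t> version{0};    // bumped on unlock if the writer modified the node
        std::atomic<bool>     locked{false}; // writer lock; a real tree would use a proper latch
        std::atomic<int>      payload{0};    // stands in for keys/child pointers
    };

    // Writer: mutual exclusion via the lock, version bump only on modification.
    void write_payload(Node& n, int v) {
        bool expected = false;
        while (!n.locked.compare_exchange_weak(expected, true, std::memory_order_acquire))
            expected = false;                                  // spin until the lock is ours
        n.payload.store(v, std::memory_order_relaxed);
        n.version.fetch_add(1, std::memory_order_release);     // we modified, so bump
        n.locked.store(false, std::memory_order_release);
    }

    // Reader: no lock, just read, then verify nothing changed underneath us.
    bool try_read(const Node& n, int& out) {
        uint64_t before = n.version.load(std::memory_order_acquire);
        if (n.locked.load(std::memory_order_acquire))
            return false;                                      // writer active, caller retries
        out = n.payload.load(std::memory_order_relaxed);       // speculative read
        std::atomic_thread_fence(std::memory_order_acquire);
        return !n.locked.load(std::memory_order_relaxed)       // still unlocked...
            && n.version.load(std::memory_order_relaxed) == before; // ...and same version
    }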
"Optimistic locking" is a bad choice of words because no locking is optimistic.
In the well-known access method described here, the writers access the shared data with mutual exclusion, i.e. they use locking.
The readers use no locking; they access the shared data concurrently and optimistically, hoping that the access will succeed on the first attempt.
When the readers are unlucky, they must retry the access.
So there is locking used by writers for mutual exclusion and there is optimistic access used by the readers.
There is no "optimistic locking", which is a contradiction in terms (locking is pessimistic).
In general, there are only 3 methods for accessing shared data: mutual exclusion (a.k.a. pessimistic access), where locking forces the accesses to be sequential; optimistic access (a.k.a. lock-free), where accesses may proceed concurrently but retries may be necessary; and dynamic partitioning of the shared data (typically used for shared arrays or shared buffers/queues), where accesses are concurrent and neither locking nor retries are needed.
The method described here for accessing B-trees employs a combination of all 3, because the release of the locks at the higher levels is a consequence of restricting future accesses to only a part of the shared data. The writers that access the shared B-tree start by accessing the root sequentially, but they then partition the tree between themselves, so subsequent accesses that fall into distinct subtrees may proceed concurrently.
“Optimistic locking” is a well-established and widespread term though, not least because of its catchiness. The more accurate, but unwieldy, term is “optimistic concurrency control”.
I agree that it is widespread, but whenever you see such illogical terms you have to wonder whether the authors who use them do not understand what they are really doing, or whether they do understand but succumb to a widespread inappropriate usage in the hope of being better understood by naive readers.
Understanding the difference between pessimistic access (mutual exclusion implemented by locking) and optimistic access (concurrent accesses with retries when necessary) is absolutely critical for the correct and efficient implementation of algorithms that use shared data structures.
It is common to combine both methods in one algorithm, as done here, but in English that is not "optimistic locking"; at most it is "locking and optimistic access" or "optimism and locking".
Pessimistic access means that you expect that another process will attempt to access the shared data concurrently, so you must use a lock to prevent this. Optimistic access means that you expect that no other process will attempt to access the shared data concurrently, so you may proceed to access it immediately, but then you must have some means to detect that your assumption was wrong and another process has interfered, in which case the transaction must be retried.
Depending on the application, either pessimistic access or optimistic access results in better performance; neither is always better than the other. Optimistic access (lock-free access) improves the best case but makes the worst case much worse. Depending on the frequency distribution of these cases, optimistic access increases or decreases the performance.
Pessimistic access and optimistic access have the advantage of being applicable to any kind of shared data structure. Dynamic data partitioning, where applicable (as for this shared B-tree, which can be partitioned into sub-trees accessed concurrently), normally results in better performance than both, because it is deterministic and avoids both locking and retries. Dynamic partitioning may require locking for a very short time in order to partition the shared resource before accessing the allocated part, though the mutual exclusion provided by atomic instructions may be sufficient for this purpose. Frequently an atomic fetch-and-add instruction is enough to partition a shared data structure, such as a shared array or a shared message queue, between the concurrent processes attempting to access it.
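As a concrete illustration of that last point, a minimal C++ sketch (the chunk size and the doubling "work" are arbitrary placeholders):

    #include <algorithm>
    #include <atomic>
    #include <cstddef>
    #include <vector>

    // Each worker claims a disjoint chunk of the shared array with a single
    // fetch-and-add; no locks are held while working, and no retries are needed.
    void process_shared(std::vector<int>& items, std::atomic<std::size_t>& next,
                        std::size_t chunk = 64) {
        for (;;) {
            std::size_t begin = next.fetch_add(chunk, std::memory_order_relaxed);
            if (begin >= items.size())
                return;                                    // nothing left to claim
            std::size_t end = std::min(begin + chunk, items.size());
            for (std::size_t i = begin; i < end; ++i)
                items[i] *= 2;                             // placeholder for real work
        }
    }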
I chuckle every time anyone tries to handle long (64-bit) overflow while incrementing by one.
Other than that, version checks + retries have been a thing since forever (they are the most bog-standard way to do lock-free data structures, and database updates are done in the same manner). They do need a back-off, though.
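For example, the shape I mean, sketched in C++ (the yield-based back-off and the 1024 cap are arbitrary choices, not taken from any particular implementation):

    #include <atomic>
    #include <cstdint>
    #include <thread>

    // Retry loop with a crude exponential back-off so that contending threads
    // stop hammering the same cache line.
    void add_with_backoff(std::atomic<uint64_t>& counter, uint64_t delta) {
        unsigned spins = 1;
        uint64_t cur = counter.load(std::memory_order_relaxed);
        while (!counter.compare_exchange_weak(cur, cur + delta,
                                              std::memory_order_release,
                                              std::memory_order_relaxed)) {
            for (unsigned i = 0; i < spins; ++i)
                std::this_thread::yield();   // real code might prefer a pause/wfe instruction
            if (spins < 1024)
                spins *= 2;                  // cap the growth
        }
    }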
I have various generations of Optane I stocked up on for my own small business and home lab use, and fully agree.
I don’t know if you’re aware, but Samsung introduced their own version of Optane shortly before Intel killed their product line. It was called Z-NAND (sold as the Z-SSD) and it was prohibitively expensive for some reason, but I didn’t hear that it was officially killed off (though it likely has been).
> Ultimately, a reliable non-volatile write cache, sized for your workload is the answer.
Author here, I agree! It's quite sad that we need such an involved solution to offset the inherent complexity of the flash medium (latency spikes, erase blocks, ...). We nearly had the perfect solution with Optane[1]: 100ns latency, instantly persisted writes and all that good stuff.
I'm still not over Intel killing it while I did my PhD on it.
More real-world industry experience would've made it clear that they're totally useless for datacenters, where power outages are rare and electrical uptime runs at many 9's, with time between incidents measured in months or years.
UPSes and battery-backed write caches (BBWC) evolved to bring reliability to production gear running in non-datacenter environments, back when mainline servers used spinning rust without backup power. Today, it's largely a vendor up-charge.
Write barriers cause far too much latency in practice on servers in tier IV datacenters, so they're almost always turned off except for a tiny fraction of systems.
There has never been a "perfect" or a universal solution, only a risk budget and suitability for a specific use-case.
Write barriers aren't just for durability - they can also even out major latency spikes when bloated buffers ultimately must be flushed. Database, filesystem, RAID, device, flash management and individual chips all become saturated at some point. Managing performance at the database engine layer gives you visibility into the issue at a high level, as opposed to rolling merrily along until commits start taking 2000 ms. As an example, ZFS is terrifyingly unpredictable at high loads.
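To make the latency-smoothing point concrete, here is a rough POSIX-flavored sketch (hypothetical, with made-up thresholds): flushing every few megabytes keeps each individual fdatasync small instead of letting gigabytes of dirty buffers pile up behind one commit-time flush.

    #include <unistd.h>
    #include <cstddef>

    // Hypothetical paced log writer: bound the dirty backlog so no single
    // fdatasync has to drain a huge amount of buffered data at commit time.
    class PacedLogWriter {
    public:
        explicit PacedLogWriter(int fd, std::size_t flush_bytes = 4u << 20)
            : fd_(fd), flush_bytes_(flush_bytes) {}

        bool append(const void* data, std::size_t len) {
            if (::write(fd_, data, len) != static_cast<ssize_t>(len))
                return false;                 // real code would also handle short writes
            dirty_ += len;
            if (dirty_ >= flush_bytes_) {
                if (::fdatasync(fd_) != 0)
                    return false;
                dirty_ = 0;                   // bounded backlog, smoother commit latency
            }
            return true;
        }

    private:
        int         fd_;
        std::size_t flush_bytes_;
        std::size_t dirty_ = 0;
    };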
Power availability isn't the only concern when it comes to taking the risk of async writes. Kernel panics will cause unflushed writes to be lost, and storage hardware can experience gray or total failures, sometimes without any warning.
Radian makes ultracapacitor-backed RAM cards with flash backup that appear as NVMe devices. They do a nice job for things like write-ahead logs and the ZFS ZIL (SLOG) or L2ARC. They offer effectively unlimited write endurance, with a tradeoff of cost and capacity.
Yes, the way it is currently implemented, the build side has to fit into RAM. There is no inherent reason we couldn't also spool to disk, but we haven't implemented that yet.
Thanks for confirming. (I deliberately worded my question like that, as it makes sense to roll such features out in phases, just like plenty of others have done - off the top of my head, DuckDB and Apache Impala for example.)
Edit: In the post you mentioned that you optimized the hot path on the assumption that a record will likely not find a match. Sometimes, with well-designed partition-wise joins, most of the records actually do match and survive the join; I guess in such (estimated or detected) cases you could switch to an alternative path with a match being the likely branch in the hot path…
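Something like this is what I mean, using C++20 branch hints purely as an illustration (obviously not your actual probe code):

    #include <cstdint>
    #include <unordered_map>
    #include <vector>

    // Hypothetical probe loops: the only difference is which side of the
    // branch the compiler is told to favor.
    int64_t probe_expect_misses(const std::unordered_map<int64_t, int64_t>& build,
                                const std::vector<int64_t>& probe_keys) {
        int64_t matches = 0;
        for (int64_t key : probe_keys) {
            auto it = build.find(key);
            if (it != build.end()) [[unlikely]]   // hot path assumes most rows don't match
                matches += it->second;
        }
        return matches;
    }

    int64_t probe_expect_hits(const std::unordered_map<int64_t, int64_t>& build,
                              const std::vector<int64_t>& probe_keys) {
        int64_t matches = 0;
        for (int64_t key : probe_keys) {
            auto it = build.find(key);
            if (it != build.end()) [[likely]]     // flipped hint when most rows survive the join
                matches += it->second;
        }
        return matches;
    }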