Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

But can o3 write a symphony?

Seriously though, I'd like to hear suggestions on how to automatically evaluate an AI model's creativity, no humans in the loop.



In my view there's two modes of creativity:

1. That two distant topics or ideas are actually much more closely related. The creative sees one example of an idea and applies it to a discipline that nobody expects. In theory, reduction of the maximally distant can probably be measured with a tangible metric.

2. Discovery of ideas that are even more maximally distant. Pushing the edge, and this can be done by pure search and randomness actually. But it's no good if it's garbage. The trick is, what is garbage? That is very context dependent.

(Also, a creative might be measured on the efficiency of these metrics rather than absolute output)


Terry Tao has referred to this classification system as foxes vs hedgehogs. https://en.m.wikipedia.org/wiki/The_Hedgehog_and_the_Fox


we'd have to create a numerical scale for creativity, from boring to Dali, with milliEschers and MegaGeigers somewhere in there as well


It's essential that we quantify everything so that we can put a price on it. I'd go with Kahlograms though.


Have you tried suno.ai?


Have _you_? It lost its novelty after a couple of days.


I probably listen to Suno (both my own songs, and songs other people have created) about as often as I listen to Spotify, these days.


LLMs have read everything humans made so just ask one if there’s anything truly new in that freshly confabulated slop-phony.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: