I could see it being handy to estimate probability where each word is to better inform each guess. The tricky thing is I'm not sure if each run of wave function collapse (with randomness injected) would be an accurate sampling method for the real distribution of possible permutations. While doing this writeup I tried to find ways to analyze permutations with restriction, but it turns out most general methods are pretty intractable: https://en.wikipedia.org/wiki/Permanent_(mathematics)#Enumer...
https://en.m.wikipedia.org/wiki/Model_synthesis
https://www.rserra.it/solving-hardest-sudoku/