The model is capable of generating many different responses to the same prompt. ...

CamperBob2 · on Feb 13, 2023

Exactly. Given a source of truth, it can't be that hard to train a separate analytic model to evaluate answers from the existing synthetic model. (Neglecting for the moment the whole Gödel thing.)

The problem isn't going to be developing the model, it's going to be how to arrive at an uncontroversial source of ground truth for it to draw from.

Meanwhile, people are complaining that the talking dog they got for Christmas is no good because the C++ code it wrote for them has bugs. Give it time.

swatcoder · on Feb 13, 2023

That’s quite the system that can take in any natural language statement and confirm whether it’s true or false.

You might be underestimating the scope of the task here.

mortehu · on Feb 13, 2023

Not true or false; just present or absent in the reference data. Note that false negatives will not result in erroneous output, so the model can safely err on the side of caution.
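To make the "present or absent" idea concrete, here is a minimal sketch. It assumes the reference data is just a set of sentences and that matching is exact after light normalization; `normalize` and `filter_supported` are hypothetical names, and a real system would need far fuzzier matching than this.

```python
def normalize(claim: str) -> str:
    # Lowercase, collapse whitespace, drop trailing periods (assumed normalization).
    return " ".join(claim.lower().split()).rstrip(".")


def filter_supported(candidates, reference):
    # Keep only candidates found in the reference data.
    # Anything not found is withheld, even if it happens to be true.
    ref = {normalize(r) for r in reference}
    return [c for c in candidates if normalize(c) in ref]


reference = ["Paris is the capital of France."]
candidates = [
    "paris is the capital of   France",   # present after normalization
    "Lyon is the capital of France.",     # absent, withheld
    "Berlin is the capital of Germany.",  # true but absent, also withheld
]
print(filter_supported(candidates, reference))
```

The third candidate is the false-negative case: a true statement gets dropped, but nothing wrong gets through.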

Also, 100% accuracy is probably not the real threshold for being useful. There is a lot of low-hanging fruit today that could be handled by absolutely tiny error-correcting models (e.g. arithmetic and rhyming).
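As a rough illustration of the arithmetic case, here is a rule-based stand-in (not a learned model) that recomputes simple claims in generated text. It assumes claims look like `a op b = c` with non-negative integers and a single `+`, `-`, or `*`; `fix_arithmetic` is a hypothetical name, and chained expressions are out of scope.

```python
import operator
import re

OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}
# Matches a single binary claim such as "17 * 23 = 381".
PATTERN = re.compile(r"(\d+)\s*([+\-*])\s*(\d+)\s*=\s*(\d+)")


def fix_arithmetic(text: str) -> str:
    # Recompute the right-hand side of every matched claim and rewrite it.
    def repl(match: re.Match) -> str:
        a, op, b = int(match.group(1)), match.group(2), int(match.group(3))
        return f"{a} {op} {b} = {OPS[op](a, b)}"

    return PATTERN.sub(repl, text)


print(fix_arithmetic("I think 17 * 23 = 381."))
```

Text without a matching claim passes through unchanged, so the corrector only touches what it can actually check.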

astrange · on Feb 13, 2023

There's research showing you can tell if something is a hallucination or memorized fact based on the activation patterns inside the LM.