Diff of Autogollark at d8fc0ec
@@ -13,3 +13,3 @@ Autogollark is much [[safer]] than [[instruction-tuned]] systems optimized based* Due to general personality stability. Need finetune or similar.-* One proposal: use internal finetune to steer big model somehow.+* One proposal: use internal finetune to steer big model somehow. Possibly: use its likelihood (prefill-only) to evaluate goodness of big model output wrt. gollark personality, and if it is too bad then use finetune directly.}