Narrow finetuning can produce broadly misaligned LLMs

sylware 4 months ago

Is anybody pointing out the fact that "alignment" is brainwashing?

achierius 4 months ago

Is teaching a child? Is talking with your friend? Is punishing a criminal?
Or, more pointedly, what about training the model in the first place? Why do you pretend that AIs are somehow "people" with a "natural tendency" we're overriding?