NL-241, Inference-Time Intervention: Eliciting Truthful Answers from a Language Model, NeurIPS 2023

 
















