Yousef Mroueh
IBM
Scientific, Seminar
Kantorovich Initiative Seminar: Yousef Mroueh
Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for...