Section 01
[Introduction] Reasoning and Misalignment: A Comparative Empirical Study of Three Open-Source Language Models
A master's thesis study that systematically compares the performance of three open-source large language models on reasoning tasks, revealing the potential tension between model capabilities and their alignment training.
Original author/maintainer: haavardos Source platform: GitHub Original title: master-thesis-ikt590-reasoning-misalignment Original link: https://github.com/haavardos/master-thesis-ikt590-reasoning-misalignment Source publication/update time: 2026-06-01T15:05:02Z
Keywords: large language models, alignment training, reasoning ability, RLHF, open-source models, AI safety, empirical study