Mechanistic Interpretability for Large Language Model Alignment: Progress, Challenges, and Future Directions | Knowledge Base | MenFem