Research

My research interests span machine learning, natural language processing (NLP), and AI safety.

Reasoning Alignment in Large Language Models

ongoing

Investigating methods to align chain-of-thought reasoning in LLMs with human logical processes, focusing on faithfulness and transparency of intermediate steps.

Alignment, Reasoning, Chain-of-Thought, LLM

Cross-Lingual Transfer Learning for Low-Resource Languages

published

Developing efficient transfer learning techniques that enable NLP capabilities in languages with limited training data, leveraging structural similarities across language families.

NLP, Transfer Learning, Multilingual, Low-Resource, ACL 2025

Human-in-the-Loop Interactive Machine Learning

published

Designing feedback interfaces that allow non-expert users to iteratively improve model behavior through natural interaction patterns.

HCI, Interactive ML, Human-in-the-Loop, NeurIPS 2025

Comprehensive Safety Evaluation Framework for Foundation Models

preprint

Building a systematic evaluation framework that tests foundation models across multiple safety dimensions, including toxicity, bias, and adversarial robustness.

AI Safety, Evaluation, Foundation Models, Benchmark