Research Projects

Ongoing

Reasoning-Level Fairness in LLMsUnder submission

Tech Python PyTorch OpenR Inference-time scaling Fairness-aware RL Benchmark

2025

MASTOPIA: Transparency in LLM-Assisted Intelligence Analysis

Demo Code

Tech Python GPT-4 / GPT-3.5 RAG Multi-agent LLM Vector DB Prompt engineering Flask Zero-inflated Poisson regression Ridit analysis Prolific / Qualtrics human-subject design

Bayesian Learning for Uncertainty-Aware Hallucination Mitigation

Tech Python PyTorch Bayesian methods LLM evaluation

2024

Towards Fair Language Modeling via Parameter-Efficient Methods by Machine Feedback

Tech Python PyTorch Hugging Face LoRA RL

MEGAWATT: MAST for Evaluating Generative AI in Worker–Automation Team Tasks

Tech Python GPT-4 API RAG Human-subject study design

Automated Evaluation of Machine-generated Summaries using RLHF

Tech Python PyTorch RLHF LLM evaluation

2023

PADTHAI-MM: Designing Trustworthy, Human-Centered AI Systems Using the MAST Methodology

Code

Tech Python Decision-support system design User study & evaluation

2022

READIT: Reporting Assistant for Defense and Intelligence Tasks

Code

Tech Python Transformers Node.js Google Cloud Platform

Facewise: AI-based Face ID Verification System

Code

Tech Python PyTorch CNN ResNet

Bridging the Gap: Online and Offline COVID-19 Data

Paper Code

Tech Python Topic modeling Matrix factorization Social computing

2021

Interpreting Text Classifiers with Counterfactual Explanation

Tech Python PyTorch Explainable AI

2017

Biomedical Entity Relation Extraction

Tech Python TensorFlow Tree-RNN Distant supervision