Controllable generation in masked diffusion language models
Investigating controllable generation in masked diffusion language models (MDLMs) through inference-time unmasking, and developing masking-aware activation-steering methods for partially observed sequences. Benchmarking token-selection strategies (margin, entropy, saliency, and concept-directed) for controllability and text quality, measured with perplexity, MAUVE, and classifier-based metrics on the LLaDA codebase.