Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2 
Published in Gut (IF: 24.5), 2024
This study characterizes the role of MED12 loss in activating endogenous retroelements, which sensitizes pancreatic ductal adenocarcinoma (PDAC) to immune checkpoint blockade. Using integrative multi-omics analysis, we demonstrate that MED12 deficiency rewires the tumor immune microenvironment, offering a potential biomarker for immunotherapy response in pancreatic cancer. Published as co-first author in Gut (2024).
Recommended citation: Tang Y*, Tang S*, Yang W, et al. MED12 loss activates endogenous retroelements to sensitise immunotherapy in pancreatic cancer. Gut. Published Online First: 31 August 2024. doi:10.1136/gutjnl-2024-332350 (*co-first author)
Download Paper
Published in bioRxiv, 2025
ARCADE is a controllable multi-objective codon design framework that leverages pretrained genomic language models and extends activation engineering — originally developed for controllable text generation — to steer continuous-valued biological metrics. By deriving semantic steering vectors in the model’s activation space, ARCADE directly controls properties such as Codon Adaptation Index (CAI), Minimum Free Energy (MFE), and GC content, enabling programmable biological sequence design for applications including mRNA vaccines and gene therapies.
Recommended citation: Li J, Lai HS, Liang L, Du S, Tang S, Kingsford C. ARCADE: Controllable Codon Design from Foundation Models via Activation Engineering. bioRxiv. 2025. doi:10.1101/2025.08.19.668819
Download Paper
Published in bioRxiv, 2026
CodonRL is a reinforcement learning framework for multi-objective mRNA codon optimization that learns a structural prior from efficient folding feedback and demonstration-guided replay. The framework uses LinearFold for fast intermediate reward computation, milestone-based rewards to address delayed feedback in long-range optimization, and enables user-controlled trade-offs between translation efficiency, RNA stability, and compositional properties. On a benchmark of 55 human proteins, CodonRL outperforms GEMORNA by 9.5% higher CAI, 25.4 kcal/mol more favorable MFE, and 3.4% lower uridine content.
Recommended citation: Du S, Kaynar G, Li J, You Z, Tang S, Kingsford C. CodonRL: Multi-Objective Codon Sequence Optimization Using Demonstration-Guided Reinforcement Learning. bioRxiv. 2026. doi:10.64898/2026.02.12.705465
Download Paper
Published in arXiv preprint arXiv:2604.12277, 2026
This paper addresses shortcut learning in pretrained language models, where superficial token-level patterns fail to generalize under distribution shifts. “Shortcut Guardrail” is a deployment-time framework that requires no access to original training data or advance knowledge of shortcut types. It identifies problematic shortcuts through gradient-based attribution and employs a lightweight LoRA-based module trained with Masked Contrastive Learning to maintain consistent representations regardless of individual tokens. The approach improves performance across sentiment classification, toxicity detection, and natural language inference tasks under distribution shift.
Recommended citation: Li J, Tang S, Kaynar G, Du S, Kingsford C. Models Know Their Shortcuts: Deployment-Time Shortcut Mitigation. arXiv preprint arXiv:2604.12277. 2026.
Download Paper
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.