Papers.
- FL2026 Under review
Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs
We formalize four axioms for LLM thought representations and show structural failures in candidate representations.
- CC2026 TOSEM ACM Transactions on Software Engineering and Methodology Under review
Context-Augmented Code Generation Using Programming Knowledge Graphs
Grounds code generation in a programming knowledge graph so the model retrieves precise structural context (call sites, types, idioms) before generating, instead of relying on lossy in-context recall.
- PF2026 FSE ACM International Conference on the Foundations of Software Engineering
Panther: Faster and Cheaper Computations with Randomized Numerical Linear Algebra
Randomized numerical linear algebra primitives that drop into existing PyTorch layers – 5x speedup over the dense baseline and 75% parameter reduction at negligible accuracy cost on BERT.
- AO2025 EMSE Empirical Software Engineering Under review
Analysis of AdvFusion: Adapter-based Multilingual Learning for Code LLMs
An empirical analysis of AdvFusion, an adapter-based multilingual training scheme for code LLMs, with a focus on cross-language transfer and pareto trade-offs.
- AA2023 ICECCE International Conference on Electrical, Communication and Computer Engineering
AGS: Arabic GPT Summarization Corpus
An Arabic abstractive-summarisation dataset built via prompt engineering – the first Arabic abstractive corpus labelled by an LLM. Backed our 1st-place AIC-1 (ICMTC) submission.
3D model: "3D Origami crane" by JuanG3D · CC BY 4.0