I am a Senior Manager of Applied Science & AI at Oracle Corporation, Melbourne, Australia. My research and engineering work spans Natural Language Processing (NLP), Generative AI, Large Language Models (LLMs), Agentic AI, NL2SQL, NL2Code, Conversational AI, Analytics AI, and Healthcare AI. I lead multiple science teams building production-grade AI systems across Oracle's enterprise cloud portfolio — Oracle Analytics Cloud (OAC), Oracle Cloud Infrastructure (OCI), Oracle Digital Assistant (ODA), and Oracle Health & AI (OHAI).
I obtained my PhD in Engineering (NLP & Deep Learning) from the University of Melbourne in 2019, under the joint supervision of Prof. Trevor Cohn and Prof. Reza Haffari. Prior to Oracle, I was an AI Scientist at Speak.AI (a spin-off of Voicebox Technologies, subsequently acquired by Oracle), a research intern at NAVER LABS Europe (formerly Xerox Research Centre Europe), a visiting scholar at Carnegie Mellon University, Language Technologies Institute, and a Senior Research Engineer at HLT, I²R, A*STAR, Singapore. Before that, I studied at National University of Singapore (NUS) (MSc) and was a teaching & research assistant at the University of Science, Vietnam National University HCMC.
I hold 34 granted U.S. patents and more than 42 pending, covering NL2SQL, NL2Code, LLM training, NER, and agentic systems. My research has received 1,350+ citations (h-index 13, i10-index 17) on Google Scholar, with publications at ACL, EMNLP, NAACL, and COLING.
*** Our paper CLARITY: A Framework and Benchmark for Conversational Language Ambiguity and Unanswerability in Interactive NL2SQL Systems has been accepted at ACL 2026 (Industry Track).
*** Two papers accepted at NAACL 2025: Distill-C: Enhanced NL2SQL via Distilled Customization with LLMs (Industry Track) and Mastering the Craft of Data Synthesis for CodeLLMs (Long Paper).
*** Our paper SQLong: Enhanced NL2SQL for Longer Contexts with LLMs has been accepted at the Table Representation Learning Workshop at ACL 2025.
*** I was promoted to Senior Manager, Applied Science & AI at Oracle Corporation (Oct 2024), leading AI research programs across OAC, OCI, ODA, and OHAI.
*** Our Oracle IP portfolio has grown to 34 granted U.S. patents and more than 42 pending applications spanning NL2SQL, NL2Code, LLM training, NER, and dialog systems.