About Me

I am Zhuoyu WU (Reagan, 吴卓宇).

I am a PhD student in the CyPhi AI Lab at Monash University, supervised by Prof. Raphael Phan and funded by a Graduate Research Excellent Scholarship.

Prior to Monash, I worked on efficient AI algorithm design and HW/SW co-design under the guidance of Assoc. Prof. Dr. Zheng Wang and Dr. Wenqi Fang at the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences (SIAT, CAS). I had a wonderful time there, collaborating with a group of talented individuals to deliver several key projects.

If you are interested in any aspect of my work, I am always open to discussions and academic collaborations. Feel free to reach out to me at zhuoyu.wu [at] monash.edu.

Research Interests

Efficient AI
HW/SW co-design
LLM
AI for Science

News and Updates

A compact research log: ⚙️ efficient AI / hardware, 🖥️ GUI agents, and 🎓 personal milestones.

2026

⚙️ Apr 2026: DepthPolyp was accepted by ICPR 2026. A pseudo-depth-guided lightweight segmentation model for real-time colonoscopy, with 3.57M parameters, 0.86 GMACs, and over 180 FPS on mobile devices. [Paper] [Code] [HF Model] [HF Demo]
⚙️ Jan 2026: EndoCaver was accepted by ICASSP 2026. This work improves segmentation under fog, blur, and glare by connecting image restoration with the visual attention used for segmentation. [Preprint]

2025

⚙️ Nov 2025: FLICKER was accepted by DATE 2026. We reveal fine-grained sparsity in 3D Gaussian Splatting and design a contribution-aware reconfigurable accelerator for real-time 3DGS. [Preprint]
⚙️ Oct 2025: TMU was accepted by IEEE TVLSI. This work builds near-memory tensor manipulation capability for data-move-intensive, compute-light operators. [Paper]
⚙️ Sep 2025: RT-Focuser was accepted by ICTA 2025 as an Oral paper. A lightweight real-time deblurring model for edge-side vision, exceeding 140 FPS on mobile devices. [Paper] [Code]
🎓 Aug 2025: I began my PhD journey at Monash University.
🖥️ Jul 2025: GUI-Narrator was accepted by ACM MM 2025. This work studies how to detect and caption computer GUI actions for multimodal agents. [Paper]
⚙️ Apr 2025: AttenPU was accepted by GLSVLSI 2025. An area-efficient attention processor with reconfigurable FP8 precision and bidirectional dataflow. [Paper]
⚙️ Jan 2025: PR-KAN was accepted by ISCAS 2025 as an Oral paper. To the best of our knowledge, this is the first FPGA accelerator dedicated to Kolmogorov-Arnold Networks (KANs). [Paper]

Earlier

⚙️ Sep 2024: Fourier-LSTM was accepted by ICTA 2024. We embed Fourier transforms into LSTM to improve DNA sequencing accuracy on portable devices. [Paper]
🖥️ May 2024: GUI Action Narrator was publicly released. This work uses VLMs and pre-grounding to support GUI action understanding across platforms and applications. [Paper] [Code]
⚙️ Apr 2024: Harmonizing U-Nets was accepted by Computers in Biology and Medicine (Q1, IF=7.7). A cascaded design with adaptive attention fusion for efficient low-quality OCT fluid segmentation. [Paper]
⚙️ Apr 2023: COMPACT was presented at DATE 2023, focusing on precision-adjustable nonlinear activation acceleration. [Paper]

Academic Services

Journal:
- Biomedical Signal Processing and Control (Q1) – Reviewer
- Medical & Biological Engineering & Computing (Q2) – Reviewer
- Frontiers in Digital Health (Q1) – Reviewer
Conference:
- ACM MM 2026 Dataset Track – Programme Committee Member
- ACM MM 2026 – Reviewer
- MICCAI 2026 – Reviewer
- WCCI 2026 – Reviewer
- ICASSP 2026 – Reviewer
- BMVC 2025 – Reviewer
- IJCNN 2025 – Reviewer