Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
About me
About mePage not in menu
This is a page not in th emain menuPosts
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
publications
BrowserSparseAttention: Sparse Attention and KV Cache Design for Browser Agents
Jiaheng Lu, ......, Vincent Liu.Work in Progress, 2026
[Paper], [Code]
DynamicWebArena: Evaluating Browser Agents in Dynamic and Temporally Evolving Full-Stack Environments
Jiaheng Lu, Xian Wang, Shrishti Roy, Colin Zhao, Sidharth Sankhe, Chuyue Wang, Xuting Liu, Spyros Pavlatos, Linh Phan, Vincent Liu.In submission to NeurIPS, 2026
[Paper], [Code]
Beyond Domains: Reusing Web Skills via Transferable Interaction Patterns
Shiqi He, Yue Cui, Feijie Wu, Xinyu Ma, Jiaheng Lu, Yaliang Li, Bolin Ding, Mosharaf Chowdhury.In submission to NeurIPS, 2026
[Paper], [Code]
Mercury: QoS-Aware Tiered Memory System
Jiaheng Lu*, Yiwen Zhang*, Hasan Al Maruf, Minseo Park, Yunxuan Tang, Fan Lai, Mosharaf Chowdhury.In submission to NSDI, 2026
*Equal contribution
[Paper], [Code]
OpenInfra: A Co-simulation Framework for the Infrastructure Nexus
Jiaheng Lu, Yunming Xiao, Shmeelok Chakraborty, Silvery Fu, Yoon Sung Ji, Ang Chen, Mosharaf Chowdhury, Nalini Rao, Sylvia Ratnasamy, Xinyu Wang.HotInfra @ SOSP, 2024
[Paper], [Code], [Presentation]
MLM-WR: A Swarm Intelligence-based Cloud-Edge-Terminal Collaboration Data Collection Scheme in The Era of AIoT
Jiaheng Lu, Zhenze Qu, Anfeng Liu, Shaobo Zhang, Neal N. Xiong.IEEE Internet of Things Journal, 2023
[Paper], [Code]
Automatic Information Extraction for Financial Events by Integrating BiGRU and Attention Mechanism
Jiaheng Lu and Weirong Liu.Journal of Physics: Conference Series, International Conference on Computer, Big Data and Artificial Intelligence, 2022
[Paper], [Code]
publications_old
research
Cause-Effect Relationship Extraction from Text of Financial Events
Research Assistant, Advisor: Prof. Weirong Liu (Central South University)
Jun 2021 - Jan 2022
In this research, I Designed an NER model based on BiGRU and self-attention mechanism to extract cause-effect relationships from the text of financial events, achieving 78% accuracy.
Lightweight Real-time Portrait Segmentation System
Team Leader (Research Assistant), Advisor: Prof. Yixiong Liang (Central South University)
Feb 2022 - May 2022
In this research, I innovated a lightweight network structure based on BiSeNet and STDC, achieved a high frame rate (186 fps) on CPU and guaranteed the segmentation effect under high resolution up to 720p. Besides, I achieved an mIoU of 93.9% on Supervisely Person Dataset and deployed the model on the desktop (LibTorch, QT, C++) and Android (LibTorch, JNI, JAVA).
Reliable Cloud-Edge-Terminal Data Collection Framework
Research Assistant, Advisor: Prof. Anfeng Liu (Central South University), Prof. Neal Xiong (Sul Ross State University)
Feb 2022 - Aug 2023
Mobile Crowd Sensing (MCS) is a cloud-edge-terminal collaboration model that relies on edge terminal devices, or “workers,” to sense data and build applications for cloud-hosted platforms. However, to ensure high-quality application development, recruiting truthful workers in the edge network is crucial. With the emergence of Artificial Intelligence (AI), the Internet of Things (IoT) is entering a new era, known as Artificial Intelligence of Things (AIoT). This paper proposes an AI-enabled MCS system, which includes MLM-WR, a cloud-edge-terminal collaboration data collection scheme for AIoT. MLM-WR leverages swarm intelligence to match truthful workers with sensing tasks, enabling efficient and effective data collection for AIoT applications. The matching theory is applied from two perspectives: truthful workers discovery and sensing difference discovery. To identify truthful workers, we adjust their credibility based on the deviation of their sensing data with Ground Truth Data (GTD) obtained through collaboration with the Unmanned Aerial Vehicle (UAV). In the sensing difference discovery, we obtain workers’ sensing attribute reliability by calculating attribute data errors and incorporate absolute and relative sensing location preferences to determine workers’ sensing quality at different locations. Additionally, MLM-WR employs the Particle Swarm Optimization (PSO) algorithm to assign workers while considering sensing attribute and location reliability and recruitment cost, thus addressing the tradeoff between recruitment cost and data quality. The effectiveness of our approach is demonstrated through extensive evaluations, where MLM-WR outperformed the state-of-the-art approaches.
CXL-Enabled Retrieval-Augmented Generation System
Research Assistant, Advisor: Prof. Mosharaf Chowdhury
Mar 2024 - Present
Retrieval-Augmented Generation (RAG) is a popular technique for improving the reliability of Large Language Models (LLMs) by reducing hallucination. Implementing this effectively often requires repeatedly searching through large vector databases. At scale, these vector searches become a substantial computational bottleneck for RAG-enabled LLM inference. In this project we introduce Aether, a prototype system that enhances search efficiency across massive sharded datasets through scheduling optimizations and dynamic file management. Aether is a highly adaptable system that is designed to scale across multi-tier memory systems, such as CXL-enabled clusters. Through comprehensive evaluations across diverse workloads and under varying memory constraints, we demonstrate that Aether significantly outperforms baseline approaches. Leveraging asynchronous I/O, our scheduling optimizations achieve an average throughput improvement of 19.7% over their synchronous counterpart. Additionally, dynamic index file management using a novel LFU+ caching policy outperforms traditional LRU by 14.5% in serving throughput.
Co-simulation Framework for the Infrastructure Nexus
Research Assistant, Advisor: Prof. Ang Chen
Mar 2024 - Present
Critical infrastructures like datacenters, power grids, and water systems are interdependent, forming complex “infrastructure nexuses” that require co-optimization for efficiency, resilience, and sustainability. We present OpenInfra, a co-simulation framework designed to model these interdependencies by integrating domain-specific simulators for datacenters, power grids, and cooling systems but focusing on stitching them together for end-to-end experimentation. OpenInfra enables seamless integration of diverse simulators and flexible configuration of infrastructure interactions. Our evaluation demonstrates its ability to simulate large-scale infrastructure dynamics, including 7,392 servers over 100+ hours.
CXL-Enabled QoS-Aware Tiered Memory System
Research Assistant, Advisor: Prof. Mosharaf Chowdhury
Sep 2023 - Present
Tiered memory systems have widely been adopted to provide larger memory capacity in response to increasing memory demands from memory-intensive workloads. Although increased memory capacity allows more applications to be deployed, existing solutions for tiered memory systems are not built with Quality-of-Service (QoS) support. As a result, they often cannot meet service-level objectives (SLOs) when multiple applications share a tiered memory system. Specifically, applications suffer from local memory contention and memory bandwidth interference, two sources of performance unpredictability unique to tiered memory systems. Indeed, we observe application performance drops by 43% and 70% during severe memory contention and interference. This paper presents Mercury, a QoS-aware tiered memory system that provides predictable performance for coexisting memory-intensive applications, each with different SLOs. Mercury enables per-tier page reclamation to enforce application-level resource management. It leverages a novel admission control and real-time adaptation algorithm to maximize local memory utilization while mitigating memory interference. Evaluations with real-world applications show that Mercury can provide QoS guarantees among multiple applications sharing a tiered memory system with up to 53.4% improvement in performance.
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
