Hi there! I am a fifth-year Ph.D. student in Computer Science at the University of Chicago, interested in high-performance computing and autonomous laboratory research. I am a member of Globus Labs, where I am co-advised by Ian Foster and Kyle Chard. I completed my Bachelor's in Computer Science at Zhejiang University and previously worked at Google and Alibaba.
RESEARCH
Scientific discovery can be slowed by tedious assembly and tricky manual operations. The autonomous laboratory project aims to replace tasks traditionally performed by human researchers with automated systems and intelligent algorithms. I currently work with Ian Foster and Chibueze Amanchukwu to build an autonomous laboratory for manufacturing coin-cell batteries. We propose developing generative AI models to identify candidate electrolyte solvents with desired properties (high ionic conductivity, oxidative stability, and Coulombic efficiency) and deploying self-driving labs for electrolyte synthesis and for battery fabrication and testing.
Modern simulations (e.g., particle and climate simulations) can produce huge amounts of data every day. Lossy compression can significantly reduce data size while preserving the information that matters for analysis. I work with Sheng Di on the compression project. We explore lossy compression of scientific datasets, especially those consisting of floating-point numbers. The data files are usually planar (e.g., a 1800x3600 CESM field) or cubic (e.g., a 512x512x512 Nyx field). A single extremely large file can exceed 900 GB (e.g., the 10240x7680x1536 Turbulent Channel Flow dataset), while other datasets may contain thousands of smaller files. The goal of this project is to provide a user-friendly program for compressing, transferring, and storing these huge datasets.
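To give a flavor of what "error-bounded" means here, below is a toy sketch of the core quantization step, written with numpy for illustration. Real compressors such as SZ add prediction and entropy coding on top of this, so treat it as a simplified model rather than an actual implementation.

```python
import numpy as np

def quantize(data: np.ndarray, eb: float) -> np.ndarray:
    # Map each value to an integer bin of width 2*eb; rounding to the
    # nearest bin center keeps the reconstruction error within +/- eb.
    return np.round(data / (2 * eb)).astype(np.int64)

def dequantize(codes: np.ndarray, eb: float) -> np.ndarray:
    return codes * (2 * eb)

# A synthetic planar field, shaped like a 1800x3600 CESM slice.
field = np.random.rand(1800, 3600).astype(np.float32)
codes = quantize(field, eb=1e-3)
recon = dequantize(codes, eb=1e-3)
print(np.abs(field - recon).max())  # stays (up to float rounding) within 1e-3
```

The integer codes are far more repetitive than the raw floats, which is why a lossless entropy coder applied afterwards yields large compression ratios.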
PUBLICATIONS
Ocelot: An Interactive, Efficient Distributed Compression-As-a-Service Platform With Optimized Data Compression Techniques
Yuanjian Liu, Sheng Di, Jiajun Huang, Zhaorui Zhang, Kyle Chard, Ian Foster
TPDS 2025
TLDR: Large volumes of data generated by scientific simulations, genome sequencing, and other applications need to be moved among clusters for data collection and analysis. Data compression techniques have effectively reduced data storage and transfer costs, but users' requirements for interactively controlling both data quality and compression ratio are non-trivial to fulfill. We propose a novel Compression-as-a-Service (CaaS) platform called Ocelot with four important contributions: (1) it offers real-time visualization, interactive compression, and transfer of scientific datasets; (2) it incorporates new strategies for compressing diverse types of datasets more effectively than traditional methods; (3) it provides an effective method for estimating the compression ratio and execution time of compression tasks; (4) experiments on multiple real-world datasets on geographically distributed computers show that Ocelot can significantly improve data transfer efficiency, with a performance gain of more than 10x in computing clusters with relatively slow networks.
@ARTICLE{11007768,
  author={Liu, Yuanjian and Di, Sheng and Huang, Jiajun and Zhang, Zhaorui and Chard, Kyle and Foster, Ian},
  journal={IEEE Transactions on Parallel and Distributed Systems},
  title={Ocelot: An Interactive, Efficient Distributed Compression-As-a-Service Platform With Optimized Data Compression Techniques},
  year={2025},
  pages={1-15},
  doi={10.1109/TPDS.2025.3568221}
}
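A back-of-the-envelope way to see why this pays off on slow networks: compress-then-transfer wins whenever compression time plus transfer of the smaller file plus decompression beats transferring the raw data. The function and numbers below are hypothetical, chosen only to illustrate the trade-off, not taken from the paper.

```python
def worth_compressing(size_bytes: float, bandwidth_bps: float,
                      comp_s: float, decomp_s: float, ratio: float) -> bool:
    """Compress-then-transfer wins when its end-to-end time beats raw transfer."""
    raw_transfer = size_bytes / bandwidth_bps
    with_compression = comp_s + (size_bytes / ratio) / bandwidth_bps + decomp_s
    return with_compression < raw_transfer

# 100 GB over a 1 Gb/s (125 MB/s) link, a 10x ratio, 60 s each way:
print(worth_compressing(100e9, 125e6, 60, 60, 10))  # True: ~200 s vs ~800 s
```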
Hybrid Lossy Compression Methods Can Confidently Optimize Wide Network Transfer of Complex Datasets
Yuanjian Liu
Dissertation
TLDR: Large volumes of data generated by scientific simulations, genome sequencing, and other applications need to be moved among clusters for data collection and analysis. Data compression techniques have effectively reduced data storage and transfer costs, but users' requirements for interactively controlling both data quality and compression ratio are non-trivial to fulfill, and lossy compression methods must respect several data constraints to be useful in realistic data transfer scenarios. In this thesis, I propose a novel Compression-as-a-Service (CaaS) platform called GlobaZip with five important contributions: (1) a multi-interval/multi-region compression algorithm that supports several data constraints to further limit the distortion in data fidelity even though the compression is lossy; (2) a layer-by-layer compression technique that allows a much higher parallel compression rate on HPC systems and can coordinate CPU cores on multiple compute nodes to compress extremely large files without out-of-memory errors; (3) a decision-tree-based compression performance prediction model that lets users estimate compression characteristics, including compression ratio, time, and data fidelity, with very limited computational overhead; (4) an optimized reference-based genome sequence compression algorithm that exceeds the performance of state-of-the-art algorithms by using a more fine-grained sequence alignment procedure, read reordering, a novel dominant-bitmap method for quality score compression, and a few other small optimizations; (5) a Qt5-based user-facing app that uses Globus Compute and Globus Transfer to give users a universal interface for orchestrating remote data compression and transfer. Experiments on multiple real-world datasets on geographically distributed computers show that GlobaZip can significantly improve data transfer efficiency, with a performance gain of more than 10x in computing clusters with relatively slow networks.
@phdthesis{liu2025thesis,
  author={Liu, Yuanjian},
  title={Hybrid Lossy Compression Methods Can Confidently Optimize Wide Network Transfer of Complex Datasets},
  school={University of Chicago},
  year={2025},
  month={6},
  url={http://knowledge.uchicago.edu/record/15070},
  doi={10.6082/uchicago.15070}
}
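The layer-by-layer idea in contribution (2) can be sketched as follows: stream a huge cubic file one 2D slice at a time, so memory use stays constant and independent slices can be handed to different cores or nodes. In this minimal sketch, zlib stands in for the real error-bounded compressor, and the file path and shape are hypothetical.

```python
import zlib
import numpy as np

def compress_layers(path: str, shape: tuple, dtype=np.float32) -> list:
    nz, ny, nx = shape
    layer_bytes = ny * nx * np.dtype(dtype).itemsize
    compressed = []
    with open(path, "rb") as f:
        for _ in range(nz):
            # Only one layer is ever resident in memory, so a 900 GB file
            # never needs to fit in RAM; layers are independent, so they
            # can be distributed across CPU cores or compute nodes.
            layer = f.read(layer_bytes)
            compressed.append(zlib.compress(layer))
    return compressed

# e.g., compress_layers("turbulence.f32", (1536, 7680, 10240))  # hypothetical file
```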
Optimizing Scientific Data Transfer on Globus with Error-Bounded Lossy Compression
Yuanjian Liu, Sheng Di, Kyle Chard, Ian Foster, Franck Cappello
ICDCS 2023
TLDR: We propose a novel data transfer framework called Ocelot that integrates error-bounded lossy compression into the Globus data transfer infrastructure. We note four key contributions: (1) Ocelot is the first integration of lossy compression in Globus to significantly improve scientific data transfer performance over wide area networks (WAN). (2) We propose an effective machine-learning-based lossy compression quality estimation model that can predict the quality of error-bounded lossy compressors, which is fundamental to ensuring that transferred data are acceptable to users. (3) We develop optimized strategies to reduce the compression time overhead, counter the compute-node waiting time, and improve transfer speed for compressed files. (4) We perform evaluations using many real-world scientific applications across different domains and distributed Globus endpoints. Our experiments show that Ocelot can improve dataset transfer performance substantially, and that the quality of lossy compression (time, ratio, and data distortion) can be predicted accurately for the purpose of quality assurance.
@INPROCEEDINGS{10272494,
  author={Liu, Yuanjian and Di, Sheng and Chard, Kyle and Foster, Ian and Cappello, Franck},
  booktitle={2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS)},
  title={Optimizing Scientific Data Transfer on Globus with Error-Bounded Lossy Compression},
  year={2023},
  pages={703-713},
  doi={10.1109/ICDCS57875.2023.00064}
}
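The estimation idea in contribution (2) can be sketched like this: compute cheap statistics on sampled blocks of the data and let a regression model predict the compression ratio before the compressor ever runs. The features, model, and training data below are placeholders rather than the ones from the paper, and the sketch assumes scikit-learn is available.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def block_features(block: np.ndarray) -> list:
    # Cheap statistics that correlate with compressibility: smooth,
    # low-variance blocks compress much better than noisy ones.
    d = np.diff(block.ravel())
    return [block.std(), np.abs(d).mean(), block.max() - block.min()]

# Placeholder training set: features of past blocks and their measured ratios.
X_train = np.random.rand(200, 3)
y_train = np.random.rand(200) * 50
model = RandomForestRegressor(n_estimators=100).fit(X_train, y_train)

# Predict the ratio for a newly sampled block without compressing it.
new_block = np.random.rand(64, 64)
predicted_ratio = model.predict([block_features(new_block)])[0]
```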
FastqZip
Yuanjian Liu, Huihao Luo, Zhijun Han, Yao Hu, Yehui Yang, Kyle Chard, Sheng Di, Ian Foster, Jiesheng Wu
Preprint
TLDR: Storing and archiving data produced by next-generation sequencing (NGS) is a huge burden for research institutions. Reference-based compression algorithms are effective for these data. Our work focuses on compressing FASTQ files with an improved reference-based compression algorithm to achieve a higher compression ratio than other state-of-the-art algorithms. We propose FastqZip, which uses a new method for mapping sequences to a reference, allows read reordering and lossy quality scores, and applies BSC or ZPAQ for the final lossless compression stage, reaching a higher compression ratio at relatively fast speed.
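The core of reference-based compression is to record where a read maps on the reference and only the bases that differ, rather than the read itself. Here is a minimal illustration; the function and sequences are made up for this example and are far simpler than FastqZip's actual alignment procedure.

```python
def encode_read(read: str, reference: str, pos: int):
    # Store the mapped position plus (offset, base) pairs for mismatches;
    # when reads align well, this is far smaller than the raw bases.
    mismatches = [(i, base) for i, base in enumerate(read)
                  if reference[pos + i] != base]
    return pos, mismatches

ref = "ACGTACGTACGT"
print(encode_read("ACGAACGT", ref, 0))  # (0, [(3, 'A')]): one mismatch at offset 3
```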
Optimizing Error-Bounded Lossy Compression for Scientific Data With Diverse Constraints
Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello
TPDS 2022
TLDR: Many scientific applications have specific requirements or constraints for lossy compression in order to guarantee that the reconstructed data are valid for post hoc analysis. We handle lossy compression under several such constraints, including irrelevant data, different error bounds for different value ranges, and diverse precision over multiple regions. Experiments with six real-world applications show that our diverse-constraints-based error-bounded lossy compressor obtains higher visual quality or data fidelity on reconstructed data, with the same or even higher compression ratios, compared with the traditional state-of-the-art compressor SZ.
@ARTICLE{9844293,
  author={Liu, Yuanjian and Di, Sheng and Zhao, Kai and Jin, Sian and Wang, Cheng and Chard, Kyle and Tao, Dingwen and Foster, Ian and Cappello, Franck},
  journal={IEEE Transactions on Parallel and Distributed Systems},
  title={Optimizing Error-Bounded Lossy Compression for Scientific Data With Diverse Constraints},
  year={2022},
  volume={33},
  number={12},
  pages={4440-4457},
  doi={10.1109/TPDS.2022.3194695}
}
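The "different error bounds for different value ranges" constraint can be illustrated with a small numpy sketch. This is a simplification: the actual compressor integrates range-dependent bounds into SZ's prediction-quantization pipeline, whereas the toy function below quantizes each range independently.

```python
import numpy as np

def reconstruct_multi_range(data: np.ndarray, ranges) -> np.ndarray:
    """Quantize each user-specified value range with its own error bound.
    `ranges` is a list of (lo, hi, eb) triples covering the data's values."""
    recon = np.empty_like(data)
    for lo, hi, eb in ranges:
        mask = (data >= lo) & (data < hi)
        recon[mask] = np.round(data[mask] / (2 * eb)) * (2 * eb)
    return recon

field = np.random.rand(512, 512).astype(np.float32)
# Tight 1e-4 bound in the range of interest, loose 1e-2 everywhere else:
recon = reconstruct_multi_range(field, [(0.0, 0.8, 1e-2), (0.8, 1.1, 1e-4)])
```

Values of little scientific interest tolerate coarse bins that compress extremely well, while the ranges analysts actually inspect keep near-lossless precision.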
Optimizing Multi-Range based Error-Bounded Lossy Compression for Scientific Datasets
Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello
HiPC 2021
TLDR: Existing state-of-the-art error-bounded lossy compressors do not support multi-range-based error bounds, leaving a critical gap that hampers their effective use in practice. In this work, we address this issue by proposing a multi-range-based error-bounded lossy compressor built on the state-of-the-art SZ lossy compressor. Our approach allows users to set different error bounds in different value ranges for a compression task.
@INPROCEEDINGS{9680367,
  author={Liu, Yuanjian and Di, Sheng and Zhao, Kai and Jin, Sian and Wang, Cheng and Chard, Kyle and Tao, Dingwen and Foster, Ian and Cappello, Franck},
  booktitle={2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC)},
  title={Optimizing Multi-Range based Error-Bounded Lossy Compression for Scientific Datasets},
  year={2021},
  pages={394-399},
  doi={10.1109/HiPC53243.2021.00036}
}
Understanding Effectiveness of Multi-Error-Bounded Lossy Compression for Preserving Ranges of Interest in Scientific Analysis
Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello
DRBSD-7 2021
TLDR: Lossy compression frameworks have been proposed as a method to reduce the size of data produced by scientific simulations. However, they do so at the expense of precision, and existing compressors apply a single error bound across the entire dataset. Varying the precision across user-specified ranges of scalar values is a promising approach to further improve compression ratios while retaining precision in specific areas of interest. In this work, we investigate a compression method, based on the SZ framework, that can set multiple error bounds. We evaluate its effectiveness by applying it to real-world datasets that have concrete precision requirements. Our results show that multi-error-bounded lossy compression can improve the compression ratio by 15% with negligible overhead in compression time.
@INPROCEEDINGS{9652577,
  author={Liu, Yuanjian and Di, Sheng and Zhao, Kai and Jin, Sian and Wang, Cheng and Chard, Kyle and Tao, Dingwen and Foster, Ian and Cappello, Franck},
  booktitle={2021 7th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-7)},
  title={Understanding Effectiveness of Multi-Error-Bounded Lossy Compression for Preserving Ranges of Interest in Scientific Analysis},
  year={2021},
  pages={40-46},
  doi={10.1109/DRBSD754563.2021.00010}
}