Posts by Collection

people

Meihao Fan

Ph.D. student at Renmin University of China
Avatar

projects

publications

Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration

Published in IEEE 40th International Conference on Data Engineering (ICDE), 2024

This paper is about LLM for Entity Resolution.

Recommended citation: Fan, Meihao, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, and Xiaoyong Du. "Cost-effective in-context learning for entity resolution: A design space exploration." In 2024 IEEE 40th International Conference on Data Engineering (ICDE), pp. 3696-3709. IEEE, 2024. https://ieeexplore.ieee.org/abstract/document/10597751

DeepPrep: An LLM-Powered Agentic System for Autonomous Data Preparation

Published in VLDB 2026 (Under Review), 2026

This paper proposes DeepPrep, an LLM-powered agentic system for autonomous data preparation.

Recommended citation: Meihao Fan, Ju Fan, Yuxin Zhang, Shaolei Zhang, Xiaoyong Du, Jie Song, Peng Li, Fuxin Jiang, Tieying Zhang, Jianjun Chen. "DeepPrep: An LLM-Powered Agentic System for Autonomous Data Preparation." VLDB 2026 (Under Review).

TACO: A Benchmark for Open-Domain Text-to-SQL with Ambiguous and Cross-Database Queries

Published in VLDB 2026 (Accepted), 2026

This paper proposes TACO, a benchmark for Open-Domain Text-to-SQL.

Recommended citation: Chao Deng, Ju Fan, Yuyu Luo, Qinliang Xue, Meihao Fan, Yuxin Zhang, Min Zhang, Xiaofeng Jia, Jing Zhang, Xiaoyong Du. "TACO: A Benchmark for Open-Domain Text-to-SQL with Ambiguous and Cross-Database Queries." VLDB 2026 (Accepted).

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Published in ICML 2026 (Under Review), 2026

This paper explores agentic large language models for autonomous data science.

Recommended citation: Anonymous Authors (incl. **Meihao Fan**). "DeepAnalyze: Agentic Large Language Models for Autonomous Data Science." ICML 2026 (Under Review).

CODA-BENCH: Can Code Agents Handle Data-Intensive Tasks?

Published in ICML 2026 (Under Review), 2026

This paper introduces CODA-BENCH to evaluate code agents on data-intensive tasks.

Recommended citation: Anonymous Authors (incl. **Meihao Fan**). "CODA-BENCH: Can Code Agents Handle Data-Intensive Tasks?" ICML 2026 (Under Review).

software

talks

teaching