*: Equal Contributions; ✝: My Advisee
-
MAmmoTH2: Scaling Instructions from the Web
Xiang Yue, Tuney Zheng, Ge Zhang, Wenhu Chen
NeurIPS 2024
-
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Boshi Wang, Xiang Yue, Yu Su, Huan Sun
NeurIPS 2024
-
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
Jinjie Ni, Fuzhao Xue, Xiang Yue, Yuntian Deng, Mahir Shah, Kabir Jain, Graham Neubig, Yang You
NeurIPS 2024
-
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
Junpeng Liu*✝, Yifan Song*, Bill Yuchen Lin, Wai Lam, Graham Neubig, Yuanzhi Li, Xiang Yue
COLM 2024
-
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Tianyu Zheng*✝, Ge Zhang*, Tianhao Shen*, Xueling Liu*, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue
ACL 2024, Findings
-
🔥 MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen
CVPR 2024 (🏆 Award Candidate Paper, Oral: 24/11,532=0.2%)
-
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue*, Xingwei Qu, Ge Zhang, Yao Fu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen*
ICLR 2024 (Spotlight)
-
Automatic Evaluation of Attribution by Large Language Models
Xiang Yue, Boshi Wang, Kai Zhang, Ziru Chen, Yu Su, Huan Sun
EMNLP 2023, Findings
-
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe
Xiang Yue, Huseyin A. Inan, Xuechen Li, Girish Kumar, Julia McAnallen, Hoda Shajari, Huan Sun, David Levitan, Robert Sim
ACL 2023 (🏆 Best Paper Honorable Mention)
-
Synthetic Question Value Estimation for Domain Adaptation of Question Answering
Xiang Yue, Ziyu Yao, Huan Sun
ACL 2022
-
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References
Xiang Yue, Xiaoman Pan, Wenlin Yao, Dian Yu, Dong Yu, Jianshu Chen
ACL 2022
-
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering
Xiang Yue*, Frederick Zhang*, Ziyu Yao, Simon Lin, Huan Sun
IEEE Internatinal Conference on Bioinformatics and Biomedicine 2021 (BIBM 2021)
🏆 Best Paper (1/727)
-
Differential Privacy for Text Analytics via Natural Text Sanitization
Xiang Yue*, Minxin Du*, Tianhao Wang, Yaliang Li, Huan Sun and Sherman S. M. Chow
ACL 2021, Findings
-
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset
Xiang Yue, Bernal Jimenez Gutierrez and Huan Sun
ACL 2020