Zhen Wang / 王震

Hi! I'm a postdoctoral researcher working with Prof. Eric Xing from CMU/MBZUAI and Prof. Zhiting Hu from UCSD. My research interests lie in natural language processing and machine learning. I received my PhD from The Ohio State University, advised by Prof. Huan Sun.

In summer 2022, I interned at the MIT-IBM Watson AI Lab, working with Rameswar, Yoon, Leonid, and Rogerio on efficient adaptation of large language models. In summer 2021, I was a research intern with the NLP group at Microsoft Research, Redmond, working with Nebojsa and Kolya on coherence boosting and prompt calibration for GPT-3. In summer 2020, I was a research intern on the Data Science team at NEC Labs America, working with Bo Zong on commonsense knowledge representation and reasoning.

Email  /  CV (Feb 2023)  /  GitHub  /  Twitter  /  Google Scholar


Research Overview

My research is rooted in human-centered AI, with the aim of infusing machine learning models, especially foundation models, with a human-like understanding of the world and its knowledge, so that AI can serve as both a reliable assistant and an insightful collaborator. I believe future AI systems need to not only augment human capabilities but also resonate with human values and understanding, remain accessible, and engage in proactive problem-solving. My ultimate goal is not to build AI that merely replicates or replaces human abilities, but to develop systems that enrich human experiences, amplify human potential, honor human values, and actively collaborate with humans in addressing real-world challenges.

  1. Interpreting and steering ML models towards human values: Transparency is key in human-centered AI. I develop methodologies to open the black box of these models, enhance our understanding of their behavior, and ensure they align with human values via more robust control and prompting techniques. [ACL 2020] [ACL 2022] [ACL 2023] [New preprint]
  2. Adapting and transferring knowledge for dynamic human needs: Human-centered AI demands systems that can swiftly adapt and learn in response to the changing needs and circumstances of their human users. I develop efficient methods to transfer knowledge between AI systems and adapt them across diverse tasks and domains for greater accessibility. [ICLR 2023] [NAACL SUKI 2022] [EACL 2023]
  3. Augmenting models to proactively solve real-world problems for humans: An active problem-solving drive is a distinctive trait of human intelligence. I aim to elevate AI systems from passive responders into proactive problem solvers that interact with the physical world and novel domains. [New preprint] [New preprint]

Research Opportunities: I am always looking for highly motivated students, particularly from underrepresented groups, to join me on various research projects, both during the school year and over the summer. If you are eager to strengthen your research skills and be part of this exciting opportunity, please email me with a brief description of your interests.

News

Publications

Preprint


Reasoning with Language Model is Planning with World Model


Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
PDF

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings


Shibo Hao, Tianyang Liu, Zhen Wang, Zhiting Hu
PDF

GPT Is Becoming a Turing Machine: Here Are Some Ways to Program It


Ana Jojic, Zhen Wang, Nebojsa Jojic
PDF

2023


ThinkSum: Probabilistic Reasoning Over Sets Using Large Language Models


Batu Ozturkler, Nikolay Malkin, Zhen Wang, Nebojsa Jojic
[ACL 2023] The 61st Annual Meeting of the Association for Computational Linguistics (Main)
PDF / Code / Slides / Poster

We propose ThinkSum, a two-stage probabilistic inference paradigm that improves LLMs' ability to reason over multiple objects in two steps, Think (e.g., retrieval of associations) and Sum (e.g., aggregation of results), and outperforms chain-of-thought prompting on hard BIG-bench tasks.


Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning


Zhen Wang, Rameswar Panda, Leonid Karlinsky, Rogerio Feris, Huan Sun, Yoon Kim
[ICLR 2023] The Eleventh International Conference on Learning Representations
PDF / Code / Slides / Poster

We propose Multitask Prompt Tuning (MPT) to exploit rich cross-task knowledge for more efficient and generalizable transfer learning. MPT learns a single transferable soft prompt through a novel combination of prompt decomposition and prompt distillation.


Entity Tracking via Effective Use of Multi-Task Learning Models


Janvijay Singh, Fan Bai, Zhen Wang
[EACL 2023] The 17th Conference of the European Chapter of the Association for Computational Linguistics (Main)
PDF / Code / Slides / Poster

How can multi-task knowledge from pre-training be transferred to niche downstream tasks, such as entity tracking on procedural text? We show that simply fine-tuning T5 with a specialized QA prompt and task-specific decoding reaches state-of-the-art performance.

2022


Toward Knowledge-Centric NLP: Acquisition, Representation, Transfer, and Reasoning


Zhen Wang
The Ohio State University, Ph.D. Dissertation, 2022
PDF

Coherence Boosting: When Your Pretrained Language Model is Not Paying Enough Attention


Nikolay Malkin, Zhen Wang, Nebojsa Jojic
[ACL 2022] The 60th Annual Meeting of the Association for Computational Linguistics
PDF / Code / Slides / Poster (Long Paper, Oral Presentation)

We demonstrate that large language models insufficiently learn the effect of distant words on next-token prediction. We present Coherence Boosting, an inference procedure that increases an LM's focus on long context and yields large improvements on NLG and NLU tasks.


Knowledge Transfer between Structured and Unstructured Sources for Complex Question Answering


Lingbo Mo*, Zhen Wang*, Jie Zhao, Huan Sun
[SUKI@NAACL 2022] NAACL 2022 Workshop on Structured and Unstructured Knowledge Integration (SUKI)
PDF / Code / Slides / Poster (*Equal contribution)

We study knowledge transfer for multi-hop reasoning between structured (knowledge base) and unstructured (text corpus) knowledge sources. We design SimultQA, which unifies KBQA and TextQA systems, and leverage it to study how reasoning is transferred between the two knowledge sources.

2021


Bootstrapping a User-Centered Task-Oriented Dialogue System


Shijie Chen, Ziru Chen, Xiang Deng, Ashley Lewis, Lingbo Mo, Samuel Stevens, Zhen Wang, Xiang Yue, Tianshu Zhang, Yu Su, Huan Sun
[Alexa Prize TaskBot Challenge] 1st Proceedings of the Alexa Prize TaskBot Challenge (Alexa Prize 2021)
PDF / Third-place honor in the TaskBot Finals!

We build TacoBot, a task-oriented dialogue system for the inaugural Alexa Prize TaskBot Challenge that assists users in multi-step cooking and home improvement tasks. We propose several data augmentation methods, such as GPT-3 simulation, to bootstrap neural dialogue systems into new domains and make them more robust to noisy user initiatives.


Modeling Context Pair Interaction for Pairwise Tasks on Graphs


Zhen Wang, Bo Zong, Huan Sun
[WSDM 2021] The 14th ACM International Conference on Web Search and Data Mining
PDF / Code / Slides / Poster (Long Paper, Online Presentation)

We propose to explicitly model context interactions for pairwise prediction tasks on graphs from two perspectives: node-centric and pair-centric. We also propose to pre-train pair embeddings to facilitate the pair-centric model.

2020


Rationalizing Medical Relation Prediction from Corpus-level Statistics


Zhen Wang, Jennifer Lee, Simon Lin, Huan Sun
[ACL 2020] The 58th Annual Meeting of the Association for Computational Linguistics
PDF / Code / Slides / Poster / Video (Long Paper, Online Presentation)

We propose a self-interpretable framework that rationalizes neural relation predictions based on corpus-level statistics. The framework is inspired by cognitive theories of recall and recognition and provides structured knowledge triplets as rationales.


Graph Embedding on Biomedical Networks: Methods, Applications, and Evaluations


Xiang Yue, Zhen Wang, Jingong Huang, Srinivasan Parthasarathy, Soheil Moosavinasab, Yungui Huang, Simon Lin, Wen Zhang, Ping Zhang, Huan Sun
[Bioinformatics] Volume 36, Issue 4, 15 February 2020, Pages 1241-1251
PDF / Code / Slides / Poster

We benchmark 11 representative graph embedding methods on 5 important biomedical tasks. We verify the effectiveness of recent graph embedding methods and provide general guidelines for their usage.

2019


SurfCon: Synonym Discovery on Privacy-Aware Clinical Data


Zhen Wang, Xiang Yue, Soheil Moosavinasab, Yungui Huang, Simon Lin, Huan Sun
[KDD 2019] The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
PDF / Code / Slides / Poster (Research Track, Long Paper, Oral Presentation)

We propose to discover structured knowledge, i.e., synonyms, from a privacy-aware text corpus and present a novel framework that leverages both surface form and context information to discover out-of-distribution synonyms.

Before 2019


A Comprehensive Study of StaQC for Deep Code Summarization


Jayavardhan Reddy Peddamail, Ziyu Yao, Zhen Wang, Huan Sun
[KDD 2018 Deep Learning Day] The 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
PDF / Code / Slides / Poster (SPOTLIGHT)

We examine three popular datasets mined from Stack Overflow on the code summarization task and show that StaQC (Stack Overflow Question-Code pairs) helps achieve substantially better results.


Hessian Regularized Sparse Coding for Human Action Recognition


Weifeng Liu, Zhen Wang, Dapeng Tao, Jun Yu
[MMM 2015] The 21st International Conference on Multimedia Modeling
PDF / Code / Slides / Poster / Bibtex

We propose Hessian regularized sparse coding (HessianSC) for action recognition, which preserves local geometry well and steers the sparse codes to vary linearly along the manifold of the data distribution.

Honors and Awards

  • Third-Place Honor, Inaugural Alexa Prize TaskBot Challenge, 2022
  • Graduate Research Award, CSE, OSU, 2022
  • Graduate Student Research Poster Award (Top 5), CSE, OSU, 2021
  • SIGIR Student Travel Grant, 2021
  • Rising Stars in Data Science, Center for Data and Computing (CDAC), University of Chicago, January 2021
  • SIGKDD Student Travel Award, 2019
  • China Scholarship Council (CSC) Scholarship (fully funded visiting program at Polytech Nice Sophia), Nice, France, 2015
  • National Scholarship, China, 2014
  • Soong Ching Ling Foundation (SCLF) Scholarship, China, 2013
  • National Scholarship for Encouragement, China, 2012

Services

    Area Chair or Senior PC Member:
    • NLPCC 2023
    Program Committee Member:
    • ACL ARR (Oct'21, Nov'21, Jan'22, Apr'22, Sep'22, Oct'22, Dec'22, Feb'23)
    • NAACL (2021, SUKI 2022 Workshop)
    • EMNLP (2021, 2022)
    • ACL (2021, 2023)
    • ICML 2023
    • NeurIPS 2023
    • KDD 2023
    • AAAI 2023
    • NLPCC (2020, 2021, 2022)
    External Reviewer:
    • KDD (2019, 2020), ACL 2018, ICDM 2018


Source code from Leonid Keselman, design and inspiration from Jon Barron and Dongkuan.