I am a Ph.D. student at MIT CSAIL, advised by Prof. William T. Freeman. I’m excited about understanding and designing computational modules for visual and auditory perception problems, especially ones that human can solve easily. I received my B.S. in Electronic Engineering from Tsinghua University, advised by Prof. Yebin Liu

Research

Structure and Motion for Casual Videos
Zhoutong Zhang, Forrester Cole, Zhengqi Li, Michael Rubinstein, Noah Snavely, William T. Freeman
[In Submission]

Unsupervised Semantic Segmentation By Distilling Feature Correspondences
Mark Hamilton, Zhoutong Zhang, Bharath Hariharan, Noah Snavely, William T. Freeman
ICLR 2022
Paper

Consistent Depth of Moving Objects in Video
Zhoutong Zhang, Forrester Cole, Richard Tucker, William T. Freeman, Tali Dekel
SIGGRAPH 2021
Project Page | Paper | Video | Code

Differentiable Surface Rendering via Non-Differentiable Sampling
Forrester Cole, Kyle Genova, Avneesh Sud, Daniel Vlasic, Zhoutong Zhang
ICCV 2021
Paper

Editing Conditional Radiance Fields
Steven Liu, Xiuming Zhang, Zhoutong Zhang, Richard Zhang, Jun-Yan Zhu, Bryan Russell,
ICCV 2021
Paper | Webpage | Code | Video | Demo

Deep Audio Priors Emerge From Harmonic Convolutional Networks
Zhoutong Zhang, Yunyun Wang, Chuang Gan, Jiajun Wu, Joshua B. Tenenbaum, Antonio Torralba, William T. Freeman,
ICLR 2020
Project Page | Slides | Paper

A Computational Model for Combinatorial Generalization in Physical Auditory Perception
Yunyun Wang, Chuang Gan, Max H. Siegel, Zhoutong Zhang, Jiajun Wu, Joshua B. Tenenbaum
CCN 2019
Paper

Learning to Reconstruct Shapes from Unseen Classes
Xiuming Zhang*, Zhoutong Zhang*, Chengkai Zhang, Joshua B. Tenenbaum, William T. Freeman, Jiajun Wu
NeurIPS 2018 (Oral)
Paper | Code | Project | Talk | BibTeX

Visual Object Networks: Image Generation with Disentangled 3D Representation
Jun-Yan Zhu, Zhoutong Zhang, Chengkai Zhang, Jiajun Wu, Antonio Torralba, Joshua B. Tenenbaum, William T. Freeman
NeurIPS 2018
Paper | Code | Project | BibTeX

Seeing Tree Structure from Vibration
Tianfan Xue*, Jiajun Wu*, Zhoutong Zhang, Joshua B. Tenenbaum, William T. Freeman
ECCV 2018
Paper | Project

Learning Shape Priors for Single-View 3D Completion and Reconstruction
Jiajun Wu*, Chengkai Zhang*, Xiuming Zhang, Zhoutong Zhang, William T. Freeman, Joshua B. Tenenbaum
ECCV 2018
Paper | Project | BibTeX

Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
Xingyuan Sun*, Jiajun Wu*, Xiuming Zhang, Zhoutong Zhang, Chengkai Zhang*, William T. Freeman, Joshua B. Tenenbaum
CVPR 2018
Paper | Code | Project | BibTeX

Shape and Material from Sound
Zhoutong Zhang, Qiujia Li, Zhengjia Huang, Jiajun Wu, Joshua B. Tenenbaum, William T. Freeman
NeurIPS 2017
Paper | Code | Project | BibTeX

Generative Modeling of Audible Shapes for Object Perception
Zhoutong Zhang*, Jiajun Wu*, Qiujia Li, Zhengjia Huang, James Traer, Josh H. McDermott, Joshua B. Tenenbaum William T. Freeman
ICCV 2017
Paper | Code | Project | BibTeX

Light Field from Micro-Baseline Image Pair
Zhoutong Zhang, Yebin Liu, Qionghai Dai
CVPR 2015
Paper | Code | Video | Project | Supp | BibTeX