Yuanzhen Li I'm a Senior Staff Software Engineer at Google DeepMind, working on new kinds of LLMs. Previously, I was a TLM (Tech Lead Manager) at Google Research, where I supported a talented team and a few cross-team Computer Vision and Generative AI efforts, e.g., Generative Product Imagery, "Imagen" in Google Cloud Vertex AI, Muse, DreamBooth, DreamBooth3D, Generative Uncrop, etc. Prior to Google, I spent 10 years in the startup world. I made iPhone computational-photography apps, and one of them (TrueHDR) was quite popular in the 2009-2012 era. I also founded a startup leveraging deep learning for image search; it was acquired by VSCO in 2015 and I then worked at VSCO as a Director of Engineering. Prior to that, I completed my PhD in 2009 at MIT with Prof. Edward H. Adelson. At MIT, I was a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL) and the Perceptual Science Laboratory. |
||
Some Projects | ||
|
DreamBooth3D: Subject-Driven Text-to-3D Generation Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan T. Barron, Yuanzhen Li, Varun Jampani March 2023, Paper, Webpage | |
|
Muse: Text-To-Image Generation via Masked Generative Transformers Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan January 2023, Paper, Webpage | |
|
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman August 2022, Paper, Webpage | |
|
Simplified Transfer Learning for Chest Radiography Models Using Less Data Andrew B. Sellergren, Christina Chen, Zaid Nabulsi, Yuanzhen Li, Aaron Maschinot, Aaron Sarna, Jenny Huang, Charles Lau, Sreenivasa Raju Kalidindi, Mozziyar Etemadi, Florencia Garcia-Vicente, David Melnick, Yun Liu, Krish Eswaran, Daniel Tse, Neeral Beladia, Dilip Krishnan, Shravya Shetty Radiology. 2022, Paper, Blog, Tweet | |
|
SAMURAI: Shape And Material from Unconstrained Real-world Arbitrary Image collections Mark Boss, Andreas Engelhardt, Abhishek Kar, Yuanzhen Li, Deqing Sun, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani 2022, Paper, Video | |
|
LASSIE: Learning Articulated Shape from Sparse Image Ensemble via 3D Part Discovery Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani 2022, Paper, Video, Supplemental | |
|
Ads Image Uncropping, Blog | |
|
Automated Designs for Responsive Display Ads, Blog, Patent | |
|
ScribbleBoost: Adding Classification to Edge-Aware Interpolation of Local Image and Video Adjustments Paper |
|
Image Mapping Using Local and Global Statistics |
||
Image Statistics for Surface Reflectance Perception |
||
|
Measuring Visual Clutter Ruth Rosenholtz, Yuanzhen Li, Lisa Nakano. Journal of Vision, 7(2):17, 1-22, 2007 |
|
Compressing
and Companding High Dynamic Range Images with Subband Architectures. Download the doll picture (the hdr version of the image on the left). |
||
|
||
Feature Congestion:
A Measure of Display Clutter. Download matlab code for generating color and contrast "clutter maps". |
||
Multiple-cue Illumination Estimation in
Textured Scenes. (pdf)
|
||
Diffuse-Specular Separation and Depth Recovery
from Image Sequences. (pdf) Stephen Lin, Yuanzhen Li, Sing Bing Kang, Xin Tong, Heung-Yeung Shum. ECCV 2002 |
||
Multibaseline Stereo in the Presence
of Specular Reflections. (pdf) |
||
Single-Image Reflectance Estimation for
Relighting by Iterative Soft Grouping. (pdf) |
||
|