Yuying Ge 葛玉莹

yyge13@gmail.com

Google Scholar

GitHub

Shenzhen, China

Biography

I am currently a Senior Researcher at Tencent ARC Lab, working on multimodal foundation models. Before that, I was a researcher at Tencent AI Lab. In Aug 2023, I received my Ph.D. from the Department of Computer Science, The University of Hong Kong, under the supervision of Prof. Ping Luo. I was also a visiting student at UCSD, working with Prof. Xiaolong Wang. We are actively looking for self-motivated interns to work on related research topics. Please feel free to reach out if you are interested.

News

Publications

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation,
Yuying Ge*, Sijie Zhao*, Jinguo Zhu*, Yixiao Ge, Kun Yi, Lin Song, Chen Li, Xiaohan Ding, Ying Shan,
arXiv preprint, 2024
[paper|code|gradio demo]
SEED-Bench-2: Benchmarking Multimodal Large Language Models,
Bohao Li*, Yuying Ge*, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan,
CVPR, 2024
[paper|code|dataset|leaderboard]
Making LLaMA SEE and Draw with SEED Tokenizer,
Yuying Ge*, Sijie Zhao*, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan,
ICLR, 2024
[paper|code|project|gradio demo]
Planting a SEED of Vision in Large Language Model,
Yuying Ge*, Yixiao Ge*, Ziyun Zeng, Xintao Wang, Ying Shan,
Technical Report, 2023
[paper|code]
Policy Adaptation from Foundation Model Feedback,
Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo, Xiaolong Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023
[paper|project]
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge,
Ziyun Zeng*, Yuying Ge*, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023
[paper|code]
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval,
Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie and Ping Luo
European Conference on Computer Vision (ECCV) 2022
[paper|code]
Bridging Video-text Retrieval with Multiple Choice Questions,
Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie and Ping Luo
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (oral)
[paper|code|project]
MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning,
Yuying Ge, Yibing Song, Ruimao Zhang and Ping Luo
arXiv preprint, 2022
[paper|demo]
MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection from a Few Samples,
Yuying Ge, Ruimao Zhang, and Ping Luo
IEEE Transactions on Image Processing (TIP) 2021
[paper]
Parser-Free Virtual Try-on via Distilling Appearance Flows,
Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, and Ping Luo
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
[paper|code]
DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images,
Yuying Ge, Ruimao Zhang, Xiaogang Wang, Xiaoou Tang, and Ping Luo
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
[paper|dataset]

Education

Experiences

Academic Activities


© Yuying Ge | Last updated: Dec. 2021