Yuying Ge 葛玉莹Google Sholar Github Shenzhen, China |
|
I am currently a Senior Researcher at Tencent ARC Lab, working on multimodal foundation models.
Before that, I was a reseacher at Tencent AI Lab.
In Aug 2023, I got my Ph.D. degree from the Department of Computer Science, The University of Hong Kong,
under the supervision of Prof. Ping Luo.
I was also a visiting student at UCSD, working with Prof. Xiaolong Wang.
We are actively looking for self-motivated interns to work on related research topics. Please feel free to reach out if you are interested.
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation,
Yuying Ge*, Sijie Zhao*, Jinguo Zhu*, Yixiao Ge, Kun Yi, Lin Song, Chen Li, Xiaohan Ding, Ying Shan, Arxiv, 2024 [paper|code|gradio demo] |
|
SEED-Bench-2: Benchmarking Multimodal Large Language Models,
Bohao Li*, Yuying Ge*, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan, CVPR, 2024 [paper|code|dataset|leaderboard] |
|
Making LLaMA SEE and Draw with SEED Tokenizer,
Yuying Ge*, Sijie Zhao*, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan, ICLR, 2024 [paper|code|project|gradio demo] |
|
Planting a SEED of Vision in Large Language Model,
Yuying Ge*, Yixiao Ge*, Ziyun Zeng, Xintao Wang, Ying Shan, Technical Report, 2023 [paper|code] |
|
Policy Adaptation from Foundation Model Feedback,
Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo, Xiaolong Wang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 [paper|project] |
|
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge,
Ziyun Zeng*, Yuying Ge*, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 [paper|code] |
|
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval,
Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie and Ping Luo European Conference on Computer Vision (ECCV) 2022 [paper|code] |
|
Bridging Video-text Retrieval with Multiple Choice Questions,
Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie and Ping Luo IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (oral) [paper|code|project] |
|
MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning,
Yuying Ge, Yibing Song, Ruimao Zhang and Ping Luo arXiv preprint, 2022 [paper|demo] |
|
MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection from a Few Samples,
Yuying Ge, Ruimao Zhang, and Ping Luo IEEE Transactions on Image Processing (TIP) 2021 [paper] |
|
Parser-Free Virtual Try-on via Distilling Appearance Flows,
Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, and Ping Luo IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021 [paper|code] |
|
DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images,
Yuying Ge, Ruimao Zhang, Xiaogang Wang, Xiaoou Tang, and Ping Luo IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019 [paper|dataset] |
Ph.D., Department of Computer Science, The University of Hong Kong, 2019 - 2023
Bachelor, University of Electronic Science and Technology of China (UESTC) (ranking 1/525), 2014 - 2018
Senior Researcher in Tencent ARC Lab, 2024 - Present
Senior Researcher in Tencent AI Lab, 2023 - 2024
Intern in Tencent ARC Lab, 2021 - 2022
Intern in Tencent AI Lab, 2020 - 2021
Research Assistant in Multimedia Lab (MMLab), The Chinese University of Hong Kong, 2018 - 2019
Intern in SenseTime Research, 2017 - 2018
Reviewer for CVPR, ICLR, ICML, NeurIPS, ECCV, ICCV, TPAMI, TNNLS, TMM, TVCJ
Organizer of DeepFashion2 Challenge Clothes Landmark Detection
and Clothes Retrieval in 2019, 2020
Organizer of Third Workshop on Computer Vision for Fashion, Art and Design in CVPR, 2020
Organizer of Second Workshop on Computer Vision for Fashion, Art and Design in ICCV, 2019