![]() | Zhou Ren (任洲) |
Zhou is a technical innovator, actively seeking solutions to build and enhance real-world products. He is a founding member of Wormpex AI Research, which is the AI branch of BianLiFeng (便利蜂), a fast growing advanced convenience store chain in China backed by a global capital. At Wormpex AI research (directed by Dr. Gang Hua), we build state-of-the-art AI technologies to facilitate new retail logistics from storefronts, warehouses to manufacture.
Zhou is an active researcher. His research interests include Computer Vision, Multimedia, Natural Language Processing and Machine Learning. He has worked on problems of hand pose estimation, multi-modal joint understanding, object detection, action detection, image captioning, reinforcement learning, and adversarial machine learning, etc. He received his Ph.D. degree in Computer Science from University of California, Los Angeles (UCLA) in 2016, and a M.Eng degree from Nanyang Technological University (NTU) in 2012. Before that, he received his Bachelor’s degree with highest honor from Huazhong University of Science and Technology (HUST) in 2010.
Selected honors: (1) Runner-up winner in NIPS 2017 Adversarial Attack and Defense Competition (among 107 teams); (2). been nominated to the “CVPR 2017 Best Student Paper Award”; (3). winner of the “IEEE Trans. on Multimedia 2016 Best Paper Award”; (4). developed the first part-based hand gesture recognition system using Kinect sensor with Nanyang Technological University and Microsoft Research Redmond (Demo1, Demo2).
[08/19] One paper accepted by IEEE Trans on Pattern Analysis and Machine Intelligence (TPAMI). Congratulations to my mentored student (Sheng Liu) and collaborators!
[07/19] Two papers accepted by ICCV 2019. One paper accepted by BMVC 2019 as an Oral! Congratulations to my mentored students (Tan Yu, Tianlong Chen) and collaborators!
[03/19] Three papers accepted by CVPR 2019, two as Orals, and one as Poster. Congratulations to my mentored students (Liuhao Ge, Jonghwan Mun, Cihang Xie) and collaborators!
[12/18] I have recently joined Wormpex AI Research, to assist our executives set up and grow a research group in visual recognition & analysis for future retail business at BianLiFeng (便利蜂).
[07/18] Been invited to present in a panel discussion at ICME 2018, together with Dr. Tao Mei, Dr. Wenjun Zeng, Prof. Xilin Chen, Prof. Mohan Kankanhalli, and Prof. Junsong Yuan.
![]() |
[07/18] One paper accepted by ACM Multimedia 2018 as an Oral! Two papers accepted by ECCV 2018. One paper accepted by ICLR 2018. And a book chapter published in Springer on the topic of Adversarial Attacks and Defenses.
[11/17] We have won the 2nd place in NIPS’17 Adversarial Defense Challenge among 107 teams; and were invited to present in NIPS’17 Competition Track (for details, please check our work).
![]() |
[08/17] Our work on hand gesture recognition is ranked the most cited paper in IEEE TMM since 2011. Our TWO works on hand gesture recognition are ranked the most cited paper && the 2nd most cited paper in ACM MM 2011.
[07/17] Our paper on Reinforcement Learning-based Image Captioning has been nominated to the “Best Student Paper Award” at CVPR 2017!
[06/16] Our paper “Robust Part-based Hand Gesture Recognition Using Kinect Sensor” has been selected to receive the 2016 IEEE Trans. on Multimedia Prize Paper Award (TMM Best Paper Award)!
![]() |
My research interests lie in the fields of Computer Vision, Multimedia, Natural Language Processing, and Machine Learning. I have worked on hand pose estimation, object detection, multi-modal joint understanding, image captioning, video captioning, shape understanding, reinforcement learning, and adversarial machine learning, etc.
My current research focuses include: 1) human/hand pose estimation, 2) object detection, 3) Human Re-ID, 4)multi-modal joint understanding.
(Note: “^” indicates the co-author is the student I mentored during whose internship or during an university collaboration)
1. on Hand Gesture Recognition and Pose Estimation
![]() |
3D Hand Shape and Pose Estimation from a Single RGB Image
Liuhao Ge^, Zhou Ren, Yuncheng Li, Zehao Xue, Yingying Wang, Jianfei Cai, Junsong Yuan
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral).
[PDF][supplementary][video]
End-to-End 3D Hand Pose Estimation from Stereo Cameras
Yuncheng Li, Zehao Xue, Yingying Wang, Liuhao Ge, Zhou Ren, Jonathan Rodriguez
In British Machine Vision Conference (BMVC), 2019 (Oral).
[PDF]
Point-to-Point Regression PointNet for 3D Hand Pose Estimation
Liuhao Ge^, Zhou Ren, and Junsong Yuan
In European Conference on Computer Vision (ECCV), 2018.
[PDF]
Robust Part-based Hand Gesture Recognition Using Kinect Sensor
Zhou Ren, Junsong Yuan, Jingjing Meng, and Zhengyou Zhang
In IEEE Trans. on Multimedia (TMM), 15(5), 1110-1120, 2013.
* Winner of 2016 IEEE Trans. on Multimedia Prize Paper Award (Best Paper Award)*
[PDF][Bibtex][NTU-Microsoft-Kinect HandGesture Dataset]
Robust Hand Gesture Recognition based on Finger-Earth Mover’s Distance with a Commodity Depth Camera
Zhou Ren, Junsong Yuan, and Zhengyou Zhang
In ACM Multimedia (ACM MM), Scottsdale, Arizona, USA, Nov. 28-Dec. 1, 2011.
*The most cited paper in ACM MM 2011*
[PDF][Bibtex][NTU-Microsoft-Kinect HandGesture Dataset][Demo]
Robust Hand Gesture Recognition with Kinect Sensor
Zhou Ren, Jingjing Meng, Junsong Yuan, and Zhengyou Zhang
In ACM Multimedia (ACM MM), Scottsdale, Arizona, USA, Nov. 28-Dec. 1, 2011.
*The 2nd most cited paper in ACM MM 2011*
[PDF][Bibtex][Demo]
2. on Multi-modal Joint Representation Learning
![]() |
SibNet: Sibling Convolutional Encoder for Video Captioning
Sheng Liu^, Zhou Ren, and Junsong Yuan; In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019.
[PDF]
Streamlined Dense Video Captioning
Jonghwan Mun^, Linjie Yang, Zhou Ren, Ning Xu, and Bohyung Han
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral).
[PDF][supplementary]
SibNet: Sibling Convolutional Encoder for Video Captioning
Sheng Liu^, Zhou Ren, and Junsong Yuan
In ACM Multimedia, 2018 (Oral)
[PDF]
Multiple Instance Visual-Semantic Embedding
Zhou Ren, Hailin Jin, Zhe Lin, Chen Fang, and Alan Yuille
In British Machine Vision Conference (BMVC), 2017 (Oral)
[PDF][Supplementary][Bibtex][Video]
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, and Li-Jia Li
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017 (Oral)
*Best Student Paper Award Nomination*
[PDF][Bibtex][Talk slides][Poster][Video]
3. on Object Detection and Representation Learning
![]() |
Deep Regionlets for Object Detection
Hongyu Xu^, Xutao Lv, Xiaoyu Wang, Zhou Ren, and Rama Chellappa
In European Conference on Computer Vision (ECCV), 2018.
[PDF]
4. on Action Detection
![]() |
Temporal Structure Mining for Weakly Supervised Action Detection
Tan Yu^, Zhou Ren, Yuncheng Li, Enxu Yan, Ning Xu, and Junsong Yuan
In International Conference on Computer Vision (ICCV), 2019.
[PDF]
5. on Person Re-Identification
![]() |
ABD-Net: Attentive but Diverse Person Re-Identification
Tianlong Chen^, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, and Zhangyang Wang
In International Conference on Computer Vision (ICCV), 2019.
[PDF]
6. on Adversarial Machine Learning
![]() |
Improving Transferability of Adversarial Examples with Input Diversity
Cihang Xie^, Yuyin Zhou, Song Bai, Zhishuai Zhang, Jianyu Wang, Zhou Ren, and Alan Yuille
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
[PDF]
Mitigating Adversarial Effects Through Randomization
Cihang Xie^, Jianyu Wang, Zhishuai Zhang, Zhou Ren, and Alan Yuille
In International Conference on Learning Representations (ICLR), 2018
* Runner-up Winner in NIPS 2017 Adversarial Attack and Defense Competition (among 107 teams)*
[PDF]
Adversarial Attacks and Defences Competition
Alexey Kurakin, et. al.
In a book chapter from the NIPS 2017 Competition Book, Springer 2018
[PDF]
7. on Shape Representation and Shape Coding
![]() |
8. on Medical Image Processing
![]() |
Automated Pericardial Fat Quantification from Coronary Magnetic Resonance Angiography: A Feasibility Study
Xiaowei Ding, Jianing Pang, Zhou Ren, Mariana Diaz-Zamudio, Chenfangfu Jiang, Zhaoyang Fan, Daniel Berman, Debiao Li, Demetri Terzopoulos, Piotr Slomka, and Damini Dey
In Journal of Medical Imaging, 2016.
[PDF][Bibtex]
System and method for robust hand gesture recognition using commodity depth sensor, Singapore provisional patent application, filed in 10/2011
Co-invented with Junsong Yuan, Jingjing Meng
Modeling semantic concepts in an embedding space as distributions, US patent application, filed in 01/2016
Co-invented with Hailin Jin, Zhe Lin, and Chen Fang
Embedding-driven image captioning using deep reinforcement learning and lookahead beam search, US patent application, filed in 11/2016
Co-invented with Xiaoyu Wang, Ning Zhang, Xutao Lv, and Jia Li
Generating Data in a Messaging System for a Machine Learning Model, US patent application, filed in 12/2017
Co-invented with Zehao Xue
Query Matching to Media Collections in a Messaging System, US patent application, filed in 01/2018
Co-invented with Roger Luo, Sushobhan Nayak, Xinran He, and Christophe Van Gysel
Device Location based on Machine Learning Classifications, US patent, granted in 05/2018
Co-invented with Ebony Charlton, Sumant Hanumante, Dhritiman Sagar
Embedding space for images with multiple text labels, US patent, granted in 07/2018
Co-invented with Hailin Jin, Zhe Lin, and Chen Fang
Lluis Castrejon (2017 Summer), PhD student at MILA, University of Montreal
Zhe Li (2017 Summer), PhD student at University of Iowa
Hongyu Xu (2017 Summer), PhD student at University of Maryland, College Park
Cihang Xie (2017 Fall - 2018 Spring), PhD student at Johns Hopkins University
Sheng Liu (2017 Fall - present), PhD student at The State University of New York at Buffalo
Liuhao Ge (2018 Spring - 2019 Spring), PhD student at Nanyang Technological University
Tan Yu (2018 Summer), PhD student at Nanyang Technological University
Shibi He (2018 Summer), PhD student at University of Illinois Urbana-Champaign
Jonghwan Mun (2018 Summer), PhD student at Pohang University of Science and Technology
Tianlong Chen (2019 Spring), PhD student at Texas A&M University
Ye Yuan (2019 Spring), PhD student at Texas A&M University
Wuyang Chen (2019 Spring), PhD student at Texas A&M University
Shiyi Lan (2019 Summer), PhD student at University of Maryland, College Park
Teaching Associate of CS32 Introduction to Computer Science II (data structures and algorithms) with Prof. Carey Nachenberg
[discussion materials]
Teaching Assistant of CS31 Introduction to Computer Science I (C++ programming) with Prof. David Smallberg
[discussion materials]
Associate Editor of The Visual Computer Journal (TVCJ).
Program Committee of CVPR 2017, FG 2018 2019, IJCAI 2018 2019, ECAI 2018, ACM Multimedia 2018 2019, AAAI 2019, etc.
Reviewer of FG 2016 2017 2018, WACV 2017, CVPR 2016 2017 2018 2019, ICCV 2017 2019, ECCV 2018, IJCAI 2018 2019, etc.
Reviewer of IEEE TPAMI; IEEE TIP; IEEE TCSVT; IEEE TMM; IEEE THMS; CVIU; TVCJ; Machine Vision and Application; Journal of Computer Science and Technology; etc.
联系客服