Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes
CoRR(2024)
摘要
When enough annotated training data is available, supervised deep-learning
algorithms excel at estimating human body pose and shape using a single camera.
The effects of too little such data being available can be mitigated by using
other information sources, such as databases of body shapes, to learn priors.
Unfortunately, such sources are not always available either. We show that, in
such cases, easy-to-obtain unannotated videos can be used instead to provide
the required supervisory signals. Given a trained model using too little
annotated data, we compute poses in consecutive frames along with the optical
flow between them. We then enforce consistency between the image optical flow
and the one that can be inferred from the change in pose from one frame to the
next. This provides enough additional supervision to effectively refine the
network weights and to perform on par with methods trained using far more
annotated data.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要