Vision-based kinematic structure learning of arbitrary articulated rigid objects