Method for applying reinforcement learning to motion planning and control of under-actuated underwater vehicle in unknown non-uniform sea flow | IEEE Conference Publication | IEEE Xplore