Humanoid Robotics · Physical AI · Machine Learning

Patrick
Franke Oviedo

Graduate student in robotics and machine learning at TUM, interested in the foundations of machine learning for physical AI.

Patrick Franke Oviedo
01

About

I'm a graduate student at the Technical University of Munich in robotics and machine learning, currently focused on the foundations behind physical AI and humanoid robots — vision-language-action models and world models.

I hold a Bachelor in Engineering Science from TUM, during which I had the privilege of several research and exchange stays abroad — at the Politecnico di Milano, the University of Zurich, and CentraleSupélec in Paris. Today I'm a robotics engineer with RoboTUM and help run partnerships at the European Student Robotics Association (ESRA).

I like work that spans the full arc — from the research idea to a system that actually runs on hardware. Past projects have ranged from on-board ML for a CubeSat to voice-driven motion planning on a Unitree G1 humanoid.

I'm always glad to talk about robotics, physical AI, and ML research. The fastest way to reach me is email.

Focus areas

Physical AI Humanoid Robotics Vision-Language-Action World Models Deep Learning Computer Vision

Toolbox

Python PyTorch ROS C++

Languages

German English Spanish French
02

Selected Projects

assets/projects/video-to-3d.png

P/01

Humanoid Video-to-3D

A feed-forward pipeline that turns short indoor phone videos into geometrically coherent 3D scenes with open-vocabulary semantic labels — no COLMAP or per-scene optimization. Built for a humanoid perception & spatial-AI challenge, it supports natural-language 3D queries in milliseconds.

PyTorchVGGTSAM 2.1OpenCLIPOpen3DRerun
assets/projects/g1.png

P/02

Unitree G1 Humanoid — RoboTUM

Team work at RoboTUM on the Unitree G1 humanoid robot, where I led the voice pipeline and motion planning — connecting spoken commands to planned, executable motion on the robot. Developed collaboratively with the team.

Humanoid RoboticsMotion PlanningVoice PipelinePythonROS
assets/projects/sleep.png

P/03

Wearable Sleep-Position Classifier

A non-invasive wearable that classifies sleep position from two accelerometers using machine learning — 100% accuracy across the four main position groups (supine, prone, left, right). End-to-end hardware: PSoC microcontroller, I²C sensors, and Bluetooth streaming.

PythonRandom ForestAccelerometersEmbeddedBluetooth