Meng Song

I completed my PhD at UC San Diego, where I was advised by Prof. Manmohan Chandraker.

Prior to that, I obtained my Master's degree at the Robotics Institute, Carnegie Mellon University, working with Prof. Abhinav Gupta and Dr. Daniel Huber.

Email / CV / Scholar / Twitter / Github / LinkedIn

Research

My research is motivated by the goal of developing a mathematical construct of the intelligent agent from first principles. My recent work has primarily focused on answering the question "What is a good representation of states and goals in decision-making problems?" I explored this problem under three different learning paradigms: reinforcement learning, imitation learning, and unsupervised learning.

	Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better Meng Song Out-of-Distribution Generalization in Robotics Workshop at The Robotics: Science and Systems (RSS), 2025 (Spotlight Presentation) open review / code PPO and BC generalize differently in visual navigation: BC imitates successful trajectories, while PPO combinatorially stitches together past experiences, including failures, to solve new tasks and achieve stronger generalization.
	Towards Unsupervised Goal Discovery: Learning Plannable Representations with Probabilistic World Modeling Meng Song PhD Thesis, 2024 escholarship Learning through interaction is a foundational principle in both human and animal learning. In a broad sense, intelligent agents can be formulated as goal-directed systems interacting with an uncertain environment. Despite the generality of this definition, a key challenge in computationally grounding it lies in how to effectively set up and represent goals and purposes. This dissertation explores this question through the lens of various machine learning paradigms.
	Probabilistic World Modeling with Asymmetric Distance Measure Meng Song Geometry-grounded Representation Learning and Generative Modeling Workshop at International Conference on Machine Learning (ICML), 2024 (Oral Presentation) arXiv A novel probabilistic world model trained with contrastive learning. The learned latent space enables subgoal discovery, asymmetric transition modeling, and supports highly efficient planning without requiring any inference-time search.
	A Minimalist Prompt for Zero-Shot Policy Learning Meng Song, Xuezhi Wang, Tanay Biradar, Yao Qin, Manmohan Chandraker Task Specification Workshop at The Robotics: Science and Systems (RSS), 2024 arXiv / code A novel prompting method that enables interpretable zero-shot generalization in unseen robotics tasks without requiring demonstrations and surpasses few-shot baselines.
	RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning Mingkai Deng, Jianyu Wang, Cheng-Ping Hsieh, Yihan Wang, Han Guo, Tianmin Shu, Meng Song, Eric P. Xing, Zhiting Hu Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 arXiv / blog RL-based prompt optimization approach outperforms a wide range of finetuning and prompting baselines on text classification and style transfer tasks.
	Learning to Rearrange with Physics-Inspired Risk Awareness Meng Song, Yuhan Liu, Zhengqin Li, Manmohan Chandraker Conference on Risk Aware Decision Making Workshop at The Robotics: Science and Systems (RSS), 2022 arXiv / code / project page A PPO agent learns physical concepts such as mass and friction by actively interacting with the environment and mastering everyday skills, instead of passively observing physical processes.
	OpenRooms: An End-to-End Open Framework for Photorealistic Indoor Scene Datasets Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, Yuhan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Zexiang Xu, Hong-Xing Yu, Kalyan Sunkavalli, Miloš Hašan, Ravi Ramamoorthi, Manmohan Chandraker Conference on Computer Vision and Pattern Recognition (CVPR), 2021 (Oral Presentation) arXiv / dataset / project page A novel framework for creating large-scale photorealistic datasets of indoor scenes, enabling broad applications in inverse rendering, scene understanding, and robotics.
	S4G: Amodal Single-view Single-Shot SE(3) Grasp Detection in Cluttered Scenes Yuzhe Qin, Rui Chen, Hao Zhu, Meng Song, Jing Xu, Hao Su Conference on Robot Learning (CoRL), 2019 (Spotlight Presentation) arXiv A novel 6-DoF grasp detection method using a single-shot grasp proposal network trained on automatically generated synthetic data, significantly outperforming state-of-the-art methods.
	Automatic Recovery of Networks of Thin Structures Meng Song, Daniel Huber International Conference on 3D Vision (3DV), 2015 (Oral Presentation) paper A novel geometric approach that automatically segments and parses a complex network of thin structures from low-quality 3D point clouds.
	Natural Feature based Localization in Forested Environments Meng Song, Fengchi Sun, Karl Iagnemma International Conference on Intelligent Robots and Systems (IROS), 2012 (Oral Presentation) paper A novel geometric approach for extracting and modeling tree trunk landmarks from noisy 3D point clouds of cluttered forested environments.
	Natural Landmark Extraction in Cluttered Forested Environments Meng Song, Fengchi Sun, Karl Iagnemma International Conference on Robotics and Automation (ICRA), 2012 (Oral Presentation) paper A novel geometric approach for extracting and modeling tree trunk landmarks from noisy 3D point clouds of cluttered forested environments.

Thoughts and Talks

Probabilistic World Modeling with Asymmetric Distance Measure, GRaM Workshop @ ICML, 2024 [slides] [poster]
Learning to Rearrange with Physics-Inspired Risk Awareness, RADM Workshop @ RSS, 2022 [slides]
Physics-Aware Robot Learning, Thesis Proposal, 2021 [slides]
Finding Structure in Deep Reinforcement Learning, Research Exam, 2019 [slides]
One-shot Logo Detection in the Wild, WiML Workshop @ NeurIPS, 2018 [poster]
Bridging Computational Complexity and Machine Learning, CSE 200: Computability and Complexity, 2018 [paper]
The Syntactic Mechanism Behind Image Captioning, Research Project with Xinlei Chen, 2015 [paper]

Teaching

CSE 251-U: Unsupervised Learning UCSD, Teaching Assistant, Winter 2024
DSC 250: Advanced Data Mining UCSD, Teaching Assistant, Fall 2023
CSE 291-F: Unsupervised Learning UCSD, Teaching Assistant, Spring 2023
CSE 203-B: Convex Optimization UCSD, Teaching Assistant, Winter 2023
CSE 152-A: Introduction to Computer Vision UCSD, Teaching Assistant, Fall 2022
CSE 252-D: Advanced Computer Vision UCSD, Teaching Assistant, Spring 2022
CSE 203-B: Convex Optimization UCSD, Teaching Assistant, Winter 2022
DSC 190: Machine Learning with Few Labels UCSD, Teaching Assistant, Fall 2021
CSE 8A: Introduction to Programming 1 UCSD, Teaching Assistant, Spring 2021
CSE 291-C: Probabilistic Approaches to Unsupervised Learning UCSD, Teaching Assistant, Fall 2020
CSE 291-F: Machine Learning Meets Geometry Data UCSD, Teaching Assistant, Winter 2020
CSE 291-I: Machine Learning on 3D Data UCSD, Teaching Assistant, Winter 2019
CSE 152-A: Introduction to Computer Vision UCSD, Teaching Assistant, Fall 2018

Academic Service

Reviewer: ICLR 2025, RSS Workshop 2024, NeurIPS Workshop 2018
Research Mentoring: Yuhan Liu (UCSD MS → Rutgers University PhD)
Student Committee Member: UCSD CSE Faculty Hiring Committee, 2017

Template stolen from Jon Barron.