I am Hao Sun (孙浩), a researcher specializing in Artificial Intelligence (AI), with expertise in multimodal learning, large language models (LLMs), vision-language-action (VLA) models, embodied AI, reinforcement learning, and affective computing. I am passionate about advancing Artificial General Intelligence (AGI) and transformative technologies that push the boundaries of human knowledge and civilization. My work has been published in top-tier venues including ACL, ACM Multimedia, Information Fusion, IEEE Transactions on Affective Computing, and Pattern Recognition, and has accumulated 500+ citations alongside several patents.
Education & Experience:
For more details on my research and publications, please visit my Google Scholar page or ORCID page.
Here are some of my recently published academic papers, covering topics such as multimodal learning, large language models, vision-language-action (VLA) agents, embodied AI, and affective computing. My work has received 500+ citations to date, and my full publication list (20+ publications) is available on my Google Scholar page.
Papers as first or corresponding author:
Preprints in submission (first author):
Papers as co-first, second or third author:
Recent Patents:
Software Copyrights:
For my 30+ independent projects, please visit my GitHub page. From 2020 to 2021, I was responsible for publishing TensorFlow tutorials on IMOOC.
The following selected invited talks share my research vision on embodied intelligence, multimodal learning, and AGI with international scientific communities.
I have been actively involved in several research projects, contributing to advances in areas such as multimodal learning and real-time monitoring. Here are some of the projects I have recently participated in.
I actively contribute to the research community as a guest editor and reviewer for leading journals and conferences in AI, multimodal learning, and affective computing.
The following honors and awards recognize my academic excellence, research achievements, and leadership contributions throughout my graduate and professional journey.
This section outlines my core academic and engineering skills, including research design, algorithm development, large-scale model training, and practical system implementation, with a strong focus on AI, large language models (LLMs), multimodal learning, and embodied AI.