Advancing Artificial General Intelligence and Pushing the Boundaries of Human Civilization

Dr.Hao Sun (孙浩)

I am Hao Sun (孙浩), an innovative researcher specializing in Artificial Intelligence (AI), with expertise in multimodal learning, large-scale models, reinforcement learning, sentiment analysis. Passionate about advancing Artificial General Intelligence (AGI) and transformative technologies to push the boundaries of human knowledge and civilization.

Education & Experience:

  • 06.2025 - Now: Ritsumeikan University (Osaka, Japan). Senior Researcher
    • Invited by Yen-Wei Chen, Fellow of the Engineering Academy of Japan
    • Oversaw the research part of LLMs and reinforcement learning in the host laboratory
    • Led a research initiative focused on AGI utilizing LLMs, reinforcement leanring, and bionics
  • 08.2023 - 08.2024: Ritsumeikan University (Osaka&Otsu, Japan). Visiting Scholar
    • Invited by Yen-Wei Chen, Fellow of the Engineering Academy of Japan
    • Funded by the Zhejiang University PhD Academic Star Program (awarded to the top 100 graduate students)
    • Led a project on developing a unified multimodal and multitask framework with LLMs
    • Led a project on enabling LLMs with multimodal processing capabilities through parameter-efficient fine-tuning
    • Published findings at IEEE Transactions on Affective Computing, Pattern Recognition, etc
  • 09.2020 - 06.2025: Zhejiang University (Hangzhou, China). Ph.D in Computer Science and Technology
    • Awarded the Outstanding Graduate Honor, selected as one of the top 10% of graduates for academic achievement
  • 09.2016 - 06.2020: Harbin Institute of Technology (Weihai, China). B.E in Software Engineering
    • Awarded the Outstanding Graduate Honor, granted to the top 8% of students

For more details on my research and publications, please visit my Google Scholar page or ORCID page.

Recent Publications & Patents

Here are some of my recently published academic papers, covering topics such as multimodal learning, large-scale models, and sentiment analysis. For a complete list of my publications (20+) and further research details, please visit my Google Scholar page.

Papers as first or corresponding author:

  • One Framework to Rule Them All: Unifying Multimodal Tasks with LLM Neural-Tuning. Pattern Recognition, 2025. (IF: 8.5)
  • Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning. IEEE Transactions on Affective Computing, 2025. (IF: 13.9)
  • Modality-invariant temporal representation learning for multimodal sentiment classification. Information Fusion, 2023. (IF: 18.1)
  • Tensorformer: A Tensor-Based Multimodal Transformer for Multimodal Sentiment Analysis and Depression Detection. IEEE Transactions on Affective Computing, 2023. (IF: 13.9)
  • Multi-Modal Adaptive Fusion Transformer Network for the Estimation of Depression Level. Sensors, 2021. (IF: 3.7)
  • Cubemlp: An MLP-Based Model for Multimodal Sentiment Analysis and Depression Estimation, ACM Multimedia,2022.

Papers as second or third author:

  • IRLSG: Invariant Representation Learning for Single-Domain Generalization in Medical Image Segmentation, ICASSP 2024.
  • CoSTHR: A Heart Rate Estimating Network with Adaptive Color Space Transformation. IEEE Transactions on Instrumentation and Measurement, 2022. (IF: 5.6)
  • MCKD: Mutually Collaborative Knowledge Distillation for Federated Domain Adaptation and Generalization. ICASSP, 2023
  • LGA: A Language Guide Adapter for Advancing the SAM Model’s Capabilities in Medical Image Segmentation. Springer MICCAI, 2024.
  • Enhancing Deep Learning-based Depression Level Estimation Based on Multi-task Learning. ACM International Conference on Machine Learning and Machine Intelligence, 2024.

Recent Patents:

  • Engineering Progress Determining Method and Device Based on Multi-Mode Time Sequence Information Fusion. App. No: CN202310788030.2/US20250005475A1. Country: CN/US. Publication Date: 2023-07. Third Inventor
  • Rheumatoid Arthritis Activity Grading Device Based on Multimodal Data. App. No:CN202310755346.1. Country: CN. Publication Date: 2023-06. Fourth Inventor
  • A Single-Domain Generalization Method for Medical Image Segmentation. App. No: CN116596832A. Country: CN. Publication Date: 2023-02. Fifth Inventor.

For 30+ independent projects, please visit my Github Page.
From 2020 to 2021, I am responsible for publishing TensorFlow tutorials on IMOOC.

Projects

I have been actively involved in several exciting research projects, contributing to advancements in areas such as mutimodal learning and real-time monitoring. Here are some of my recent participated projects.

  • 2022 - 2025: Intelligent Integrated Analysis Platform Construction for Rheumatoid Arthritis (RA)
    • Funded by National Key R&D Program Project, 2022YFC2504605, Ministry of Science and Technology, China
    • Aimed to develop an AI-driven platform for integrated analysis to enhance diagnosis and treatment of RA
    • Built a new approach that integrates multimodal clinical data and optimizes diagnosis accuracy by 10%
    • Accountable for the development, implementation, and validation of multimodal methodologies
  • 2022 - 2024: Preoperative Early Recurrence Detection and Prediction of HCC Based on Federated Learning
    • Funded by Zhejiang Provincial Natural Science Foundation Key Project, LZ22F020012
    • Aim to develop a federated learning solution for early preoperative HCC recurrence prediction with privacy
    • Achieved +13% accuracy in recurrence prediction while ensuring data privacy through federated learning
    • Responsible for project proposals, multimodal methodologies, and final validation
  • 2022 - 2024: Research on Key Technologies for Smart Construction Site Management Platform Based on CV
    • Funded by Hangzhou New Zhongda Technology Co., Ltd., 2022AIZD0147-02
    • Aimed to develop a smart construction site monitor and management platform to improve safety and efficiency
    • Successfully built a real-time monitoring platform that reduced violations by 15% and simplified management
    • Responsible for project proposals, methodology, acceptance, and project management

Honors & Awards

The following honors and awards recognize my academic excellence, research achievements, and leadership contributions throughout my graduate and professional journey.

  • Excellent Postgraduate Students' Award (06.2025, by Zhejiang University)
    • Awarded to the top 10% of outstanding doctoral students in recognition of their academic excellence
  • Award of Honor for Graduate (Four times) (12.2024 / 12.2023 / 12.2022 / 12.2021, by Zhejiang University)
    • Awarded annually to the top 15% of outstanding doctoral students in recognition of their excellence
  • Zhejiang University Academic Scholarship (12.2022, by Zhejiang University)
    • Awarded to support research by outstanding doctoral students
  • Outstanding Graduate Leader Award (Twice) (12.2024 / 12.2023, by Zhejiang University)
    • Recognizes exceptional graduate students who demonstrate outstanding leadership to their field or community
  • Graduate with Merit A Performanced (12.2023, by Zhejiang University)
    • Awarded to graduates demonstrated exceptional academic performance and active participation in social activities.
  • HUAWEI Scholarship (12.2023, by Zhejiang University)
    • Awarded to exceptional students in computer science and AI for academic excellence and research innovatione
  • Outstanding Undergraduate Award (06.2020, by Harbin Institute of Technologyy)
    • Awarded to the top 15% of outstanding under-graduate students in recognition of their excellence
  • National Aspirational Scholarsh (12.2018, by Harbin Institute of Technologyy)
    • Awarded to the top 5% of outstanding undergraduate students in recognition of their excellence

Skills

This section outlines my core academic and engineering skills, including research design, algorithm development, large-scale model training, and practical system implementation, with a strong focus on AI, large language models (LLMs), and multimodal learning

  • Academic-Specific: Scholarly Writing, Publication, Peer Review, Conference Presentation, Grant Proposal
  • AI Research: Algorithm Development, Model Training & Finetuning, Data Processing, Evaluation
  • LLM-Specific: Model Customization, Knowledge Integration, Multimodal Tuning, Scalability and Efficiency/li>
  • Multimodal Research: Framework Design, Task Adaptation, Multimodal System Deployment
  • Software Engineering: Feasibility and Requirements Analysis, System and Detailed Design, Implementation and Software Maintenance, etc
  • Programming Languages: Python, PyTorch, Numpy, TensorFlow, Java, C++, C, HTML, GO, etc
  • Full-stack Development: FrontEnd Programming, BackEnd Design and implementation, Database Systems
  • Language: Mandarin, English, Japanese