I am a Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA) and University of Chinese Academy of Sciences (UCAS), supervised by Prof. Jianhua Tao . My research interests lie in affective computing and deep learning, with a specific focus on multimodal learning and self-supervised learning. I have published several papers at the top international AI journals and conferences, such as Information Fusion, IEEE Trans. on Affective Computing, ACM MM, and ICASSP. I am also the winner of several international competitions in affective computing, such as MuSe and MEGC.

Feel free to reach out if you’re interested in my work and want to explore potential collaborations: x@y, where x = sunlicai2019 and y = ia.ac.cn.

I am actively seeking a postdoctoral research position as well. If you are interested in my experience or have relevant information, please feel free to contact me and let me know. Thanks very much!

📜 Research Area

Computer Vision :
   Facial Expression Recognition; Micro-Expression Recognition; Audio-Visual Emotion Recognition; Multimodal Large Language Model
Speech Signal Processing :
   Speech Emotion Recognition
Natural Language Processing :
   Multimodal Sentiment Analysis; Emotion Recognition in Conversation; Large Language Model
Self-supervised Learning :
   Contrastive Learning; Masked Data Modeling

🔥 News

  • 2024.03:  🎉🎉 HiCMAE and GPT-4v with Emotion are accepted by Information Fusion.
  • 2023.07:  🎉🎉 MAE-DFER is accepted by ACM MM 2023.
  • 2023.04:  🎉🎉 EMT-DLFR is accepted by IEEE Trans. on Affective Computing.

📝 Publications

* Equal contribution, # Corresponding author

INFFUS 2024
sym

HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition

Licai Sun, Zheng Lian, Bin Liu#, Jianhua Tao#

Information Fusion, 2024 |

  • HiCMAE introduces a novel hierarchical contrastive masked autoencoder for self-supervised audio-visual emotion recognition (AVER) and achieves SOTA performance on nine popular AVER datasets.
INFFUS 2024
sym

GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition

Zheng Lian, Licai Sun, Haiyang Sun, Kang Chen, Zhuofan Wen, Hao Gu, Bin Liu#, Jianhua Tao#

Information Fusion, 2024 |

  • This paper quantitatively evaluates the emotional intelligence of GPT-4V on 21 benchmark datasets covering 6 emotion recognition tasks.
ACM MM 2023
sym

MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition

Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao

ACM MM 2023 |
PWC
PWC
PWC

  • MAE-DFER presents an early attempt to leverage large-scale self-supervised pre-training for dynamic facial expression recognition (DFER) and demonstrates great success on six popular DFER datasets.
TAC 2023
sym

Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis

Licai Sun, Zheng Lian, Bin Liu#, Jianhua Tao#

IEEE Trans. on Affective Computing, 2023 |

  • EMT-DLFR aims to address the inefficiency in fusing unaligned multimodal sequences and the vulnerability to missing data in real-world scenarios to achieve efficient and robust multimodal sentiment analysis.

🎖 Honors and Awards

📖 Educations

  • 2019.09 - now, Ph.D. in Computer Applied Technology, Institute of Automation, Chinese Academy of Sciences and University of Chinese Academy Sciences, Beijing, China.

  • 2016.09 - 2019.06, M.Sc. in Computer Technology, University of Chinese Academy Sciences, Beijing, China.

  • 2012.09 - 2016.06, B.Eng. in Electronic and Information Technology, Beijing Forestry University, Beijing, China.

💬 Professional Services

  • Journal Reviewer: IEEE Trans. on Affective Computing, Speech Communication, Engineering Applications of Artificial Intelligence.
  • Conference Reviewer: ACM MM (2024, 2023), ICASSP (2022), InterSpeech (2020).
  • Program Committee: MER2023@ACM MM 2023 Grand Challenge and MRAC2023@ACM MM 2023 Workshop.