- 多模态学习综述 (MultiModal Learning) - 知乎
模态(modal)是事情经历和发生的方式,我们生活在一个由多种模态(Multimodal)信息构成的世界,包括视觉信息、听觉信息、文本信息、嗅觉信息等等,当研究的问题或者数据集包含多种这样的模态信息时我们称之为多模态问题,研究多模态问题是推动人工智能
- Multimodal learning - Wikipedia
Multimodal learning is a type of deep learning that integrates and processes multiple types of data, referred to as modalities, such as text, audio, images, or video
- MULTIMODAL中文 (简体)翻译:剑桥词典 - Cambridge Dictionary
A multimodal agent may do this in multiple ways: through speech and intonation, facial expression and gaze, gesture, body movements and posture
- MULTIMODAL Definition Meaning - Merriam-Webster
The meaning of MULTIMODAL is having or involving several modes, modalities, or maxima How to use multimodal in a sentence
- 多模态学习(Multimodal Learning)简介及其子任务、模型、数据集 | 学习数据 (Datalearner)
多模态学习 Multimodal Learning 多模态学习试图对不同模态的数据组合进行建模,这在现实世界的应用中经常出现。 联合数据的一个例子是将文本(通常表示为离散的字数向量)与由像素强度和注释标签组成的成像数据相结合。
- 推荐一个最近刚出的比较全面的多模态综述:Multimodal Deep Learning-CSDN博客
多模态综述:Multimodal Deep Learning。 对多模态、CV 和 NLP 领域中一些任务的 数据集、模型、评价指标等等 都做了较详细的介绍和总结。
- What is multimodal AI? - IBM
What is multimodal AI? Multimodal AI refers to machine learning models capable of processing and integrating information from multiple modalities or types of data These modalities can include text, images, audio, video and other forms of sensory input
- Multimodality - Wikipedia
Recipes delivered through any medium, whether that be a cookbook or a blog, can be considered multimodal because of the "interaction between body, experience, knowledge, and memory, multimodal literacies" that all relate to one another to create our understanding of the recipe
|