Multimodal AI is suddenly everywhere, with just about every major player in the space talking about it. But what actually is it?
, and Microsoft all using the term to sell people on new AI models and services in just the past few weeks. But what is “multimodal,” and what does it mean?
“It is very safe to assume that future communication between human and machine will also be multimodal,” says Jina AI’s CEO Han Xiao in anIt’s safe to assume, indeed, as that’s precisely how other AI companies say they are approaching the technology right now. on responsible multimodaL AI development published last year. “As human perception and problem-solving in the physical world leverage multiple modalities, such multimodal systems provide an even more natural and seamless support than those operating across a single modality.”, even still, this comes up a bit short of a true multimodal AI system, as contemporary approaches still rely on some form of model fusion to handle different types of inputs and outputs.
“There is no doubt that any chief digital transformation officers or chief AI officers worth their salt will be aware of multimodal AI and are going to be thinking very carefully about what it can do for them,” says Henry Ajder, founder of Latent Space.
United Kingdom Latest News, United Kingdom Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Why ‘Multimodal AI’ Is the Hottest Thing in Tech Right NowThere's a new race in technology to make AI see and hear the world around you, and ultimately make sense of it for you.
Read more »
Astra Is Google's ‘Multimodal’ Answer to the New ChatGPTGoogle's new voice-operated AI assistant, called Astra, can make sense of what your phone's camera sees. It was announced one day after OpenAI revealed a similar vision for ChatGPT.
Read more »
Google Lens now supports video search and multimodal AIGoogle’s Lens tool is meant to make it easier to search the web with images, and now the tool supports uploading video and audio to get better results.
Read more »
OpenAI could debut a multimodal AI digital assistant soonAccording to The Information, OpenAI is working on a new model that can understand speech intonation and can use vision to help tutor students or offer information.
Read more »
Advancements in Memes Analysis: Scene Graphs and Multimodal ApproachesExplore the cutting-edge techniques in memes analysis with a focus on scene graphs, knowledge integration, and multimodal approaches.
Read more »
The Ray-Ban Meta Smart Glasses have multimodal AI nowMultimodal AI is now available for all Ray-Ban Meta Smart Glasses users. While finicky, it’s a more natural form factor for this tech.
Read more »