Multimodal Generative AI

Akansha Singh (editor), Krishna Kant Singh (editor)

Hardback (31 Mar 2025)

Save $6.42

RRP $245.15
$238.73

In Stock

Includes delivery to the United States

10+ copies available online - Usually dispatched within 7 days

Publisher's Synopsis

This book stands at the forefront of AI research, offering a comprehensive examination of multimodal generative technologies. Readers are taken on a journey through the evolution of generative models, from early neural networks to contemporary marvels like GANs and VAEs, and their transformative application in synthesizing realistic images and videos. In parallel, the text delves into the intricacies of language models, with a particular focus on revolutionary transformer-based designs. A core highlight of this work is its detailed discourse on integrating visual and textual models, laying out state-of-the-art techniques for creating cohesive, multimodal AI systems. "Multimodal Generative AI" is more than a mere academic text; it's a visionary piece that speculates on the future of AI, weaving through case studies in autonomous systems, content creation, and human-computer interaction. The book also fosters a dialogue on responsible innovation in this dynamic field. Tailored for postgraduates, researchers, and professionals, this book is a must-read for anyone invested in the future of AI. It empowers its readers with the knowledge to harness the potential of multimodal systems in solving complex problems, merging visual understanding with linguistic prowess. This book can be used as a reference for postgraduates and researchers in related areas.

ISBN:	9789819623549
Publisher:	Springer Nature Singapore
Imprint:	Springer
Pub date:	31 Mar 2025
DEWEY:	006.3
DEWEY edition:	23
Language:	English
Number of pages:	382
Height:	235mm
Width:	155mm