Gå direkt till innehållet
Multimodal Artificial Intelligence and Large Language Models
Spara

Multimodal Artificial Intelligence and Large Language Models

inbunden, 2026
Engelska

The book provides a comprehensive technical analysis of multimodal artificial intelligence systems and implementation frameworks. It offers thorough coverage of cross-modal processing methods for use, including speech recognition and automatic image captioning.

  • It presents a detailed discussion of architecture for integrating text, image, audio, and video modalities, cross-modal processing pipelines, and data fusion techniques.
  • Showcases real-time synchronization mechanisms across different modalities and scalable design patterns for multimodal systems.
  • Discusses multimodal emotion recognition using deep Learning techniques, focusing on recent advancements, challenges, and ethical considerations.
  • Investigates deployment optimization strategies to address issues with latency, resource usage, and scalability of multimodal systems.
  • Focuses on techniques for performance optimization, memory management, and distributed processing for multimodal workloads using frameworks like PyTorch and TensorFlow.

The text is primarily written for senior undergraduates, graduate students, and academic researchers in electrical engineering, electronics and communications engineering, computer science and engineering, and information technology.

Undertitel
A Comprehensive Guide from Theory to Practice
ISBN
9781041152132
Språk
Engelska
Vikt
446 gram
Utgivningsdatum
29.9.2026
Sidor
376