2 models available
Amazon's multimodal model designed to balance "accuracy, speed, and cost for a wide range of tasks." As of December 2024, it demonstrates state-of-the-art performance on visual question answering (Tex...