fromInfoQ2 months agoMistral AI Launches API for LLM-Based OCR of Multimodal DocumentsMistral OCR aims to digitize complex documents by interleaving text, images, and tables, suitable for scientific research and historical artifacts.Marketing tech
Artificial intelligencefromTechzine Global3 months agoMicrosoft launches Phi models optimized for multimodal processingMicrosoft expands its Phi language model line with Phi-4-mini and Phi-4-multimodal for improved multimodal processing and hardware efficiency.
Artificial intelligencefromInfoQ2 months agoGoogle Introduces Gemini 2.5 Pro with Improved Reasoning and Coding CapabilitiesGemini 2.5 Pro enhances AI reasoning and coding capabilities, achieving top scores in multiple benchmarks despite some integration issues.
Artificial intelligencefromTechzine Global3 months agoMicrosoft launches Phi models optimized for multimodal processingMicrosoft expands its Phi language model line with Phi-4-mini and Phi-4-multimodal for improved multimodal processing and hardware efficiency.
Artificial intelligencefromInfoQ2 months agoGoogle Introduces Gemini 2.5 Pro with Improved Reasoning and Coding CapabilitiesGemini 2.5 Pro enhances AI reasoning and coding capabilities, achieving top scores in multiple benchmarks despite some integration issues.