Gemini 1.5 Pro can see, hear & read, yet it still sucks at OCR
TL;DR: I tested all the major vision-capable models, and none of them can tell where text appears on the screen. Here are my notes.
Apr 15, 2024 · 7 min read
