Just looking at the pictures and graphs in that paper is enough to be amazed by what they're achieving. The example where they show three pictures of an old monitor plug being connected to an iPhone to recharge it, and GPT-4 is asked what's funny about it and answers incredibly accurately, is amazing.
Since we don't have access to this feature, let's be skeptical: asking "what's funny here" feels like "leading the witness."
Also, if the image comes from a forum or sub dedicated to funny images, could that give it away?
Running several different prompts would be a stronger test, for example: "What's going on in this picture?", "What would a person think seeing this image?", etc.
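To make the multi-prompt idea concrete, here is a rough sketch of how such a check could be set up. Everything in it is an assumption for illustration: `ask_model` is a hypothetical callable standing in for whatever image-capable API eventually becomes available, `fake_model` is a dummy that only "sees" humor when prompted for it, and the keyword check is a crude placeholder for actually reading the answers.

```python
from typing import Callable, Dict, Sequence

# One leading question plus the two neutral prompts suggested above.
PROMPTS = (
    "What's funny about this image?",
    "What's going on in this picture?",
    "What would a person think seeing this image?",
)

# Crude, assumed keyword list for spotting humor in an answer.
HUMOR_WORDS = ("funny", "humor", "joke", "absurd")


def run_prompts(ask_model: Callable[[str, str], str], image: str,
                prompts: Sequence[str] = PROMPTS) -> Dict[str, str]:
    """Ask every prompt about the same image and collect the answers."""
    return {prompt: ask_model(image, prompt) for prompt in prompts}


def mentions_humor(answer: str) -> bool:
    """Return True if the answer contains any of the humor keywords."""
    lowered = answer.lower()
    return any(word in lowered for word in HUMOR_WORDS)


def fake_model(image: str, prompt: str) -> str:
    """Hypothetical stand-in: only 'sees' humor when the prompt asks for it."""
    if "funny" in prompt.lower():
        return "It is funny because an old connector is plugged into a phone."
    return "A cable is plugged into a phone."


if __name__ == "__main__":
    # "image.jpg" is a placeholder path; the fake model never opens it.
    answers = run_prompts(fake_model, "image.jpg")
    for prompt, answer in answers.items():
        print(f"{mentions_humor(answer)!s:5}  {prompt}")
```

If a real model brought up the joke only under the first prompt, that would support the "leading the witness" worry; if it brought it up under the neutral prompts too, the result would be more convincing.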
GPT-4 is cool as a numbers box, but this is not logical reasoning, and without published papers it hasn't been proven to be, either.