Testing ChatGPT | The Tester Ai

Can AI gather financial information?

Can AI gather financial data reliably? I tested Gemini, ChatGPT, and Claude on pulling key metrics for major tech companies. All delivered usable results with solid tables—but with some caveats around reporting periods and market cap accuracy. Verdict: AI works well here, but still needs human oversight.

Work related AI tests

Apr 27

Can AI do my Geometry homework?

I tested Gemini, ChatGPT, and Claude on simple 5th-grade geometry worksheets. Claude delivered perfect, consistent results across both tests, while Gemini and ChatGPT stumbled on triangles. ChatGPT recovered after a retry, but Gemini doubled down on errors. The takeaway: AI can solve geometry, but reliability is still a real issue.

Home related AI tests

Apr 16

Can AI Replace a Controller?

Can AI replace a controller? I tested ChatGPT on a simple parent-subsidiary scenario. It looked confident—but failed key steps, double-counted equity, and broke the balance sheet. It eventually fixed itself after multiple prompts. Verdict: AI still can’t replace a controller.

Work related AI tests

Apr 9

Can AI do your math homework? I guess it can!

Can AI do my math homework?

Can AI handle basic math? I tested ChatGPT, Gemini, and Claude on 5th-grade word problems. All three delivered perfect results: accurate answers, clear explanations, and zero errors. It was a simple test, though, and results can still vary.

Home related AI tests

Apr 7

Unicorns in a pasture with San Francisco in the background: four AI chat depictions of the same prompt, from Gemini, Grok, Claude, and ChatGPT.

Image AI Test: same prompt, different chats

Simple AI image test: ChatGPT, Claude, Grok, and Gemini were all given the exact same prompt with no optimization. The differences were striking, ranging from cartoonish interpretations to near-photorealistic scenes, and they revealed each model’s instincts, strengths, and blind spots right out of the box.

Home related AI tests

Apr 5