

Can AI gather financial information?
Can AI gather financial data reliably? I tested Gemini, ChatGPT, and Claude on pulling key metrics for major tech companies. All delivered usable results with solid tables—but with some caveats around reporting periods and market cap accuracy. Verdict: AI works well here, but still needs human oversight.
Apr 27


Can AI do my Geometry homework?
I tested Gemini, ChatGPT, and Claude on simple 5th-grade geometry worksheets. Claude delivered perfect, consistent results across both tests, while Gemini and ChatGPT stumbled on triangles. ChatGPT recovered after a retry, but Gemini doubled down on errors. The takeaway: AI can solve geometry, but reliability is still a real issue.
Apr 16


Can AI Replace a Controller?
I tested ChatGPT on a simple parent-subsidiary scenario. It looked confident—but failed key steps, double-counted equity, and broke the balance sheet. It eventually fixed itself after multiple prompts. Verdict: AI still can’t replace a controller.
Apr 9


Can AI do my math homework?
Can AI handle basic math? I tested ChatGPT, Gemini, and Claude on 5th-grade word problems. All three delivered perfect results: accurate answers, clear explanations, and zero errors. It was a simple test, though, and results can still vary.
Apr 7


Image AI Test: same prompt, different chats
Simple AI image test: ChatGPT, Claude, Grok, and Gemini were all given the exact same prompt with no optimization. The differences were striking, ranging from cartoonish interpretations to near-photorealistic scenes, and they revealed each model’s instincts, strengths, and blind spots right out of the box.
Apr 5