AI Shopping Showdown 🥊
Same question. Four AIs. Very different answers.
We ran identical, specific shopping queries through ChatGPT (GPT-4o), Google Gemini, Perplexity, and Claude. No cherry-picking — these are real results that reveal how each AI thinks about shopping differently.
Test 1: Best Robot Vacuum Under $400
The Prompt:
"What's the best robot vacuum under $400 for a 1,500 sq ft home with two dogs and hardwood floors throughout? I want strong suction, self-emptying, and good app control."
ChatGPT (GPT-4o)
Top Pick: iRobot Roomba j7+ (~$380)
Reasoning: Emphasized obstacle avoidance (important with pets — avoids dog toys and "accidents"). Called out PrecisionVision navigation and Clean Base self-emptying. Mentioned the J7+ specifically learns your home layout over time.
Weaknesses cited: Higher price point, Clean Base bags are ongoing consumable cost.
Google Gemini
Top Pick: Roborock Q7 Max+ (~$360)
Reasoning: Focused on suction power (4,200Pa — notably higher than competitors). Highlighted that it both vacuums and mops. Mentioned LiDAR navigation for efficient mapping.
Weaknesses cited: Auto-empty dock is loud. Mopping on hardwood requires care.
Perplexity
Top Pick: Ecovacs Deebot T20 Omni (~$390)
Reasoning: Provided sourced recommendations from Tom's Guide, RTINGS, and Wirecutter. Cited hot-water mop washing as a differentiator. Discussed auto-detection of carpet vs. hardwood. Included links to actual reviews.
Weaknesses cited: Large dock footprint. App can be glitchy.
Claude
Top Pick: Roborock Q Revo (~$370)
Reasoning: Focused on practical value — cost-per-year analysis including replacement parts and consumables. Highlighted that Roborock has better long-term software support than several competitors. Suggested checking refurbished options.
Weaknesses cited: Not as well-known as iRobot brand. Fewer third-party accessories.
The Verdict
| Criteria | ChatGPT | Gemini | Perplexity | Claude |
|---|---|---|---|---|
| Product recommended | Roomba j7+ | Roborock Q7 Max+ | Ecovacs T20 Omni | Roborock Q Revo |
| Addressed pet-specific needs | ✅ Strong | ⚠️ Partial | ✅ Strong | ⚠️ Partial |
| Price accuracy | ⚠️ ~Approximate | ✅ Close | ✅ Linked sources | ⚠️ ~Approximate |
| Sources cited | ❌ None | ❌ None | ✅ Multiple | ❌ None |
| Mentioned ongoing costs | ⚠️ Brief | ❌ No | ⚠️ Brief | ✅ Detailed |
| Bias indicator | Favors popular brand | Favors specs | Follows review consensus | Favors value analysis |
Takeaway: All four gave defensible recommendations, but each had a different "lens." ChatGPT prioritized the safest, most recognized brand. Gemini chased raw specs. Perplexity followed expert consensus with receipts. Claude looked at total cost of ownership.
Test 2: Laptop for a College Student
The Prompt:
"Best laptop for a college freshman studying computer science. Budget: $800-$1,100. Needs to last 4 years. Must handle coding IDEs, some Docker containers, and occasional video calls. Portability matters — they'll carry it to class daily."
ChatGPT
Top Pick: MacBook Air M3 (16GB, $1,099)
Runner-up: Lenovo ThinkPad E14 Gen 5 (~$850)
Angle: Heavily weighted battery life and build quality for 4-year longevity. Made the Mac vs Linux dev tooling argument. Mentioned Apple's education pricing.
Gemini
Top Pick: ASUS Zenbook 14 OLED (~$900)
Runner-up: Lenovo IdeaPad Pro 5i (~$850)
Angle: Focused on display quality (OLED for long coding sessions — easier on eyes). Pushed AMD processor options for better multi-core Docker performance per dollar.
Perplexity
Top Pick: MacBook Air M3 (16GB, $1,099)
Runner-up: Framework Laptop 13 (~$1,050)
Angle: Cited RTINGS, Notebookcheck, and student surveys. Uniquely recommended Framework as a CS-student-friendly repairable option. Discussed RAM upgrade paths.
Claude
Top Pick: Lenovo ThinkPad T14 Gen 4 (~$950)
Runner-up: MacBook Air M3 (16GB, $1,099)
Angle: Led with the repairability and Linux compatibility argument. Detailed Docker performance testing considerations. Warned about 8GB vs 16GB RAM as a 4-year regret factor. Suggested buying from Lenovo's education store for discounts.
The Verdict
| Criteria | ChatGPT | Gemini | Perplexity | Claude |
|---|---|---|---|---|
| Addressed CS-specific needs | ✅ Strong | ⚠️ Moderate | ✅ Strong | ✅ Strong |
| 4-year longevity analysis | ✅ Yes | ⚠️ Partial | ⚠️ Partial | ✅ Detailed |
| Pricing accuracy | ⚠️ Close | ⚠️ Close | ✅ Linked | ⚠️ Close |
| Mentioned education discounts | ✅ Yes | ❌ No | ❌ No | ✅ Yes |
| Gave a non-obvious option | ❌ No | ⚠️ Somewhat | ✅ Framework | ✅ ThinkPad |
Takeaway: ChatGPT and Perplexity converged on the MacBook Air M3 — the "safe pick" dominating 2024-2025 laptop recommendations. Gemini went for OLED display quality. Claude went contrarian with ThinkPad for repairability and Linux — arguably the most CS-student-aware answer.
Test 3: Running Shoes for Heavy Runner
The Prompt:
"I'm a 210lb male getting back into running. Max 15 miles/week to start. I overpronate slightly. Budget $120-$160. Last pair was Brooks Ghost 14 — they were fine but my ankles felt unsupported."
ChatGPT
Top Pick: Brooks Adrenaline GTS 24 (~$140)
Reasoning: Directly addressed stability for overpronation. GuideRails technology for ankle support. Noted it's the "stability sibling" of the Ghost line — same fit, more support.
Gemini
Top Pick: ASICS Gel-Kayano 30 (~$160)
Reasoning: Emphasized 4D Guidance System and FF Blast+ cushioning for heavier runners. Mentioned heel drop and stack height specifics. Included gel technology explanation.
Perplexity
Top Pick: New Balance Fresh Foam X 860v14 (~$140)
Reasoning: Cited Runner's World and Believe in the Run reviews. Highlighted medial post for overpronation control. Mentioned width options (important for larger runners). Included a link to a gait analysis guide.
Claude
Top Pick: Brooks Adrenaline GTS 24 (~$140)
Runner-up: Saucony Guide 17 (~$140)
Reasoning: Started from the user's specific Ghost 14 experience and diagnosed what was missing (GuideRails vs neutral). Recommended getting a gait analysis at a running store before buying online. Warned that overpronation can change as running fitness improves.
The Verdict
| Criteria | ChatGPT | Gemini | Perplexity | Claude |
|---|---|---|---|---|
| Addressed overpronation | ✅ Directly | ✅ Directly | ✅ Directly | ✅ Directly |
| Used the Ghost 14 context | ✅ Yes | ❌ No | ❌ No | ✅ Built on it |
| Weight-specific advice | ⚠️ Minimal | ✅ Good | ⚠️ Mentioned | ⚠️ Mentioned |
| Price within budget | ✅ $140 | ⚠️ $160 (top of range) | ✅ $140 | ✅ $140 |
| Actionable next step | ❌ No | ❌ No | ✅ Gait guide link | ✅ In-store gait analysis |
Takeaway: ChatGPT and Claude both nailed the Brooks-to-Brooks upgrade path. Gemini pushed the premium ASICS option (at the top of budget). Perplexity went New Balance with receipts. Claude gave the most contextually aware response by building on the user's existing experience.
Test 4: Best Coffee Setup Under $250
The Prompt:
"I want to make great coffee at home. Total budget: $250 for everything (grinder, brewer, accessories). I drink 2 cups/day, black, and prefer a clean, bright flavor. I'm currently using a drip machine with pre-ground coffee."
ChatGPT
Recommended Setup:
- Baratza Encore ESP grinder (~$170)
- Hario V60 pour-over kit (~$30)
- Gooseneck kettle (~$35)
- Total: ~$235
Angle: Standard specialty coffee recommendation. Emphasized that the grinder is 70% of coffee quality.
Gemini
Recommended Setup:
- 1Zpresso JX hand grinder (~$70)
- AeroPress (~$40)
- Digital scale (~$15)
- Fellow Stagg kettle (~$90)
- Total: ~$215
Angle: Maximized gear variety within budget. AeroPress for "clean, bright" specifically. Hand grinder to reallocate budget.
Perplexity
Recommended Setup:
- Baratza Encore grinder (~$170)
- Clever Dripper (~$25)
- Basic gooseneck kettle ($25)
- Total: ~$220
Angle: Cited James Hoffman's reviews and r/coffee consensus. Clever Dripper recommended for beginners transitioning from drip — forgiving technique.
Claude
Recommended Setup:
- Option A (ease-focused): Baratza Encore ($170) + Clever Dripper ($25) + kettle ($30) = $225
- Option B (budget-focused): 1Zpresso Q2 hand grinder ($100) + AeroPress ($40) + gooseneck kettle ($40) + scale ($15) = $195
Angle: Gave two paths. Explicitly addressed the "clean, bright" flavor preference (recommended light roasts + paper filters). Warned that the jump from pre-ground to fresh-ground is 90% of the improvement — brewer choice is secondary.
The Verdict
| Criteria | ChatGPT | Gemini | Perplexity | Claude |
|---|---|---|---|---|
| Within $250 budget | ✅ $235 | ✅ $215 | ✅ $220 | ✅ Both options |
| Addressed "clean, bright" preference | ⚠️ Implied | ✅ AeroPress | ⚠️ Implied | ✅ Explicit |
| Beginner-appropriate | ⚠️ V60 is technique-sensitive | ⚠️ Hand grinder is effort | ✅ Clever Dripper | ✅ Gave both options |
| Explained WHY each item | ⚠️ Basic | ✅ Good | ✅ Cited sources | ✅ Detailed |
Takeaway: All four gave legitimate coffee upgrade paths. Gemini and Claude included hand grinder options that let you reallocate budget. Perplexity cited the community consensus. Claude was the only one that gave two options and explicitly addressed the flavor preference.
Cross-Test Patterns
After running these (and dozens more tests), clear patterns emerge:
| Pattern | ChatGPT | Gemini | Perplexity | Claude |
|---|---|---|---|---|
| Default bias | Popular/safe picks | Spec-heavy picks | Expert consensus | Value/analytical picks |
| Sources | Rarely cited | Rarely cited | Always cited | Rarely cited |
| Real-time pricing | Approximate | Better (Google data) | Best (live search) | Approximate |
| Follow-up depth | Excellent | Good | Good | Excellent |
| Budget awareness | Good | Pushes to top | Good | Strongest |
| Contrarian options | Rare | Sometimes | Follows consensus | Most likely |
| Personalization | Good | Moderate | Moderate | Strongest |
Which AI Should You Use?
- "I want the safest recommendation" → ChatGPT
- "I want to compare raw specifications" → Gemini
- "I want sourced, verifiable recommendations" → Perplexity
- "I want a value-optimized, analytical answer" → Claude
- "I want to be thorough" → Run the query through at least two
Every comparison on this page used real, identical prompts run in March 2025. AI models update frequently — results may shift over time. That's the point: cross-check, don't trust any single source.
Part of the byPrompt Network. See also: AI Shopping Tools → | Ready-to-Use Prompts → | The Shopping Playbook →