Deep Review: AI That Explores Your Entire Codebase Before Reviewing Your PR
Most AI code review tools only scan the diff. Deep Review reads your full project — files, configs, tests, dependencies — and catches cross-file bugs that diff-only tools miss. Here's how it works.
Tired of slow code reviews? AI catches issues in seconds. You decide what gets published.
Every AI code review tool has the same blind spot
They all do the same thing: scan the diff, generate comments, post them to your PR.
The diff is a narrow window. It shows what changed. Not what that change breaks three directories away.
Here's a scenario we've all lived through. A developer renames a utility function. The diff looks fine, clean refactor, consistent naming. But the function is imported in 14 other files. Three of those imports now point to nothing. The tests still pass because those code paths aren't covered. Production breaks on Tuesday.
A diff-only tool would never flag this. The renamed file wasn't in the diff. The broken imports weren't in the diff. The missing test coverage wasn't in the diff. Nobody looked.
We built Deep Review to close that gap.
So what is Deep Review, actually?
It's an agent mode in Git AutoReview that uses Claude Code CLI to walk through your full codebase before reviewing a pull request.
Instead of feeding a diff to an LLM and hoping for useful comments, Deep Review spins up an agent that:
- Reads the PR diff to understand what changed
- Opens related files (imports, configs, tests, types, build scripts)
- Traces data flow across modules looking for broken connections
- Runs your linter on affected files
- Checks whether the changed code paths actually have tests
- Produces findings with severity ratings, file references, and fix suggestions
The agent doesn't guess from a context window. It opens your files, follows your imports, reads your tests. Think of it like the difference between looking at a floor plan and actually walking through the building.
Where diff-only review falls apart
I want to be concrete here. These failure modes affect every diff-based tool: CodeRabbit, GitHub Copilot, Qodo Merge, any custom GPT wrapper you've duct-taped together.
Cross-file dependency breaks
You rename utils/formatDate.ts to utils/formatDateTime.ts. The diff shows a clean rename. But formatDate is imported in OrderConfirmation.tsx, InvoiceGenerator.ts, and EmailTemplate.tsx. None of those files are in the diff. The diff-only tool sees a perfectly good rename and moves on.
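A toy sketch of what that check looks like, using the file names from the scenario above. The `findBrokenImports` helper and the data shapes are illustrative, not Git AutoReview's actual implementation:

```typescript
// Illustrative: flag imports that reference a symbol no module exports anymore.
type ImportRef = { file: string; imported: string };

const moduleExports: Record<string, string[]> = {
  // After the rename, utils exports formatDateTime, not formatDate.
  "utils/formatDateTime.ts": ["formatDateTime"],
};

const importsInRepo: ImportRef[] = [
  { file: "OrderConfirmation.tsx", imported: "formatDate" },
  { file: "InvoiceGenerator.ts", imported: "formatDate" },
  { file: "EmailTemplate.tsx", imported: "formatDate" },
];

// An import is broken when no module in the repo exports that symbol.
function findBrokenImports(
  imports: ImportRef[],
  exportsByModule: Record<string, string[]>
): ImportRef[] {
  const allExports = new Set<string>();
  for (const names of Object.values(exportsByModule)) {
    for (const name of names) allExports.add(name);
  }
  return imports.filter((ref) => !allExports.has(ref.imported));
}

const broken = findBrokenImports(importsInRepo, moduleExports);
console.log(broken.map((b) => b.file));
// All three consumers of the old name get flagged — none of them are in the diff.
```

The key point: the check only works if you can see the whole repo's imports and exports, which is exactly the information a diff doesn't carry.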
Hardcoded secrets in untouched files
Your PR adds a new API endpoint. The review focuses on the controller. Meanwhile, staging.env has an AWS key that was committed six months ago and nobody noticed. A diff-only tool never opens staging.env because it wasn't changed in this PR. Why would it?
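The check itself is simple once you actually read the files. A minimal sketch, assuming in-memory file contents — the config values are fake, and the pattern below covers only AWS access key IDs (which start with `AKIA` followed by 16 uppercase letters or digits):

```typescript
// Illustrative secrets scan over config files the diff never touched.
const configFiles: Record<string, string> = {
  "staging.env": "AWS_ACCESS_KEY_ID=AKIAIOSFODNN7EXAMPLE", // AWS's documented example key
  ".env.example": "AWS_ACCESS_KEY_ID=your-key-here",       // placeholder, not a secret
};

// AWS access key IDs: "AKIA" + 16 uppercase alphanumerics.
const awsKeyPattern = /AKIA[0-9A-Z]{16}/;

function filesWithSecrets(files: Record<string, string>): string[] {
  return Object.entries(files)
    .filter(([, contents]) => awsKeyPattern.test(contents))
    .map(([name]) => name);
}

console.log(filesWithSecrets(configFiles));
// Only staging.env is flagged — the placeholder in .env.example passes.
```

A real scanner needs many more patterns and entropy checks, but even this toy version only finds the key because it opened a file that wasn't in the PR.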
Data flow vulnerabilities
A PR modifies input sanitization in the request handler. The diff looks secure — proper escaping, parameterized queries. But the sanitized value gets passed to a downstream function in another file that re-concatenates it into a raw SQL string. The vulnerability isn't in the diff. It's in the path the data takes afterward.
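Here's a hypothetical two-file sketch of that failure mode. The function names (`escapeInput`, `buildQuery`) are made up for illustration — the point is that each file looks fine in isolation:

```typescript
// "handler.ts" — the code in the diff: input looks sanitized here.
function escapeInput(raw: string): string {
  return raw.replace(/'/g, "''"); // naive single-quote escaping
}

// "reportQueries.ts" — a file NOT in the diff: re-concatenates into raw SQL.
function buildQuery(userId: string): string {
  // String interpolation reintroduces injection risk the handler's
  // quote-escaping doesn't cover (e.g. numeric contexts need no quotes).
  return `SELECT * FROM orders WHERE user_id = ${userId}`;
}

const fromRequest = escapeInput("1 OR 1=1"); // no quotes, so escaping changes nothing
const query = buildQuery(fromRequest);
console.log(query);
// The final query still contains the injected predicate. The flaw lives in
// the path the data takes across files, not in either file viewed alone.
```

A reviewer (human or agent) only catches this by tracing the value from the handler into `reportQueries.ts` — which means opening a file the diff never mentions.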
Architecture drift
A developer adds a caching layer to a service. Looks reasonable in isolation. But the system uses eventual consistency, and the caching introduces a race condition that only shows up if you read the architecture docs and the event handlers in a different module.
Missing test coverage
The PR adds a new feature. 200 lines of code. The tests pass. But there are zero tests for the new code — the existing tests cover old paths, nobody wrote new ones. A diff-only tool sees "tests pass" and gives a thumbs up.
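One way to surface this is to compare the PR's new exports against what the test files actually reference. A crude sketch with hypothetical names — real coverage tooling is far more precise, but even a string check beats "tests pass":

```typescript
// Illustrative coverage-gap check: which new exports do the tests never mention?
const newExports = ["handleRefresh", "revokeToken"];

const testFileSource = `
  import { revokeToken } from "../src/auth";
  test("revokeToken clears the session", () => { /* ... */ });
`;

function untestedExports(exported: string[], testSource: string): string[] {
  return exported.filter((name) => !testSource.includes(name));
}

console.log(untestedExports(newExports, testFileSource));
// handleRefresh is exported by the PR but never referenced by any test.
```

"Tests pass" and "the new code is tested" are different claims; you can only tell them apart by reading the test files.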
Deep Review would have caught every one of these. The only difference is that it actually opens the files.
How it works under the hood
When you trigger a Deep Review, here's the sequence:
1. Diff analysis. The agent reads the full PR diff to build a map of what changed — which files, which functions, which lines. Starting point, not the whole picture.
2. Dependency mapping. It follows imports, requires, and type references from changed files outward. UserController.ts imports from AuthService.ts? The agent opens AuthService.ts. That imports from TokenStore.ts? Opens that too. It builds a dependency graph around the PR.
3. Codebase exploration. Based on that graph, the agent opens relevant files — environment configs, build scripts, test files, type definitions. It reads your actual project, not a truncated context window.
4. Linter execution. It runs your project's linter (ESLint, Pylint, whatever you've got) against changed files and their immediate dependencies. Lint results inform the review but don't dominate it.
5. Test analysis. It checks which test files cover the changed code and reads them to understand what's actually tested. If a new function has zero coverage, it flags that.
6. Finding generation. It produces findings, each with a severity level (Critical through Info), file references, an explanation, and a fix suggestion.
You see all of this in a real-time activity log inside VS Code. When the agent finishes, you review each finding and decide what to publish.
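The dependency-mapping step above amounts to a breadth-first walk over an import graph. A minimal sketch, assuming a pre-extracted graph (real tooling parses source files to build it); the file names echo the earlier example, and `mapDependencies` is a hypothetical helper, not Git AutoReview's actual code:

```typescript
// Which modules can the PR's changes reach through imports?
const importGraph: Record<string, string[]> = {
  "UserController.ts": ["AuthService.ts"],
  "AuthService.ts": ["TokenStore.ts", "config/database.ts"],
  "TokenStore.ts": [],
  "config/database.ts": [],
};

// Breadth-first walk outward from the changed files.
function mapDependencies(
  changed: string[],
  graph: Record<string, string[]>
): Set<string> {
  const seen = new Set<string>(changed);
  const queue = [...changed];
  while (queue.length > 0) {
    const file = queue.shift()!;
    for (const dep of graph[file] ?? []) {
      if (!seen.has(dep)) {
        seen.add(dep);
        queue.push(dep);
      }
    }
  }
  return seen;
}

const scope = mapDependencies(["UserController.ts"], importGraph);
console.log([...scope]);
// A one-file diff expands into a four-file review scope.
```

That expansion is the whole trick: the review scope is the transitive closure of the change, not the change itself.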
Watching the agent think
This is the part I like most about Deep Review compared to cloud tools: you can watch it work.
The activity log shows every step:
```
[Agent] Reading PR diff... 12 files changed, 847 lines
[Agent] Opening src/services/AuthService.ts (imported by UserController)
[Agent] Opening src/config/database.ts (referenced in AuthService)
[Agent] Opening tests/auth.test.ts (test file for AuthService)
[Agent] Running ESLint on 4 changed files...
[Agent] Found: database.ts uses connection string from env without validation
[Agent] Checking test coverage for new handleRefresh() method...
[Agent] No tests found for handleRefresh — flagging as coverage gap
[Agent] Generating findings... 4 issues found (1 High, 2 Medium, 1 Info)
```
When a cloud tool tells you "this line might have an issue," you either trust it or you don't. With the activity log, you see what the agent read. If a finding seems off, you trace back through the log and figure out where it went wrong. Or where you went wrong.
Real findings from production codebases
Five things Deep Review has actually caught:
1. Hardcoded secrets across config files
Severity: CRITICAL
The agent opened 3 config files and found API keys in staging.env that weren't in .gitignore. The changed file in the PR was a controller. A diff-only review saw nothing.
2. Broken dependency path after refactor
Severity: HIGH
A config file referenced a build hook that no longer existed after a rename. The agent traced the path through package.json, tsconfig.json, and the build script to find the dead reference.
3. Error handler silently swallowing failures
Severity: HIGH
A try-catch in a service file caught all exceptions and logged them but never re-threw or returned an error. The caller assumed success. The agent found it by tracing the error path from controller through service to database layer.
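The pattern looks like this — a hedged sketch with invented names (`saveOrder`, `createOrderHandler`), compressed into one file for illustration:

```typescript
// The service catches everything, logs, and returns as if nothing failed.
async function saveOrder(order: { id: number }): Promise<void> {
  try {
    throw new Error("db connection refused"); // stand-in for a real DB call
  } catch (err) {
    console.error("saveOrder failed:", (err as Error).message);
    // Bug: no re-throw and no error return — the failure vanishes here.
  }
}

// The controller assumes success because saveOrder resolved normally.
async function createOrderHandler(): Promise<string> {
  await saveOrder({ id: 42 });
  return "201 Created"; // sent even though nothing was saved
}

createOrderHandler().then((status) => console.log(status));
// The handler reports success; only the log line hints that the write failed.
```

Neither function is wrong on its own — the catch block is plausible service-layer code, the handler is a plausible controller. The bug only exists in the contract between them, which is why tracing the error path across files is what finds it.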
4. Missing type validation on API boundary
Severity: MEDIUM
The API accepted userId as a string but passed it to a database query expecting a number. TypeScript types were correct at each layer individually, but the runtime conversion wasn't handled. The agent found the mismatch by reading the route handler, service layer, and database query together.
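A compressed sketch of that mismatch. Function names and the in-memory "database" are hypothetical; the unchecked cast stands in for whatever glue code passed the value through:

```typescript
// Route handler layer: query params always arrive as strings.
function getUserIdFromRequest(query: Record<string, string>): string {
  return query["userId"];
}

// Database layer: keys users by numeric id.
const usersById = new Map<number, string>([[7, "Ada"]]);

function findUser(userId: number): string | undefined {
  return usersById.get(userId);
}

// Each layer type-checks on its own, but threading the raw string through
// (modeled here as an unchecked cast) fails at runtime: Map key lookup
// uses strict equality, so get("7") is not get(7).
const rawId = getUserIdFromRequest({ userId: "7" });
const wrong = findUser(rawId as unknown as number); // undefined at runtime
const right = findUser(Number(rawId));              // the explicit conversion works
console.log(wrong, right);
```

TypeScript can't save you here because the lie happens at a boundary the compiler never sees as one — which is also why a reviewer has to read both layers together.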
5. Test file testing the wrong function
Severity: MEDIUM
After a refactor, a test file still imported and tested an old version of a function. Tests passed because the old function still existed (marked deprecated). The agent compared test imports against current module exports and flagged the staleness.
Quality assessment across six areas
Beyond individual findings, Deep Review scores your PR across categories:
| Category | What it checks |
|---|---|
| Security | Secrets, injection points, auth gaps, input validation |
| Architecture | Separation of concerns, dependency direction, coupling |
| Error handling | Try-catch coverage, error propagation, fallback logic |
| Type safety | Runtime type mismatches, any-casting, boundary validation |
| Test coverage | Coverage gaps, dead tests, assertion quality |
| Code standards | Linter compliance, naming conventions, dead code |
Each category gets a score and a summary of what needs attention. Your senior devs still do the real review — they just don't have to waste time catching mechanical stuff anymore.
How it compares
| Feature | Git AutoReview Deep Review | CodeRabbit | Greptile | GitHub Copilot |
|---|---|---|---|---|
| Analysis scope | Full codebase | Full codebase (cloud sandbox) | Full codebase (indexed) | Diff only |
| Runs where | Locally in VS Code | Cloud | Cloud | Cloud |
| Activity log | Real-time in VS Code | No | No | No |
| Human approval | Required before publishing | Optional | Optional | No |
| Multi-model | Claude, Gemini, GPT (BYOK) | Fixed | Fixed | Fixed (Copilot) |
| Platforms | GitHub, GitLab, Bitbucket | GitHub, GitLab, Bitbucket, Azure | GitHub, GitLab | GitHub only |
| Review time | 5-25 min | 2-10 min | 2-5 min | 30 sec |
| Pricing | From $9.99/mo + Claude sub | $24-30/user/mo | $30/user/mo | $10-39/user/mo |
| BYOK | Yes | No | No | No |
The trade-off is obvious: Deep Review is slower because it does more. Need a fast linter-level check? Use Quick Review mode (API-based, 15-30 seconds). Need the kind of review a senior engineer would do with coffee on a quiet Sunday? Deep Review.
When to use which mode
Git AutoReview gives you both, and they're built for different situations.
Quick Review makes sense for small PRs under 100 lines, routine changes like dependency bumps or formatting fixes, and when you're batch-reviewing a pile of PRs and need quick feedback.
Deep Review is for the PRs that keep you up at night. Large refactors touching business logic. New features with cross-cutting concerns. Security-sensitive changes. Anything going to main or production that you really don't want to break.
In practice, most teams run Quick Review on about 80% of their PRs and Deep Review on the 20% that actually matter. Review Profiles let you set this up once and switch with one click.
Setup takes about 5 minutes
1. Install Git AutoReview from the VS Code marketplace.
2. Install Claude Code CLI and sign in with your Anthropic account (Pro or Max subscription required).
3. Switch to "Deep Review" mode in the Git AutoReview sidebar. The extension detects Claude Code automatically.
4. Open a PR, click Review, watch the activity log. When it finishes, review findings and publish what you agree with.
No CI/CD changes. No GitHub App. No cloud config. Everything runs on your machine.
The honest trade-offs
We'd rather tell you the downsides upfront than have you find out and feel misled.
It's slow. 5 to 25 minutes per review, depending on project size. For quick feedback, use Quick Review mode instead.
The cost is higher too: Deep Review needs a paid Claude subscription (Pro or Max) on top of your Git AutoReview plan. For teams reviewing critical code daily, that makes sense. Solo devs reviewing small PRs? Quick Review with Gemini or Haiku is a better deal.
The agent is thorough but not omniscient. It catches real issues, but it still misses things a human with deep domain knowledge would spot. A strong first pass, not a replacement for your team.
And it only works with Claude Code CLI. Quick Review works with any model (Claude, Gemini, GPT) via standard API keys, so you're not locked in.
We think being straight about this builds more trust than pretending it's magic. Deep Review is the best option available today for catching cross-file issues in code review. It's not the fastest or cheapest. But when a PR touches critical infrastructure, 15 minutes of thorough analysis beats 15 seconds of diff scanning.
What developers are saying
"Claude Opus gives the best review comments, but for day-to-day reviews I use Gemini and Haiku — great price/performance balance." — Camilo H., Software Developer
"The AI catches things I would have missed, and I love that I can review everything before it gets published." — Viktor B., Sr. Software Architect
"It catches the things you'd normally miss yourself." — Jason O., Sr. Software Engineer
Try it
Deep Review is available now in Git AutoReview for VS Code.
- Free plan: 10 reviews/day, includes Deep Review
- Developer ($9.99/mo): 100 reviews/day, 10 repos
- Team ($14.99/mo): Unlimited reviews, team features
All plans support Deep Review. You just need Claude Code CLI installed separately.
Every finding requires your approval before it reaches your PR. AI suggests. You decide.