
# The Real Cost of Flaky API Tests
A test that sometimes passes and sometimes fails is not a test -- it is noise. Here is what flaky API tests actually cost your team, and what to do about it.
Every engineering team has them: tests that fail occasionally for no obvious reason. Re-run the suite and they pass. Merge the PR. Move on.
The problem is that "re-run and it passes" is not a solution. It is a coping mechanism. And that coping mechanism has a real cost that compounds silently over time.
## What makes an API test flaky?
Most flaky API tests share a common root cause: the test depends on something outside its control.
The most common culprits:
- **External API calls in tests.** If your test calls a real third-party API, it can fail because the API is rate-limiting you, is experiencing an outage, or simply responded slowly enough to hit a timeout. None of these failures say anything about your code.
- **Shared test state.** If two tests write to the same database or API resource, they interfere with each other depending on execution order. A test that passes in isolation fails when run in parallel.
- **Time-dependent behavior.** Tests that check timestamps or polling intervals fail when the CI machine is slow. A test that waits 500ms for an event might fail when the CI runner is under load.
- **Network latency variance.** The same HTTP request can take 80ms in one run and 3 seconds in another. If your test has a fixed timeout, it flaps.
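The timing-related culprits above share a fix: poll against a deadline instead of sleeping for a fixed interval. Here is a minimal sketch (the `wait_for` helper is illustrative, not part of any particular framework):

```python
import time

def wait_for(condition, timeout=5.0, interval=0.05):
    """Poll `condition` until it returns True or `timeout` seconds elapse.

    Unlike a fixed `time.sleep(0.5)` followed by a single check, this
    keeps retrying up to the deadline, so a slow CI runner does not
    turn a passing test into a flaky one.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if condition():
            return True
        time.sleep(interval)
    return condition()  # one final check at the deadline

# Example: an "event" that becomes ready after a variable delay
ready_at = time.monotonic() + 0.2
assert wait_for(lambda: time.monotonic() >= ready_at)
```

The test now fails only if the event genuinely never happens within the (generous) timeout, not because one particular run was slow.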
## The hidden costs
### Developer trust erodes
When tests fail randomly, developers stop trusting them. The moment a team starts ignoring red builds because "it is probably a flaky test", the test suite has lost its purpose. A failing test that gets re-run instead of investigated is a test that no longer serves as a quality gate.
This erosion is gradual and invisible. You will not notice it until a real bug slips through a build that "looked fine after the re-run."
### CI time compounds
A flaky test that requires an average of 1.3 runs to pass adds 30% to that test's CI cost. Across a suite of 200 tests with a 10% flakiness rate, that adds up to significant wasted compute time every day.
More painfully, it adds developer waiting time. Waiting 8 minutes for a re-run to confirm what you already suspected is 8 minutes of broken flow.
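The 30% figure follows from simple retry math: if each run spuriously fails with probability p and CI retries until the test passes, the expected number of runs follows a geometric distribution. A sketch:

```python
# Expected CI runs for a test that is retried until it passes,
# when each run spuriously fails with probability p.
def expected_runs(p):
    # Geometric distribution: E[runs] = 1 / (1 - p)
    return 1 / (1 - p)

# A test needing 1.3 runs on average corresponds to a ~23% spurious
# failure rate per run.
overhead = expected_runs(0.23) - 1
print(f"{expected_runs(0.23):.2f} runs, {overhead:.0%} overhead")
# prints "1.30 runs, 30% overhead"
```

Note how quickly this grows: at a 50% spurious failure rate the test costs twice its nominal CI time, on every single pipeline run.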
### Debugging real failures becomes harder
When tests fail intermittently, teams train themselves to look for "the real failure"—the one that is consistent. Intermittent failures become background noise.
The problem is that some "intermittent" failures are not random at all. They are failures that reproduce under specific conditions—high load, specific data, a race condition. By treating them as noise, you lose the signal they were trying to give you.
## The fix: replace external dependencies with mocks
The most durable solution to flaky API tests is the simplest one: do not call real APIs in your tests.
If your test calls the Stripe API to verify payment intent creation, replace that call with a mock that returns a deterministic response. Your test now checks whether your code constructs the right request and handles the expected response correctly—which is what it should have been testing all along.
```bash
# Import a Stripe mock rule set
curl -fsSL https://raw.githubusercontent.com/apxydev/apxy/main/mock-templates/stripe/rules.json \
  -o stripe-rules.json
apxy rules import stripe-rules.json
```

Run your test suite with APXY intercepting all API traffic. Stripe calls return your mocked responses. The tests run in milliseconds, offline, with identical results every time.
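The same principle applies at the unit-test level with any mocking library. Here is a sketch using Python's stdlib `unittest.mock`; the `create_payment` function and the injected client interface are illustrative, not real Stripe SDK calls:

```python
from unittest.mock import Mock

# Illustrative application code: builds a request and interprets
# the response. The HTTP client is injected so tests can replace it.
def create_payment(client, amount_cents, currency="usd"):
    resp = client.post("/v1/payment_intents",
                       {"amount": amount_cents, "currency": currency})
    if resp["status"] != "requires_payment_method":
        raise RuntimeError(f"unexpected status: {resp['status']}")
    return resp["id"]

# The mock returns a deterministic response: no network, no rate
# limits, no outages.
client = Mock()
client.post.return_value = {"id": "pi_test_123",
                            "status": "requires_payment_method"}

assert create_payment(client, 2000) == "pi_test_123"

# Verify our code constructed the right request -- the part we own.
client.post.assert_called_once_with(
    "/v1/payment_intents", {"amount": 2000, "currency": "usd"})
```

Note that the assertions check request construction and response handling, which is exactly the logic the unit test is responsible for.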
## But won't mocks give false confidence?
This is the most common objection, and it is worth addressing directly.
Yes, a mock is not a real API. If the real API changes its response shape, your mocked tests will not catch it.
But this is a false tradeoff. The choice is not between "tests that call real APIs" and "tests that are wrong." The choice is between:
- Unit tests with mocks: fast, deterministic, verify logic in isolation
- Integration tests with real APIs: slower, flaky, verify the real integration
You need both. Mocked unit tests catch most bugs. Integration tests (run less frequently, perhaps only on release branches) catch API contract changes.
The mistake most teams make is using integration-style tests in every PR build. That is what creates flakiness. Move the real-API tests to a scheduled job or a pre-release gate where flakiness is tolerable. Use mocks in the fast path.
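One lightweight way to keep real-API tests out of the PR fast path is an environment-variable gate. A sketch using Python's stdlib `unittest` (the variable name `RUN_INTEGRATION` is an arbitrary choice):

```python
import os
import unittest

# Only the scheduled/nightly CI job sets RUN_INTEGRATION=1.
RUN_INTEGRATION = os.environ.get("RUN_INTEGRATION") == "1"

class PaymentLogicTest(unittest.TestCase):
    # Fast, mocked unit test: runs on every commit.
    def test_amount_is_positive(self):
        self.assertGreater(2000, 0)

@unittest.skipUnless(RUN_INTEGRATION,
                     "set RUN_INTEGRATION=1 in the nightly job")
class StripeContractTest(unittest.TestCase):
    # Real-API contract test: only runs when the scheduled job opts in.
    def test_payment_intent_shape(self):
        pass  # would call the real sandbox API and check response shape
```

PR builds run the suite with the variable unset, so the contract tests report as skipped rather than flaking; the nightly job exports `RUN_INTEGRATION=1` and exercises the real integration.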
## What a healthy test setup looks like
| Test layer | What it tests | API approach | When it runs |
|---|---|---|---|
| Unit tests | Business logic | Mocked with APXY | Every commit |
| Integration tests | API contracts | Real or sandbox | Daily / pre-release |
| E2E tests | Full user flows | Staging environment | Pre-deploy |
The unit and integration layers are distinct. Each has a clear job. Flakiness is isolated to the integration layer where it is tolerable.
## The compounding benefit
When your unit tests are fast and reliable, developers run them constantly. They catch bugs earlier, in a tighter feedback loop, before they compound into bigger issues. The cost of finding a bug in the first five minutes of a PR is a fraction of the cost of finding it in QA, staging, or production.
Reliable tests change developer behavior. Developers who trust their test suite write more tests. Teams with more tests ship with more confidence. The investment pays for itself.
If your CI has a known flaky API test, that is not a minor inconvenience. It is a leak in your quality system. Fix it once with a mock, and get the compound return of a test suite your team actually trusts.
See *What is API Mocking? A Developer's Guide* for a practical introduction to mocking, and *How to Mock a REST API in 5 Minutes with APXY* to get a mock running today.