OneSafe Logo

Case Study — OneSafe

How OneSafe Cut Release QA From Four Days to One

A conversation between
Sneha Sivakumar
Sneha Sivakumar
CEO of Spur
Denise Anne Gamboa
Denise Anne Gamboa
Product & Project Manager, OneSafe

COMPANY

OneSafe is a regulated neobank that gives businesses and individuals an account for moving and managing money, including digital assets.

INDUSTRY

Fintech ($200 Million)

COMPANY SIZE

1–50

FOUNDED

2021

75%

First stats for One Safe

Less QA time per release

down from four days to one.

80%

Second stats for One Safe

Bug catch rate,

up from roughly 60% manually.

95%

Third stats for One Safe

Release confidence,

up from under 80% on a finance platform.

The Challenge

Manual QA on a fintech platform with one person and a day to do it.

Denise came into QA from customer service and operations, no testing background, no fintech experience. She built the process herself, working from hundreds of test cases in a Notion spreadsheet, running every flow by hand.

The problem wasn't effort. It was math. Across seven or eight payment methods, a manual pass might cover two or three. Flows got walked halfway rather than completed end to end. With a weekly code freeze and roughly a day for QA, there wasn't time for more.

When bugs surfaced days apart, releases slipped to Tuesday or Wednesday. Changes made after code freeze were sometimes missed entirely. And withdrawals, the most critical flow on the platform, where a single bug was effectively a system-down event, still left Denise less than 80% confident the platform was clean.

"A withdrawal bug was effectively a system-down event."
Automated QA for payment methods preventing system-down events

The Solution

47 suites, 124 tests, 2,500 runs a month, all written in plain English.

OneSafe now runs automated regression and legacy feature testing through Spur, about 2,500 test runs a month across 47 suites and 124 tests. Instead of scripting each case individually, Denise connects tests to scenario tables: a set of withdrawal scenarios written once, then reused and multiplied across new tests automatically.

Spur runs on staging before release and again on production afterward, executing every flow end to end, including the payment methods and full withdrawal completions that manual testing consistently skipped.

"I just let Spur run by itself. It is its own employee."
Staging of multiple QA test scenarios by Spur

For writing new suites, Denise uses Spur's MCP integration with Claude. New tests that used to take one to two days to write now take one to two hours. Platform-wide edits that would once mean touching every affected test by hand take seconds.

Spur also works as a change log. When developers ship an unexpected UI or copy change, affected tests fail, surfacing changes that engineering hadn't flagged and bridging the gap between dev and ops.

The Results

Four days of QA became one and confidence crossed 95%.

The shift was immediate and significant. QA time per release fell from four days to one, sometimes two. Where a manual pass might catch five bugs over three days, Spur surfaces a comparable set in an hour or two. Bug catch rate climbed from roughly 60% to about 80%. Release confidence went from under 80% to roughly 95%.

But the bigger change was what Denise could do with the time she got back. Automating regression freed her to focus on the product itself and on managing her operations team, work that actually requires her judgment, rather than spending days clicking through payment flows.

"It is definitely one of the most useful things we have had, not just for QA but for our company in general. I would just suggest other fintech teams try it out. It would give you more security that your actual money and your actual processes and flows are being covered very comprehensively."

Results 1 OneSafe

2,500

test runs per month

Results 2 OneSafe

47

automated suites running on staging and production

Results 3 OneSafe

1–2h

to write new test suites with Claude

Critical e-commerce flows across 30+ regions
Every regional price, discount rule, and product variant automatically tested before your sale goes live, no manual spot-checking required.
Hundreds of partner landing pages
Ensuring that every audience coming from podcasts, newsletters, and other partnerships lands on a page that is on brand and error free.
Staging and production environments
Running tests in staging for high confidence before launch, then validating again on production as a final safety net.

Key Insights

OneSafe didn't hire a QA team. They gave one person, with no QA background, the tools to build one. Automated regression across every payment method. Tests that multiply themselves through scenario tables. A change log that catches what engineering forgets to mention. Four days of manual clicking became one. And Denise went from running flows to running the operation.

CUSTOMER STORIES

More teams, same results.

OneSafe Customer Card
QA time per release dropped from four days to one
How OneSafe Cut Release QA From Four Days to One
Our Place Case Study
Launch-day QA went from three or four people running manual regression to one engineer
How Our Place Automated 80% of Its QA Coverage and Freed Its Team’s Time Up
UncommonGoods cut QA time in half with AI-driven testing
How Uncommon Goods stopped spending 50% of their QA time on Selenium
From Manual QA Bottlenecks to Fast, Reliable Releases with Spur
How Wondr Health enabled an entire team to work on more interesting problems
Scaling shoppable UGC QA across dozens of brands by adding a single URL to a shared Spur scenario table
How Hue QAs shoppable widgets across 20+ merchants without rebuilding anything.
Regression Done by Noon, Every Release
The regression marathon that ran 9am to midnight, every two weeks
2× Faster Deployments, Zero Manual Testing
August deploys every six hours. 25% of those releases had to be rolled back
20x Increase in Release Velocity
Testing Wander with traditional tools was impossible