Our Place Logo

Case Study — Our Place

How Our Place Automated 80% of Its QA Coverage and Freed Its Team’s Time Up

A conversation between
Sneha Sivakumar
Sneha Sivakumar
CEO of Spur
Sebastian Vilanueva
Sebastian Villanueva
QA Engineer, OurPlace
Our Place Case Study

COMPANY

OurPlace is a direct-to-consumer cookware brand running five international storefronts on Shopify.

INDUSTRY

E-Commerce ($20 Million)

COMPANY SIZE

51–200

FOUNDED

2019

80%

First Stat

Automated test coverage

reached across storefronts.

5

Second Stat

International storefronts

covered by automated smoke tests.

1

Third Stat

Engineer now runs QA

for all five storefronts.

The Challenge

Launches used to pull in the whole team and that was a problem.

When OurPlace's QA engineer joined two years ago, there was no testing process at all. He built one from scratch. But even with a structured workflow in place, launches remained a heavy lift, a multi-hour call, three or four people, every storefront, every device. Developers and product managers doing work that had nothing to do with their jobs.

"I kind of had the feeling that QA was a burden for other people that had different technical roles  and that shouldn't be their responsibility."

Automation was the obvious fix. But the obvious fix had already failed. Open-source frameworks like Playwright couldn't survive Shopify's pace of change: new products, sales events, app updates, admin settings touched by teams outside engineering. Scripts would break constantly, and maintenance became a job in itself.

The only automation that stuck was API-level price checking. Which meant they could verify numbers in a database, but not pricing badges, banners, or anything a shopper actually sees on the page.

Open-source frameworks like Playwright couldn't survive Shopify's pace of change

The Solution

The tool that worked from minute one.

OurPlace ran a proper evaluation, Rainforest, Lambda Test's Kai, and Spur, measured against test reliability, ease of creation, and the pricing use case that mattered most.

Kai struggled. A basic end-to-end test took over 30 minutes. Marketing popups broke runs outright. Instructions had to be extremely verbose just to get the agent to behave.

Spur was different from the start.

"From minute one we had the feeling that the tools and the agents Spur was leveraging were more advanced. With the demos, you were just writing test steps live and they were actually working."

His workflow has since evolved into something closer to orchestration than scripting. He feeds his existing manual test cases through the Spur MCP, using Claude or ChatGPT to translate them into Spur test steps, reviews each proposal before it's pushed, triggers focused runs, and has the agent pull and triage the results. A custom rules file tells Spur how he likes to work.

the tools and the agents Spur was leveraging were more advanced
"I am not writing Spur test cases manually anymore. I do it all through the MCP."

Smoke tests now run across all five storefronts and a full regression suite covers the US market.

The Results

Launches changed and the team got their time back.

OurPlace reached 80% automated test coverage, the threshold the team defined as enough to release with confidence. But the bigger shift was what launch day actually felt like.

"During these launch calls, I am able to focus more on exploratory testing and just delegate the regular regression and smoke testing to Spur. I know that in the background, Spur is doing the heavy lift."

The repetitive regression checks happen automatically. Developers and PMs are off the hook. The QA engineer is free for the visual and content bugs that only human judgment catches, the kind of testing he was hired to do.

Spur's support model helped too. Issues he reported were resolved with fixes in place the next day through a direct Slack line.

"It feels like more casual support."
Spur exploratory testing

Asked what it would be like to go back, his answer was direct:

"I wouldn't feel comfortable, because that would mean I would need to spend a lot of time running repetitive tasks again. With Spur, we do feel more comfortable."
Result 1

80%

Automated coverage

Result 2

5

Storefronts automated

Result 3

1

QA engineer, whole operation

Critical e-commerce flows across 30+ regions
Every regional price, discount rule, and product variant automatically tested before your sale goes live, no manual spot-checking required.
Hundreds of partner landing pages
Ensuring that every audience coming from podcasts, newsletters, and other partnerships lands on a page that is on brand and error free.
Staging and production environments
Running tests in staging for high confidence before launch, then validating again on production as a final safety net.

Key Insights

OurPlace didn't just automate tests. They automated the entire launch ritual, the multi-hour calls, the pulled-in developers, the repetitive regression that ate up the one person actually responsible for quality. 80% coverage across five storefronts. One engineer. No scripts to maintain. That's what happens when QA stops being everyone's problem and starts being a system.

CUSTOMER STORIES

More teams, same results.

OneSafe Customer Card
QA time per release dropped from four days to one
How OneSafe Cut Release QA From Four Days to One
Our Place Case Study
Launch-day QA went from three or four people running manual regression to one engineer
How Our Place Automated 80% of Its QA Coverage and Freed Its Team’s Time Up
UncommonGoods cut QA time in half with AI-driven testing
How Uncommon Goods stopped spending 50% of their QA time on Selenium
From Manual QA Bottlenecks to Fast, Reliable Releases with Spur
How Wondr Health enabled an entire team to work on more interesting problems
Scaling shoppable UGC QA across dozens of brands by adding a single URL to a shared Spur scenario table
How Hue QAs shoppable widgets across 20+ merchants without rebuilding anything.
Regression Done by Noon, Every Release
The regression marathon that ran 9am to midnight, every two weeks
2× Faster Deployments, Zero Manual Testing
August deploys every six hours. 25% of those releases had to be rolled back
20x Increase in Release Velocity
Testing Wander with traditional tools was impossible