Episode Transcript
Welcome to the Deep Dive, where we take your sources and extract the most important insights.
Today we're embarking on a deep dive into a really critical layer of software quality assurance: integration tests.
Our guide for this is Software Engineering: A Modern Approach by Marco Tulio Valente.
We're focusing specifically on Section 8.8, which is all about this topic.
Our mission really is to unpack what these tests actually are, why they're such an essential bridge between, you know, individual code units and a fully working system, and what value they bring to developers and, well, to you as the end user.
Now, we've often discussed unit tests on the show, those microscopic checks on single code components, making sure each little piece works perfectly in isolation, like checking just one gear in a clockwork mechanism, maybe.
Exactly.
But what happens when that gear needs to mesh with the next one, and the spring, and the whole assembly?
What happens when these perfectly working individual parts need to interact, especially with things outside the code like databases or external systems?
Right.
And that's precisely where integration tests, or sometimes people call them service tests, come into play.
They step onto the stage here, this intermediate level of testing.
It's just vital for getting from those isolated parts to a cohesive, working whole.
Yeah, if you think about the bigger picture of software testing, integration tests really do fill that crucial middle ground.
Unit tests, like you said, they're focused on the smallest bits, maybe a single class, a single function, totally isolated.
Integration tests, they bump up the scope.
Their main goal is to exercise a whole service or maybe a significant feature of the system.
So more complex.
Definitely.
It means they involve multiple classes.
Sometimes these classes are even from different packages, you know different modules or parts of the application.
But the real defining characteristic, and where the complexity often lies, is their interaction with real dependencies.
Real ones.
Not fake, exactly.
We're talking actual databases, live remote services it needs to call, external APIs, message queues, that kind of thing.
Unlike unit tests, which often use mocks or simulated versions of these things.
Right.
Exactly, to keep them isolated. Integration tests insist on using the real deal.
They want to see how your code actually behaves when it's talking to the real database it'll use in production or the actual third party service it relies on.
That feels like a really important distinction.
So for the developers building these systems and I guess for us using them too, what does that mean practically?
How does it change their testing routine?
Oh, it absolutely changes things.
The very nature of these tests means a different scale and definitely a different frequency. Because they involve more moving parts, real external systems, maybe network calls, they're just inherently more complex, and consequently they take significantly more time to run compared to those super-fast unit tests.
Right.
Unit tests can run in milliseconds sometimes.
Exactly.
Think about the difference between, say, flicking a single light switch versus firing up the entire electrical system of a building and checking that the lights, the AC, the elevators are all working together.
It just takes longer.
OK, so if they're slower, I'm guessing developers don't run them after, like, every tiny code change.
You're spot on.
Developers might run their unit tests constantly, maybe hundreds of times a day.
Integration tests much less frequently, maybe a few times a day, or perhaps as part of an automated build process like a continuous integration pipeline before things get deployed.
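A minimal sketch of one common way teams separate the two, assuming JUnit 5's tagging feature rather than anything the source prescribes; the tag name and class here are purely illustrative:
```java
// Illustrative only: the tag name "integration" and this class are made up.
// Build tools (for example Maven or Gradle) can be configured to include or
// exclude tests carrying this tag, so these run in the CI pipeline rather
// than on every local change.
import org.junit.jupiter.api.Tag;
import org.junit.jupiter.api.Test;

@Tag("integration")
class ExampleIntegrationTest {

    @Test
    void exercisesAWholeServiceWithRealDependencies() {
        // ... slower test that talks to a real database or external service ...
    }
}
```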
So there's a trade-off: speed versus depth.
That's a great way to put it.
Speed versus depth of validation.
They're slower, yes, but the kind of validation they provide is invaluable.
They catch a whole category of bugs that unit tests just physically cannot find.
Like what kind of bugs?
Well, think about things like misconfigurations between different services, or maybe the structure, the schema, of the database not quite matching what the code expects, or network issues causing timeouts, or subtle problems where your code isn't quite using an external service's API correctly.
These are the kinds of problems that often only show up when all the pieces are actually trying to talk to each other for real, and finding those late, like in production, can be really painful and expensive to fix.
Yeah, I can imagine.
OK, this abstract idea starts to make more sense now, and I think the source helps here, right?
It gives a concrete example.
It does, yeah.
A really good one actually.
It uses a Q&A forum application.
Like a mini Stack Overflow.
Pretty much, yeah.
A simplified version.
And at the core of this app there's a class they call Forum.
Think of it as the main controller.
It handles the basic stuff: adding a new question, listing all the questions that have been asked.
OK, simple enough.
Right, so picture this Forum class.
It has methods like add question and list all questions.
An integration test for this system is designed to actually use these methods in a realistic setting, meaning it interacts directly with the real database where the questions are stored.
I see and the source material shows how they set up a specific integration test class to do this.
Before each test runs, there's this really important setup step.
It makes sure every test starts with a clean slate.
First it truncates the database tables.
Basically it empties them out completely.
Wipes them clean.
Wipes them clean.
Then it creates a new instance of the Forum class and gives it a live, real connection to that now-empty database.
And just as importantly, after each test finishes, whether it passed or failed, a cleanup step runs automatically to close that database connection.
Why is that setup and tear down so critical?
It's all about consistency and reliability.
You want each test to be independent.
You don't want data leftover from a previous test run to mess up the results of the current one.
That clean slate is key.
Makes sense.
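A minimal sketch of what that setup and teardown could look like in JUnit 5, assuming a hypothetical Forum class backed by a plain JDBC connection; the class name, table name, and connection URL are illustrative, not the book's exact listing:
```java
// Sketch only: Forum, the "questions" table and the JDBC URL are assumed
// for illustration; they are not taken verbatim from the source.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;
import org.junit.jupiter.api.AfterEach;
import org.junit.jupiter.api.BeforeEach;

class ForumIntegrationTest {

    private Connection conn;
    private Forum forum;   // hypothetical class under test

    @BeforeEach
    void setUp() throws Exception {
        // Open a live connection to the real test database
        conn = DriverManager.getConnection(
                "jdbc:postgresql://localhost/forum_test", "test", "test");
        // Empty the tables so every test starts from a clean slate
        try (Statement stmt = conn.createStatement()) {
            stmt.executeUpdate("TRUNCATE TABLE questions");
        }
        // Hand the class under test a real database connection
        forum = new Forum(conn);
    }

    @AfterEach
    void tearDown() throws Exception {
        // Runs whether the test passed or failed: release the connection
        conn.close();
    }
}
```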
So within this Q&A forum example, what specific things are the tests actually checking?
What scenarios?
The example highlights 2 really clear scenarios.
The first one is, well, pretty fundamental.
It just checks that if the database is empty right after that cleaning step, calling list all questions actually returns an empty list.
It confirms the initial state is correct.
A simple baseline check, exactly.
The second test is more involved.
It simulates a more typical user action.
It uses the add question method to add 3 different questions to the forum.
Then it calls list all questions to get them back.
But it doesn't just check that it got three questions back, it goes deeper.
It actually verifies the content.
The text of each question retrieved matches exactly what was put in.
So it checks the whole round trip: adding, saving, retrieving, and checking the data integrity.
Precisely.
It demonstrates that complete life cycle, adding data, making sure it's persisted correctly in the database and then successfully getting it back out unchanged.
It really proves the components are working together properly.
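Sticking with the hypothetical ForumIntegrationTest sketched a moment ago, those two scenarios might look roughly like this; the question texts, the Question type and its getText() method are again assumptions for illustration, not the book's code:
```java
// These two methods would sit inside the hypothetical ForumIntegrationTest
// above; imports needed: org.junit.jupiter.api.Test, java.util.List, and
// static org.junit.jupiter.api.Assertions.*.

@Test
void listIsEmptyOnAFreshDatabase() {
    // After the truncate in setUp(), there should be nothing to list
    assertTrue(forum.listAllQuestions().isEmpty());
}

@Test
void addedQuestionsComeBackUnchanged() {
    // Round trip: add three questions, then read them back from the real database
    forum.addQuestion("How do integration tests differ from unit tests?");
    forum.addQuestion("Why truncate the tables before each test?");
    forum.addQuestion("When should a test hit a real database?");

    List<Question> questions = forum.listAllQuestions();

    assertEquals(3, questions.size());
    // Verify the content, not just the count
    assertEquals("How do integration tests differ from unit tests?", questions.get(0).getText());
    assertEquals("Why truncate the tables before each test?", questions.get(1).getText());
    assertEquals("When should a test hit a real database?", questions.get(2).getText());
}
```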
Looking at that example then what really jumps out at you as the most important take away about integration tests?
For me, there are two fascinating things here.
First, the tests themselves are written using JUnit.
Which people might know from unit testing.
Exactly, which shows how versatile a framework like JUnit is.
It's not just for tiny unit tests, it can handle these more complex integration scenarios too.
Second, and this is probably the most crucial point: it's a true integration test, because it's testing that Forum class with its actual dependency, the...
Real database.
The real database, not some simulated stand in.
That's absolutely critical for finding those real world interaction problems we talked about.
And that process of resetting the database before each test, that's vital too. It ensures every test starts from the same known, predictable state.
It stops tests becoming flaky or unreliable because of leftover data.
So essentially this test really puts the core Forum functions through their paces, checking how the different parts work together right down to the database layer.
But notice it deliberately doesn't worry about the user interface, how things look on screen.
The focus is squarely on that back end logic and the data interaction.
That's kind of the sweet spot for integration tests.
That sounds really thorough, but you know, with that kind of real world interaction, it sounds like it could also be tricky.
What are some common headaches developers run into with these tests?
Oh.
Absolutely.
They're powerful, but they definitely come with challenges.
Test flakiness is probably the biggest one.
Flakiness, meaning they sometimes fail for no obvious reason.
Exactly, because they're talking to real databases and real external services over real networks.
They can fail because of temporary network glitches, or a database being a bit slow, or an external service having a brief outage.
The test fails, but there's actually no bug in the code being tested.
And that flakiness can really damage developers' confidence in the tests.
If they fail randomly, people start ignoring them.
Yeah, that's not good.
What else?
Managing the test data can be a significant hurdle too.
We saw the example just wipe the tables clean, which is great for simple cases.
But imagine a really complex system with lots of interconnected data.
Resetting everything might be too slow, or you might need very specific data setups for certain tests.
So developers often have to spend quite a bit of effort carefully creating the right test data before a test and then cleaning it up afterwards, making sure tests don't interfere with each other.
Sounds like a lot of overhead.
It can be, and there's also the operational cost.
You might need separate test databases, maybe special accounts for external services just for testing, or ways to simulate parts you can't easily control.
Tools like Testcontainers have become popular to help with this.
They let you spin up things like a database inside a Docker container just for the test run, giving you an isolated, disposable environment.
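As a rough sketch of that idea, assuming the Testcontainers library for Java with its JUnit 5 integration and reusing the hypothetical Forum class from earlier; none of this is from the source itself:
```java
// Sketch only: assumes the org.testcontainers junit-jupiter and postgresql
// modules are on the classpath and Docker is available; Forum and the
// "questions" table remain hypothetical.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;
import org.junit.jupiter.api.AfterEach;
import org.junit.jupiter.api.BeforeEach;
import org.testcontainers.containers.PostgreSQLContainer;
import org.testcontainers.junit.jupiter.Container;
import org.testcontainers.junit.jupiter.Testcontainers;

@Testcontainers
class ForumContainerizedTest {

    // A throwaway PostgreSQL instance started in Docker for this test class
    @Container
    static PostgreSQLContainer<?> postgres = new PostgreSQLContainer<>("postgres:16");

    private Connection conn;
    private Forum forum;   // hypothetical class under test

    @BeforeEach
    void setUp() throws Exception {
        // Connect to the container's database instead of a shared test server
        conn = DriverManager.getConnection(
                postgres.getJdbcUrl(), postgres.getUsername(), postgres.getPassword());
        try (Statement stmt = conn.createStatement()) {
            stmt.executeUpdate(
                "CREATE TABLE IF NOT EXISTS questions (id SERIAL PRIMARY KEY, text TEXT)");
            stmt.executeUpdate("TRUNCATE TABLE questions");
        }
        forum = new Forum(conn);
    }

    @AfterEach
    void tearDown() throws Exception {
        conn.close();
    }
}
```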
That's clever.
Yeah, it helps manage some of that complexity, but ultimately it's always a balancing act.
You want that deep validation integration tests give you, but you need to keep them reasonably fast and crucially reliable and maintainable.
They need careful design and ongoing attention.
So it requires a thoughtful...
Approach, definitely.
They're a powerful tool, but not one you just slap together.
You need to think about what you're trying to achieve and how to do it reliably.
OK, well, that really clarifies their role.
So that wraps up our deep dive into integration tests for today.
We've seen how they act as this vital middle layer in testing, making sure whole features or services work correctly with their actual real dependencies like databases or other services.
Bridging that gap.
Exactly.
They're more complex, they take longer to run, so they're used less often than unit tests, but that deep validation they provide gives really valuable confidence that the whole system hangs together properly.
Confidence is the keyword there I think.
It really is confidence that the software you rely on everyday has had these crucial connections rigorously checked.
Thank you for joining us on this deep dive.
