Fixing Legacy: What Should I Blow Up First?

Another good question came over the wires at work. My reply grew too long and I figured more people would want to see it. Besides, this way I can blog and call it a legitimate business activity.

Problem statement: what patterns and strategies work for choosing when and what to refactor? Does this change at scale?

Who Should I Listen To?

The people I listen to most for this sort of thing are James Shore (who needs more names or fewer good articles to link to), Joshua Kerievsky, Michael Feathers, Corey Haines, and J.B. Rainsberger. They share two key traits:

They think deeply about reflective engineering and the systems and humans that do it.
They are active coders who work with real products (in all their legacy glory) every day.

I like to think that describes me too. So here’s my whack at it.

What Matters?

I see four aspects to this (listed in descending order of importance):

Value: In refactoring legacy code (and, honestly, almost no one is working with anything but legacy code), the goal is to make the most valuable changes first. We might never refactor everything, so we need to get the high-value stuff first. But how do we find that?
Risk: Refactoring, like any code change, has the chance to introduce bugs. Not all bugs are equal. Some areas of the code are more difficult to observe, so bugs take longer to discover. Some are more critical scenarios, so bugs have more impact. We need a good way to take risk into account when choosing what to refactor.
Cost: Some code is just more expensive to change. It has more dependencies, more duplication (or near-duplication), more special cases, has seen less refactoring, is less documented, uses more arcane technologies, or is just written in a language with less tool support. All else being equal (it usually isn’t), we can get better ROI if we work on the easy stuff. In any case, refactoring is a skill. It takes time to learn. So it makes sense to start with some easy problems and to mix in harder problems as skill improves.
Authority: Depending on the team, there may be some code you aren’t allowed to change. It’s always nice when those boundaries don’t exist (you don’t end up with refactorings that extend just to the point of some boundary), but some of them are necessary for other reasons. This is lowest in importance because often the best answer is to figure out how to expand the team’s authority without losing the “other reasons.”
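One way to make that ordering concrete is to sort refactoring candidates by the four aspects in priority order. This is only a minimal sketch, not a method taken from any of the people named above: the `Candidate` class, the 1-to-5 scores, and the module names are all hypothetical, and a strict tie-break ordering is cruder than the judgment a real team would apply.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A hypothetical refactoring candidate; scores are rough 1-5 guesses."""
    name: str
    value: int      # payoff from cleaning this up (higher is better)
    risk: int       # chance and impact of introducing bugs (lower is better)
    cost: int       # effort needed to make the change (lower is better)
    allowed: bool   # whether the team currently has authority to change it


def prioritize(candidates):
    """Sort by value first, then risk, then cost, then authority.

    Authority is only the last tie-breaker, so code outside the team's
    authority still appears in the ranking instead of being dropped.
    """
    return sorted(
        candidates,
        key=lambda c: (-c.value, c.risk, c.cost, not c.allowed),
    )


# Made-up example data.
candidates = [
    Candidate("billing", value=5, risk=4, cost=3, allowed=True),
    Candidate("report_export", value=5, risk=2, cost=4, allowed=True),
    Candidate("vendor_sdk", value=4, risk=1, cost=1, allowed=False),
    Candidate("login_form", value=2, risk=1, cost=1, allowed=True),
]

ranked = [c.name for c in prioritize(candidates)]
print(ranked)
```

A candidate you aren’t allowed to change can still rank high here. That is deliberate: it is a prompt to go negotiate for the authority rather than a reason to ignore the code.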

Now for some approaches that I’ve seen work and heard about working for others.

Leave each campsite a little cleaner

Features tend to cluster in most products. And new features tend to cluster with themselves and away from old features. The fact that you are working with a piece of code right now indicates a higher than average probability that you will work with it again soon. Therefore, focus your refactoring efforts on code that you touch while introducing features.

Some strategies of this kind include:

Add a refactoring budget to each story. For example, “for every hour that you spend working on a feature, you are required to then spend an hour refactoring some part of the code that you touched while working on that feature.”
Pair programming. Often the devs see the messes as they are working. When they are solo, they work around the problem, intending to come back later. And when they finish, they just want to get this code checked in and move the task along. Pair partners really help people hold themselves accountable. They may or may not decide to do a particular refactoring, but the decision will be explicit.
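To check the clustering claim that opens this section against your own history, you can count how often each file shows up in recent changes. This is a rough sketch under stated assumptions: it expects that you have already pulled, per commit, the list of files it touched out of your version control log, and the `commits` data and file names below are made up.

```python
from collections import Counter

# Hypothetical data: each inner list holds the files touched by one commit.
commits = [
    ["cart.py", "pricing.py"],
    ["pricing.py"],
    ["pricing.py", "tax.py"],
    ["cart.py"],
    ["legacy_report.py"],
]


def touch_counts(commits):
    """Count how many commits touched each file."""
    counts = Counter()
    for files in commits:
        counts.update(set(files))  # count a file at most once per commit
    return counts


counts = touch_counts(commits)

# The two most frequently touched files: likely places you will be back in soon.
hot_spots = [name for name, _ in counts.most_common(2)]
print(hot_spots)
```

If the same few files dominate the counts, that supports spending the refactoring budget where the feature work already is rather than on code nobody visits.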

We’re on a mission from god

It can be very helpful to have the whole team share a current mission. This helps motivation, allows clear reporting of progress, and allows you to take on one systemic issue at a time. It often makes sense to start with a simple, mechanical mission, and then work up to more invasive ones as skill and confidence improve.