Building an Awesome List: What Top Engineering Teams Actual…

By Vaibhav Verma

The gap between shipping code and shipping great code isn't talent. It's systems.

Good engineering teams have good people. Great engineering teams have good people + systems that make excellence the default.

A great engineer in a bad system ships slower than a mediocre engineer in a great system. The system wins.

Here's what the systems are.

Decision 1: Enforce Code Standards Through Tools, Not Through People

Good teams have code reviews. Great teams have linters, type checkers, and formatters that prevent entire categories of issues before review even happens.

SonarQube. ESLint. TypeScript. Mypy. These aren't nice-to-haves. They're force multipliers.

When the tool enforces the standard, junior engineers don't get yelled at in code review. They get a clear, actionable error. "Type mismatch: expected string, got number." That's a teaching moment that doesn't require a human.

This compounds. After a year of using TypeScript, new engineers don't make type mistakes anymore. It's not because they're smarter. It's because the tool trained them.

Decision 2: Make Testing Easy, Not Optional

Great teams have test infrastructure that makes testing easier than not testing.

One command runs all tests. Tests run in seconds, not minutes. When a test fails, the error message is clear. You can run tests in parallel. Coverage is tracked and enforced.

Bad teams have test infrastructure that's annoying to use. Running tests takes 5 minutes. Coverage tracking is manual. Everyone knows tests are important. Nobody wants to write them because it's painful.

Good teams invest 2-3 weeks upfront in test infrastructure. Then testing is easy. Coverage naturally goes up.

Decision 3: Automate Everything Repetitive

Deployments. Environment setup. Database migrations. Linting. Building. Everything that doesn't require human judgment should be automated.

Great teams have deployment pipelines where one commit = one production deployment (or close to it). You don't deploy. The system deploys.

The first time you set this up, it's slow. The hundredth time, it's instant. And it's the same every time, which means you find issues early.

Decision 4: Make Knowledge Codified, Not Siloed

Knowledge that lives in one person's head is fragile. When they leave, it leaves with them.

Great teams codify knowledge:

Decision records. Every major decision is written down. "We chose microservices because X. We considered Y. This is why X wins." Years later, when someone asks "why did we do this?", you have the answer.

Architecture documentation. Not pretty pictures. Living documents that describe how the system actually works. Updated when systems change.

Runbooks. How do you deploy? How do you scale? How do you debug in production? Written down. Not tribal knowledge.

Code comments. Not "this variable is x." Explain non-obvious decisions. "We cache this because repeated queries are slow. The cache invalidates after 1 hour because data needs to be fresh."

Decision 5: Make It Safe to Refactor

Teams that fear refactoring get stuck with bad code. Bad code slows everything down.

Great teams have test coverage, type checking, and CI/CD that makes refactoring safe. You refactor. Tests fail or succeed. You know immediately if you broke something.

Cultures matter too. "We should refactor this before adding the feature" is encouraged, not discouraged. Technical excellence is valued.

Decision 6: Distribute Knowledge Through Rotation

You have an expert in database systems. Great. But what happens when they take a vacation? What happens when they leave?

Great teams rotate responsibilities. Expert owns X for 6 months. They become the go-to person. After 6 months, rotate. Someone else becomes the expert. The original expert helps the new expert.

After a year, multiple people know every system. Bus factor goes up. Knowledge spreads.

Decision 7: Make Incidents Learning Opportunities

Great teams have blameless post-mortems. When something breaks:

What happened?
Why did it happen?
Why didn't we catch it earlier?
What systems will prevent this in the future?

You're not blaming people. You're improving systems. "Alice deployed bad code" is a people problem. "Our CI/CD didn't catch the bug" is a system problem. Fix the system.

Decision 8: Optimize Onboarding Aggressively

New hire starts. Great teams have:

Setup script that works in one command
First PR in day 1 (something small and safe)
Week 1: understands the architecture
Week 2: can deploy something
Week 4: shipping features independently

Bad teams: Week 8 and they're still figuring out how to run the code locally.

The difference is systems. Great teams invest in onboarding infrastructure. Setup script. Documentation. Pair programming. Clearly defined learning path.

Decision 9: Make Observability the Default

Great teams can answer: "What's broken right now?" in 30 seconds.

Structured logging. Metrics on every layer. Distributed tracing. Dashboards that show the health of the system.

All of this is infrastructure. When it's there, responding to incidents is fast and data-driven. When it's not, it's firefighting.

Decision 10: Make the Boring Part Invisible

Boring but necessary things: dependency updates, security patches, infrastructure upgrades.

Great teams automate them. Dependabot updates dependencies. Security scanners run automatically. Infrastructure-as-code means upgrades are tested before deployment.

The result: nobody spends time on boring work. Systems take care of it.

Getting Started

You don't build all of this at once. Pick your biggest pain point.

Is deployment slow? Fix that first. It's high leverage. Faster deployments mean faster iteration means better decisions.

Is onboarding slow? Fix that. Every new person becomes productive faster.

Is code quality bad? Add linting and type checking. That's quick and impactful.

Is testing painful? Fix test infrastructure. Once testing is easy, engineers do it.

Pick one. Make it great. Move to the next.

Over 2-3 years, you build a system where excellence is the default. Not because you hired amazing people (though you should). But because the systems make it easy.

Frequently Asked Questions

Q: Can we do all this at once?

No. Pick one, execute, measure. Show success. Do the next one.

Q: What if the team resists investing time in systems?

Show them the benefit. "If we spend 2 weeks improving test infrastructure, we'll spend 20% less time debugging. That's 100 hours saved per year."

Q: Which system should we invest in first?

Whatever is slowing you down most. Slow deploys? Fix that. Slow tests? Fix that. Difficult onboarding? Fix that. High incident rate? Improve observability.

By Vaibhav Verma

The gap between shipping code and shipping great code isn't talent. It's systems.

Good engineering teams have good people. Great engineering teams have good people + systems that make excellence the default.

A great engineer in a bad system ships slower than a mediocre engineer in a great system. The system wins.

Here's what the systems are.

Decision 1: Enforce Code Standards Through Tools, Not Through People

Good teams have code reviews. Great teams have linters, type checkers, and formatters that prevent entire categories of issues before review even happens.

SonarQube. ESLint. TypeScript. Mypy. These aren't nice-to-haves. They're force multipliers.

This compounds. After a year of using TypeScript, new engineers don't make type mistakes anymore. It's not because they're smarter. It's because the tool trained them.

Decision 2: Make Testing Easy, Not Optional

Great teams have test infrastructure that makes testing easier than not testing.

One command runs all tests. Tests run in seconds, not minutes. When a test fails, the error message is clear. You can run tests in parallel. Coverage is tracked and enforced.

Good teams invest 2-3 weeks upfront in test infrastructure. Then testing is easy. Coverage naturally goes up.

Decision 3: Automate Everything Repetitive

Deployments. Environment setup. Database migrations. Linting. Building. Everything that doesn't require human judgment should be automated.

Great teams have deployment pipelines where one commit = one production deployment (or close to it). You don't deploy. The system deploys.

The first time you set this up, it's slow. The hundredth time, it's instant. And it's the same every time, which means you find issues early.

Decision 4: Make Knowledge Codified, Not Siloed

Knowledge that lives in one person's head is fragile. When they leave, it leaves with them.

Great teams codify knowledge:

Architecture documentation. Not pretty pictures. Living documents that describe how the system actually works. Updated when systems change.

Runbooks. How do you deploy? How do you scale? How do you debug in production? Written down. Not tribal knowledge.

Code comments. Not "this variable is x." Explain non-obvious decisions. "We cache this because repeated queries are slow. The cache invalidates after 1 hour because data needs to be fresh."

Decision 5: Make It Safe to Refactor

Teams that fear refactoring get stuck with bad code. Bad code slows everything down.

Great teams have test coverage, type checking, and CI/CD that makes refactoring safe. You refactor. Tests fail or succeed. You know immediately if you broke something.

Cultures matter too. "We should refactor this before adding the feature" is encouraged, not discouraged. Technical excellence is valued.

Decision 6: Distribute Knowledge Through Rotation

You have an expert in database systems. Great. But what happens when they take a vacation? What happens when they leave?

Great teams rotate responsibilities. Expert owns X for 6 months. They become the go-to person. After 6 months, rotate. Someone else becomes the expert. The original expert helps the new expert.

After a year, multiple people know every system. Bus factor goes up. Knowledge spreads.

Decision 7: Make Incidents Learning Opportunities

Great teams have blameless post-mortems. When something breaks:

What happened?
Why did it happen?
Why didn't we catch it earlier?
What systems will prevent this in the future?

You're not blaming people. You're improving systems. "Alice deployed bad code" is a people problem. "Our CI/CD didn't catch the bug" is a system problem. Fix the system.

Decision 8: Optimize Onboarding Aggressively

New hire starts. Great teams have:

Setup script that works in one command
First PR in day 1 (something small and safe)
Week 1: understands the architecture
Week 2: can deploy something
Week 4: shipping features independently

Bad teams: Week 8 and they're still figuring out how to run the code locally.

The difference is systems. Great teams invest in onboarding infrastructure. Setup script. Documentation. Pair programming. Clearly defined learning path.

Decision 9: Make Observability the Default

Great teams can answer: "What's broken right now?" in 30 seconds.

Structured logging. Metrics on every layer. Distributed tracing. Dashboards that show the health of the system.

All of this is infrastructure. When it's there, responding to incidents is fast and data-driven. When it's not, it's firefighting.

Decision 10: Make the Boring Part Invisible

Boring but necessary things: dependency updates, security patches, infrastructure upgrades.

Great teams automate them. Dependabot updates dependencies. Security scanners run automatically. Infrastructure-as-code means upgrades are tested before deployment.

The result: nobody spends time on boring work. Systems take care of it.

Getting Started

You don't build all of this at once. Pick your biggest pain point.

Is deployment slow? Fix that first. It's high leverage. Faster deployments mean faster iteration means better decisions.

Is onboarding slow? Fix that. Every new person becomes productive faster.

Is code quality bad? Add linting and type checking. That's quick and impactful.

Is testing painful? Fix test infrastructure. Once testing is easy, engineers do it.

Pick one. Make it great. Move to the next.

Over 2-3 years, you build a system where excellence is the default. Not because you hired amazing people (though you should). But because the systems make it easy.

Frequently Asked Questions

Q: Can we do all this at once?

No. Pick one, execute, measure. Show success. Do the next one.

Q: What if the team resists investing time in systems?

Show them the benefit. "If we spend 2 weeks improving test infrastructure, we'll spend 20% less time debugging. That's 100 hours saved per year."

Q: Which system should we invest in first?

Whatever is slowing you down most. Slow deploys? Fix that. Slow tests? Fix that. Difficult onboarding? Fix that. High incident rate? Improve observability.

Building an Awesome List: What Top Engineering Teams Actually Do Differently

Decision 1: Enforce Code Standards Through Tools, Not Through People

Decision 2: Make Testing Easy, Not Optional

Decision 3: Automate Everything Repetitive

Decision 4: Make Knowledge Codified, Not Siloed

Decision 5: Make It Safe to Refactor

Decision 6: Distribute Knowledge Through Rotation

Decision 7: Make Incidents Learning Opportunities

Decision 8: Optimize Onboarding Aggressively

Decision 9: Make Observability the Default

Decision 10: Make the Boring Part Invisible

Getting Started

Frequently Asked Questions

More articles

Cursor and Copilot Don't Reduce Technical Debt — Here's What Does

GitHub Copilot Doesn't Know What Your Codebase Does — That's the Problem

AI Coding Tools Are Creating Technical Debt 4x Faster Than Humans

Your product has answers. You just can't see them yet.

Building an Awesome List: What Top Engineering Teams Actually Do Differently

Decision 1: Enforce Code Standards Through Tools, Not Through People

Decision 2: Make Testing Easy, Not Optional

Decision 3: Automate Everything Repetitive

Decision 4: Make Knowledge Codified, Not Siloed

Decision 5: Make It Safe to Refactor

Decision 6: Distribute Knowledge Through Rotation

Decision 7: Make Incidents Learning Opportunities

Decision 8: Optimize Onboarding Aggressively

Decision 9: Make Observability the Default

Decision 10: Make the Boring Part Invisible

Getting Started

Frequently Asked Questions

More articles

Cursor and Copilot Don't Reduce Technical Debt — Here's What Does

GitHub Copilot Doesn't Know What Your Codebase Does — That's the Problem

AI Coding Tools Are Creating Technical Debt 4x Faster Than Humans

Your product has answers. You just can't see them yet.