Skip to main content
Version: 2.0

Data Collection And Analysis

For 20+ years, I’ve seen engineering teams either drowning in data – or starved for it. It’s a surprisingly common paradox. We're told data is king, and we dutifully collect it. But collecting data isn't the same as leveraging it. Too often, I’ve witnessed teams chasing “vanity metrics” – numbers that look good but don’t drive meaningful improvement. This post is about moving beyond that, and building a truly data-driven culture within your engineering organization. It's not about more data, it's about better data, and more importantly, how you use it to deliver value.

Why This Matters: The ROI of Data-Driven Engineering

Before diving into the “how,” let's address the “why.” As engineering leaders, we're constantly juggling priorities: managing teams, justifying investments, demonstrating ROI to stakeholders, and ultimately, building products that users love. A data-driven approach isn't just about pretty dashboards; it’s about making informed decisions that directly impact these goals. It allows us to move beyond gut feelings and assumptions, and instead, build, measure, and iterate based on concrete evidence. Teams that embrace data-driven practices consistently outperform those that don't, leading to faster innovation, reduced costs, and increased customer satisfaction.

The Problem With Just Collecting Data

Let's be honest: many engineering teams treat data collection as a checkbox. "We need analytics!" leads to integrating a tool (and there are many options – see the end of this post for a quick rundown), piping in everything, and then…staring at dashboards no one understands, or worse, ignoring altogether.

This happens for a few key reasons:

  • Lack of Focus: Without clearly defined goals, you end up measuring everything and therefore, nothing.
  • Poor Instrumentation: Data that isn't correctly tracked or consistently labeled is useless. Garbage in, garbage out.
  • No Actionable Insights: Dashboards displaying numbers without context or recommendations are just pretty pictures.
  • Siloed Data: Data living in separate systems (frontend, backend, database, etc.) makes it hard to get a holistic view.

Shifting From Collection to Action: A Three-Phase Approach

Here’s a framework I’ve used successfully to turn data into a lever for genuine improvement:

Phase 1: Define Your "North Star" and Supporting Metrics

Before you write a single line of tracking code, you need to answer a fundamental question: What does success look like? This isn't just about revenue; it's about the core value your engineering team delivers. I call this your “North Star” metric.

For example, if you're building a real-time collaboration tool, your North Star might be “Daily Active Users Engaged in Collaborative Sessions.” A vanity metric for this tool, however, might be ‘Total number of sessions,’ which doesn't tell you whether users are actually collaborating effectively. Supporting metrics then become the leading indicators that influence your North Star. These could include:

  • Session Start Time: How quickly do users get into a session?
  • Features Used Per Session: What features are most engaging?
  • Error Rate During Collaboration: How stable is the experience?
  • Latency of Collaborative Actions: How responsive is the tool?

Phase 2: Instrument Strategically, Not Exhaustively

Now that you know what to measure, focus on instrumenting your code specifically for those metrics. Resist the urge to track everything. Don’t just track errors; track where users are encountering them and how they're trying to recover. This is where tools like Statsig can be incredibly helpful, allowing you to easily add feature flags and A/B testing capabilities along with robust analytics.

Consider these principles:

  • Standardize Event Names & Properties: Consistency is key. A "button_click" event should always have the same properties, no matter where it originates.
  • Focus on User Actions, Not Just System Events: Tracking system errors is important, but understanding how users interact with your product provides more actionable insights.
  • Add Contextual Data: Include user segments (e.g., free vs. paid, new vs. returning) and other relevant data to enable deeper analysis.
  • Consider a Reverse ETL approach: Platforms like Census can help you move data out of your analytics tools and into systems like Salesforce or your data warehouse, allowing product managers to personalize in-app messaging based on user behavior.

Phase 3: Analyze, Iterate, and Share

Collecting data is only half the battle. You need to actively analyze it, draw conclusions, and iterate on your product based on those findings.

  • Regular Data Reviews: Schedule regular meetings (weekly or bi-weekly) to review key metrics with the team.
  • Focus on Trends, Not Just Numbers: Look for patterns and anomalies. Is a metric trending up or down? Why?
  • A/B Test Everything: Don’t rely on gut feelings. Use A/B testing to validate your hypotheses and measure the impact of your changes. Remember to consider statistical significance and sample size to ensure your results are reliable.
  • Share Findings Widely: Make data accessible to everyone on the team. Use dashboards, reports, and presentations to communicate insights and foster a data-driven culture. Focus on telling a compelling story with the data, rather than just presenting numbers. Tools like Yandex.Datalens can make data visualization and sharing easier.

Tools of the Trade (A Quick Rundown)

There's a plethora of analytics and data management tools available. Here are a few to consider:

  • Keen: Custom analytics with a focus on flexibility.
  • Statsig: Feature management, A/B testing, and analytics platform.
  • Census: Reverse ETL and operational analytics.
  • CrateDB: Open-source SQL database for real-time analytics.
  • Yandex.Datalens: Data visualization and analysis service.

The best tool depends on your specific needs and budget.

Moving Beyond Vanity Metrics

Building a data-driven culture isn’t about installing a fancy analytics platform. It’s about cultivating a mindset of continuous learning and improvement. It's also important to acknowledge that this takes effort. Start small – focus on one North Star metric and build from there. By focusing on the right metrics, instrumenting strategically, and actively analyzing your data, you can empower your engineering team to build better products and deliver more value to your users.

What's one North Star metric you can identify for your team this week?