mirror of https://github.com/ROCm/composable_kernel.git synced 2026-06-30 19:57:40 +00:00

Files

John Shumway 0caf06e6f1 Add build trace analysis tools for -ftime-trace data

Introduces a new Python toolset in script/analyze_build/ for analyzing
Clang -ftime-trace JSON output to identify compilation bottlenecks and
optimize C++ metaprogramming build times.

Key features:
- Fast parallel processing of trace json files  using all CPU cores (> 100 files/sec)
- Simple, cache-free architecture for consistent performance
- Comprehensive analysis of template instantiations and event types
- Command-line tools and Jupyter notebook support
- Automatic orjson detection for JSON parsing speedup

Components:
- trace_analysis/: Core library (models, parser, transformer)
- examples/: CLI tools for single-file and directory analysis
- notebooks/: Comprehensive Jupyter notebook with analysis patterns
- Detailed README with usage examples and performance data

Also adds ruff configuration to pyproject.toml to ignore E402 (module
level import not at top of file) for Jupyter notebooks, which commonly
have imports after markdown cells.

This toolset addresses the critical problem of long build times in CK's
C++17 metaprogramming codebase by treating -ftime-trace as a big data
problem, using pandas and modern analysis tools to understand compilation
patterns and measure improvement opportunities.

2026-01-03 18:28:22 -05:00

__init__.py

Add build trace analysis tools for -ftime-trace data

2026-01-03 18:28:22 -05:00

models.py

Add build trace analysis tools for -ftime-trace data

2026-01-03 18:28:22 -05:00

parser.py

Add build trace analysis tools for -ftime-trace data

2026-01-03 18:28:22 -05:00

README.md

Add build trace analysis tools for -ftime-trace data

2026-01-03 18:28:22 -05:00

transformer.py

Add build trace analysis tools for -ftime-trace data

2026-01-03 18:28:22 -05:00

README.md

Trace Analysis Library

Modern data pipeline for analyzing Clang -ftime-trace build performance data. Provides fast JSON parsing and conversion to pandas DataFrames for immediate analysis in scripts and Jupyter notebooks.

Design Principles

Modular, modern data pipeline architecture - Clean separation between parsing, transformation, and analysis
Fast path to pandas DataFrames - Get to analyzable data structures quickly
Simplicity and efficiency - Straightforward code with optimized data structures
Concurrent processing - Leverage parallelism where it helps performance
Flexible integration - Standalone library that works well in scripts and interactive sessions

Quick Start

from pathlib import Path
from trace_analysis import TraceFile, TraceParser, TraceTransformer

# Parse a trace file
trace_file = TraceFile.from_path(Path("build.json"))
events = TraceParser.parse(trace_file)

# Convert to DataFrames
events_df = TraceTransformer.to_events_dataframe(events)
templates_df = TraceTransformer.to_templates_dataframe(events)

# Analyze
print(f"Total events: {len(events_df):,}")
print(f"Total time: {events_df['dur'].sum() / 1e6:.2f}s")
print(f"Template time: {templates_df['dur'].sum() / 1e6:.2f}s")

Module Overview

models.py - Data Models

TraceFile: Lightweight metadata wrapper for trace files.

@dataclass
class TraceFile:
    path: Path
    size_bytes: int
    mtime_ns: int
    
    @classmethod
    def from_path(cls, path: Path) -> "TraceFile"

Usage:

trace_file = TraceFile.from_path(Path("build.json"))
print(f"File: {trace_file.name}, Size: {trace_file.size_bytes:,} bytes")

parser.py - JSON Parsing

TraceParser: Fast JSON parsing with automatic orjson optimization.

Key Methods:

@staticmethod
def parse(trace_file: TraceFile) -> List[Dict[str, Any]]
    """Parse trace file and return all events."""

@staticmethod
def is_template_event(event: Dict[str, Any]) -> bool
    """Check if event is template-related."""

@staticmethod
def extract_template_detail(event: Dict[str, Any]) -> str
    """Extract template detail from event."""

Performance Notes:

Automatically uses orjson if available (1.65x faster than stdlib json)
Gracefully falls back to standard library
Optimized for single-line JSON format used by -ftime-trace

Usage:

events = TraceParser.parse(trace_file)
template_events = [e for e in events if TraceParser.is_template_event(e)]

transformer.py - DataFrame Conversion

TraceTransformer: Convert parsed events to optimized pandas DataFrames.

Key Methods:

@staticmethod
def to_events_dataframe(events: List[Dict[str, Any]]) -> pd.DataFrame
    """Convert all events to DataFrame with optimized dtypes."""

@staticmethod
def to_templates_dataframe(events: List[Dict[str, Any]]) -> pd.DataFrame
    """Convert template events to DataFrame with template details."""

@staticmethod
def compute_file_stats(events_df: pd.DataFrame, templates_df: pd.DataFrame, 
                       file_name: str) -> Dict[str, Any]
    """Compute summary statistics for a file."""

@staticmethod
def aggregate_event_types(events_df: pd.DataFrame) -> pd.DataFrame
    """Aggregate events by type with count, sum, mean, max."""

@staticmethod
def aggregate_templates(templates_df: pd.DataFrame) -> pd.DataFrame
    """Aggregate template instantiations by detail."""

Usage:

# Convert to DataFrames
events_df = TraceTransformer.to_events_dataframe(events)
templates_df = TraceTransformer.to_templates_dataframe(events)

# Aggregate analysis
event_summary = TraceTransformer.aggregate_event_types(events_df)
template_summary = TraceTransformer.aggregate_templates(templates_df)

DataFrame Schemas

Note

: The DataFrame schemas are evolving as analysis needs develop. The current focus is on fast intake of raw JSON data into a simple, analyzable structure.

Events DataFrame

Columns and optimized dtypes:

Column	Type	Description
`name`	category	Event type (e.g., "InstantiateFunction")
`dur`	int64	Duration in microseconds
`ts`	int64	Timestamp in microseconds
`pid`	int32	Process ID
`tid`	int32	Thread ID
`ph`	category	Phase (usually "X" for complete events)

Example:

events_df.head()
#                      name      dur         ts   pid   tid ph
# 0  InstantiateFunction  12500  1000000  1234  1234  X
# 1       ParseClass       8300  1012500  1234  1234  X

Templates DataFrame

Columns for template-specific analysis:

Column	Type	Description
`name`	category	Template event type
`dur`	int64	Duration in microseconds
`template_detail`	object	Full template signature

Example:

templates_df.head()
#                  name    dur                    template_detail
# 0  InstantiateFunction  12500  std::vector<int, std::allocator<int>>
# 1   InstantiateClass    8300  MyClass<double, 3>

Parallel Processing Pattern

For analyzing multiple files concurrently:

from pathlib import Path
from concurrent.futures import ProcessPoolExecutor
import pandas as pd

def process_file(json_path: Path):
    trace_file = TraceFile.from_path(json_path)
    events = TraceParser.parse(trace_file)
    return TraceTransformer.to_events_dataframe(events)

# Process all files in parallel
trace_dir = Path("../../build-trace")
json_files = list(trace_dir.rglob("*.json"))

with ProcessPoolExecutor() as executor:
    dfs = list(executor.map(process_file, json_files))

# Combine results
all_events = pd.concat(dfs, ignore_index=True)

Analysis Examples

Top Event Types by Duration

event_totals = events_df.groupby('name', observed=True)['dur'].sum()
top_events = event_totals.sort_values(ascending=False).head(10)
print(top_events / 1e6)  # Convert to seconds

Most Expensive Templates

template_totals = templates_df.groupby('template_detail')['dur'].agg(['sum', 'count', 'mean'])
expensive = template_totals.sort_values('sum', ascending=False).head(10)
print(expensive)

Template Time Percentage

total_time = events_df['dur'].sum()
template_time = templates_df['dur'].sum()
print(f"Template instantiation: {(template_time / total_time) * 100:.1f}% of build time")

Future Directions

This library is actively evolving:

Schema evolution: DataFrame schemas will expand as analysis needs develop
Visualization helpers: Planned addition of common visualization utilities
Analysis utilities: Additional aggregation and analysis functions as patterns emerge

The core pipeline architecture (parse → transform → analyze) will remain stable while specific schemas and utilities evolve based on practical analysis needs.

README.md

Trace Analysis Library

Design Principles

Quick Start

Module Overview

models.py - Data Models

parser.py - JSON Parsing

transformer.py - DataFrame Conversion

DataFrame Schemas

Events DataFrame

Templates DataFrame

Parallel Processing Pattern

Analysis Examples

Top Event Types by Duration

Most Expensive Templates

Template Time Percentage

Future Directions

See Also