Log Management

EDSL provides comprehensive log management capabilities through the LogManager class. This powerful tool allows you to filter, analyze, and manage EDSL log files with advanced features for debugging, monitoring, and data analysis.

Quick Start

from edsl.logger import LogManager

# Create LogManager instance
log_manager = LogManager()

# View overview with statistics and available commands
log_manager

This displays a rich overview of your log file:

LogManager(
  📁 Log file: /Users/john/.edsl/logs/edsl.log
  📊 Total entries: 9,420
  📅 Date range: 2025-08-15 23:40 to 2025-08-16 09:23
  📈 Levels: INFO:5471, ERROR:3949
  🔍 Top logger: edsl.edsl.coop.coop

  💡 Common commands:
    .get_filtered_entries(level='ERROR', n=50)          # Get recent errors
    .to_scenario_list(level='ERROR', since_hours=24)    # Convert to scenarios
    .analyze_patterns(n=100)                            # Pattern analysis
    .archive()                                          # Archive and clear logs
    .clear()                                            # Clear all log entries
)

Core Features

Log Filtering

Filter log entries using multiple criteria:

from edsl.logger import LogManager

log_manager = LogManager()

# Filter by log level
errors = log_manager.get_filtered_entries(level='ERROR', n=50)
warnings_and_above = log_manager.get_filtered_entries(min_level='WARNING')

# Time-based filtering
recent = log_manager.get_filtered_entries(since_hours=24)
last_week = log_manager.get_filtered_entries(since_hours=168)  # 7 days
date_range = log_manager.get_filtered_entries(
    start_date='2024-08-15',
    end_date='2024-08-16'
)

# Pattern matching
auth_issues = log_manager.get_filtered_entries(pattern='authentication|login')
failures = log_manager.get_filtered_entries(pattern='fail.*|error.*', n=100)

# Logger-specific filtering
coop_logs = log_manager.get_filtered_entries(logger_pattern='coop')
job_logs = log_manager.get_filtered_entries(logger_pattern='jobs')

# Complex combinations
recent_job_errors = log_manager.get_filtered_entries(
    level='ERROR',
    since_hours=12,
    logger_pattern='jobs',
    n=25
)

Scenario Conversion for Analysis

Convert log entries to EDSL scenarios for advanced analysis:

# Convert logs to scenarios
error_scenarios = log_manager.to_scenario_list(level='ERROR', n=50)

# Each log entry becomes a scenario with rich metadata
first_scenario = error_scenarios[0]
print(f"Error level: {first_scenario['level']}")
print(f"Timestamp: {first_scenario['timestamp_str']}")
print(f"Logger: {first_scenario['logger_module']}")
print(f"Message: {first_scenario['message']}")
print(f"Is error: {first_scenario['is_error']}")
print(f"Hour of day: {first_scenario['hour']}")

Available scenario fields: - timestamp_str: Formatted timestamp - logger_name: Full logger name - logger_module: Last part of logger name - level: Log level (ERROR, INFO, etc.) - level_int: Numeric level for comparison - message: Log message content - raw_line: Original log line - hour, minute, weekday: Time components - date_str: Date in YYYY-MM-DD format - is_error: Boolean for ERROR/CRITICAL levels - is_warning_or_above: Boolean for WARNING+ levels - has_exception: Boolean if message contains “exception” - has_failed: Boolean if message contains “fail” - message_length: Character count of message - words_count: Word count of message

Using Log Scenarios with EDSL

Analyze log entries using EDSL questions:

from edsl import QuestionFreeText, QuestionMultipleChoice

# Convert error logs to scenarios
error_scenarios = log_manager.to_scenario_list(level='ERROR', n=25)

# Analyze error causes
analysis_question = QuestionFreeText(
    question_name="error_analysis",
    question_text="What likely caused this error: {{ message }}?"
)

# Categorize errors
category_question = QuestionMultipleChoice(
    question_name="error_category",
    question_text="Categorize this error: {{ message }}",
    question_options=[
        "Authentication/Authorization",
        "Network/Connection",
        "Data/Validation",
        "System/Resource",
        "Configuration",
        "Other"
    ]
)

# Run analysis
from edsl import Survey
survey = Survey([analysis_question, category_question])
results = survey.by(error_scenarios).run()

# View results
print(results.select("error_analysis", "error_category").print())

Pattern Analysis

Extract insights from log patterns:

# Analyze recent log patterns
analysis = log_manager.analyze_patterns(n=1000)

# View time patterns
print(f"Busiest hour: {analysis['time_patterns']['busiest_hour']}")
print(f"Daily distribution: {analysis['time_patterns']['daily_distribution']}")

# View level distribution
print(f"Level counts: {analysis['level_patterns']}")

# View top loggers
print(f"Top loggers: {analysis['logger_patterns']}")

# Error analysis
if 'error_patterns' in analysis:
    error_info = analysis['error_patterns']
    print(f"Error percentage: {error_info['error_percentage']:.1f}%")
    print(f"Exception count: {error_info['exception_count']}")

Statistics and Reporting

Get comprehensive statistics about your logs:

# Overall statistics
stats = log_manager.get_stats()
print(f"Total entries: {stats['total']:,}")
print(f"Date range: {stats['date_range']['earliest']} to {stats['date_range']['latest']}")
print(f"Level distribution: {stats['level_counts']}")
print(f"Top 5 loggers: {list(stats['top_loggers'].items())[:5]}")

# Statistics for filtered entries
recent_errors = log_manager.get_filtered_entries(level='ERROR', since_hours=24)
error_stats = log_manager.get_stats(recent_errors)
print(f"Recent errors: {error_stats['total']}")

Log Maintenance

Archive Logs

Create timestamped backups of your logs:

# Archive with compression and clear original (default)
archive_path = log_manager.archive()

# Archive only, keep original
backup_path = log_manager.archive(clear_after_archive=False)

# Custom archive location without compression
custom_backup = log_manager.archive(
    archive_path=Path('~/my_backups'),
    compress=False,
    clear_after_archive=False
)

The archive process shows detailed progress:

📦 About to archive EDSL log file:
   📁 Source: /Users/john/.edsl/logs/edsl.log
   📊 Entries: 9,420 (2025-08-15 to 2025-08-16)
   📦 Archive: /Users/john/.edsl/archives/edsl_log_20250816_092142.log.gz
   🗜️  Compress: Yes
   🧹 Clear after: Yes
   Continue? (type 'yes' to confirm):

Clear Logs

Remove all log entries to start fresh:

# Clear with confirmation prompt (safe)
success = log_manager.clear()

# Clear immediately for scripts (use with caution)
success = log_manager.clear(confirm=True)

The clear process includes safety confirmation:

⚠️  About to clear EDSL log file:
   📁 File: /Users/john/.edsl/logs/edsl.log
   📊 Entries: 9,420
   ⚠️  This action cannot be undone!
   Continue? (type 'yes' to confirm):

Advanced Usage

Custom Log Files

Work with non-default log files:

from pathlib import Path

# Custom log file location
custom_log_manager = LogManager(Path('/path/to/custom.log'))

# Check if log file exists
if custom_log_manager.log_file_path.exists():
    stats = custom_log_manager.get_stats()
    print(f"Custom log has {stats['total']} entries")

Filtering Best Practices

Combine filters for precise results:

# Find authentication errors in the last day
auth_errors = log_manager.get_filtered_entries(
    level='ERROR',
    pattern='auth.*|login.*|token.*',
    since_hours=24,
    case_sensitive=False
)

# Find slow operations (assuming they log with "slow" keyword)
slow_operations = log_manager.get_filtered_entries(
    pattern='slow.*|timeout.*|.*ms$',
    since_hours=6,
    n=50
)

# Debug specific component
coop_debug = log_manager.get_filtered_entries(
    logger_pattern='coop.*',
    min_level='DEBUG',
    since_minutes=30
)

Performance Optimization

LogManager automatically caches statistics for better performance:

# First call computes and caches stats
log_manager = LogManager()
print(log_manager)  # Caches statistics

# Subsequent calls use cached data
stats = log_manager.get_stats()  # Fast - uses cache

# Cache is cleared after maintenance operations
log_manager.clear(confirm=True)  # Clears cache automatically
print(log_manager)  # Recomputes statistics

Integration Examples

Debugging Workflows

# Daily error review
def daily_error_review():
    log_manager = LogManager()

    # Get yesterday's errors
    errors = log_manager.get_filtered_entries(
        level='ERROR',
        since_hours=24
    )

    if errors:
        print(f"Found {len(errors)} errors in last 24 hours:")
        for error in errors[-5:]:  # Show last 5
            print(f"  {error.timestamp}: {error.message}")

        # Convert to scenarios for analysis
        error_scenarios = log_manager.to_scenario_list(entries=errors)

        # Analyze with EDSL
        from edsl import QuestionFreeText
        q = QuestionFreeText(
            question_name="priority",
            question_text="Rate the priority of this error (1-5): {{ message }}"
        )
        priorities = q.by(error_scenarios[:10]).run()  # Analyze top 10
        print(priorities.select("priority").print())

Monitoring Automation

# Automated log monitoring
def monitor_system_health():
    log_manager = LogManager()

    # Check for recent critical issues
    critical = log_manager.get_filtered_entries(
        level='CRITICAL',
        since_minutes=60
    )

    if critical:
        # Send alert via Coop
        from edsl.coop import Coop
        coop = Coop()
        alert_result = coop.send_log(
            level='CRITICAL',
            since_minutes=60,
            as_scenario_list=True,
            alias='critical-alerts'
        )
        print(f"Alert sent: {alert_result['url']}")

    # Weekly log maintenance
    from datetime import datetime
    if datetime.now().weekday() == 0:  # Monday
        archive_path = log_manager.archive()
        print(f"Weekly archive created: {archive_path}")

Research and Analysis

# Research log patterns for insights
def analyze_usage_patterns():
    log_manager = LogManager()

    # Convert logs to scenarios
    scenarios = log_manager.to_scenario_list(n=1000)

    # Research questions about usage
    from edsl import QuestionMultipleChoice, Survey

    q1 = QuestionMultipleChoice(
        question_name="time_category",
        question_text="What time category is {{ hour }}:00?",
        question_options=["Night (0-5)", "Morning (6-11)", "Afternoon (12-17)", "Evening (18-23)"]
    )

    q2 = QuestionMultipleChoice(
        question_name="operation_type",
        question_text="What type of operation does this suggest: {{ message }}",
        question_options=["API Call", "Data Processing", "User Interface", "System Maintenance", "Other"]
    )

    survey = Survey([q1, q2])
    results = survey.by(scenarios).run()

    # Analyze patterns
    time_patterns = results.select("time_category").to_list()
    operation_patterns = results.select("operation_type").to_list()

    print("Usage patterns analysis complete!")
    return results

Jupyter Notebook Integration

LogManager provides rich HTML displays in Jupyter notebooks:

# In Jupyter notebook
from edsl.logger import LogManager

log_manager = LogManager()
log_manager  # Shows rich HTML table with colored log levels

The HTML display includes: - Overview with file path, entry count, and date range - Color-coded log level distribution (red for errors, blue for info) - Top loggers table with counts - Interactive command examples with syntax highlighting

API Reference

LogManager Class

class LogManager(log_file_path=None)

Main class for EDSL log management.

Parameters:: log_file_path (Path, optional) – Optional path to log file. Defaults to ~/.edsl/logs/edsl.log

get_filtered_entries(**kwargs)

Filter log entries by various criteria.

Parameters:

n (int, optional) – Maximum number of entries to return
level (str or List[str], optional) – Specific level(s) to include (‘DEBUG’, ‘INFO’, ‘WARNING’, ‘ERROR’, ‘CRITICAL’)
min_level (str, optional) – Minimum level (includes this level and above)
since_hours (float, optional) – Include entries from last N hours
since_minutes (float, optional) – Include entries from last N minutes
start_date (str or datetime, optional) – Start date for filtering (‘YYYY-MM-DD’)
end_date (str or datetime, optional) – End date for filtering (‘YYYY-MM-DD’)
pattern (str, optional) – Regex pattern for message content
logger_pattern (str, optional) – Regex pattern for logger names
case_sensitive (bool, optional) – Whether pattern matching is case sensitive
reverse (bool, optional) – Return in reverse chronological order

Returns:

List of filtered log entries

Return type:

List[LogEntry]

to_scenario_list(entries=None, **filter_kwargs)

Convert log entries to EDSL scenarios.

Parameters:

entries (List[LogEntry], optional) – Pre-filtered entries to convert
filter_kwargs – Arguments for get_filtered_entries()

Returns:

ScenarioList with log data

Return type:

ScenarioList

analyze_patterns(**filter_kwargs)

Analyze log patterns and extract insights.

Parameters:: filter_kwargs – Arguments for get_filtered_entries()
Returns:: Dictionary with pattern analysis
Return type:: Dict[str, Any]

get_stats(entries=None)

Get statistics about log entries.

Parameters:: entries (List[LogEntry], optional) – Specific entries to analyze
Returns:: Statistics dictionary
Return type:: Dict[str, Any]

export_filtered_logs(output_path, **filter_kwargs)

Export filtered log entries to file.

Parameters:

output_path (Path) – Path for output file
filter_kwargs – Arguments for get_filtered_entries()

Returns:

Number of entries exported

Return type:

int

archive(archive_path=None, clear_after_archive=True, compress=True, confirm=False)

Archive log file with timestamp.

Parameters:

archive_path (Path, optional) – Directory for archive (default: ~/.edsl/archives/)
clear_after_archive (bool) – Clear original after archiving
compress (bool) – Gzip compress the archive
confirm (bool) – Skip confirmation prompt

Returns:

Path to created archive

Return type:

Path or None

clear(confirm=False)

Clear all log entries.

Parameters:: confirm (bool) – Skip confirmation prompt
Returns:: Success status
Return type:: bool

LogEntry Class

class LogEntry

Represents a parsed log entry.

Parameters:

timestamp (datetime) – When the log entry was created
logger_name (str) – Name of the logger
level (str) – Log level (DEBUG, INFO, WARNING, ERROR, CRITICAL)
message (str) – Log message content
raw_line (str) – Original log line
level_int (int) – Numeric level for comparisons

Best Practices

Safety Guidelines

Always use confirmation prompts for destructive operations (default behavior)
Archive logs before clearing in production environments
Use specific filters to avoid processing unnecessary log entries
Test complex regex patterns with small datasets first

Performance Tips

Use n parameter to limit results when exploring large logs
Combine multiple filters in single calls rather than chaining
Cache LogManager instances when doing multiple operations
Consider archiving very large log files before analysis

Common Patterns

Error Investigation:

Filter recent errors: get_filtered_entries(level='ERROR', since_hours=24)
Convert to scenarios for analysis: to_scenario_list()
Use EDSL questions to categorize and analyze
Export findings or share via Coop

Regular Maintenance:

Weekly archive: archive() (includes clearing)
Monitor critical issues: get_filtered_entries(level='CRITICAL')
Pattern analysis: analyze_patterns() for trends

Debugging Workflows:

Filter by component: logger_pattern='component_name'
Time-bound investigation: since_hours=X for specific incidents
Pattern matching: pattern='specific_error_text'
Export for sharing: export_filtered_logs()

Troubleshooting

Common Issues

Log file not found:

Check the log file path. EDSL logs are typically at ~/.edsl/logs/edsl.log

No entries returned:

Verify log file has content: log_manager.get_stats()
Check filter criteria are not too restrictive
Ensure date ranges are correct

Performance issues with large logs:

Use n parameter to limit results
Consider archiving old entries
Filter by time ranges first, then other criteria

Permission errors:

Ensure read access to log file
For archive/clear operations, ensure write access to log directory

Getting Help

View available commands: print(log_manager)
Check method documentation: help(log_manager.method_name)
View log overview: log_manager.get_stats()
Test filters with small limits: get_filtered_entries(n=10, ...)

The LogManager provides powerful capabilities for understanding, analyzing, and maintaining your EDSL logs. Whether you’re debugging issues, monitoring system health, or conducting research on usage patterns, LogManager offers the tools you need for effective log management.

Log Management

Quick Start

Core Features

Log Filtering

Scenario Conversion for Analysis

Using Log Scenarios with EDSL

Pattern Analysis

Statistics and Reporting

Export and Sharing

Log Maintenance

Archive Logs

Clear Logs

Advanced Usage

Custom Log Files

Filtering Best Practices

Performance Optimization

Integration Examples

Debugging Workflows

Monitoring Automation

Research and Analysis

Jupyter Notebook Integration

API Reference

LogManager Class

LogEntry Class

Best Practices

Safety Guidelines

Performance Tips

Common Patterns

Troubleshooting

Common Issues

Getting Help