Ferrum

A modular, open-source FORTRAN tooling ecosystem in Rust. Ferrum provides a collection of small, composable libraries that form the foundation for FORTRAN analysis, refactoring, and modernization tools.

✨ NEW: Dead Code Analysis

Ferrum now includes a production-ready dead code analyzer that identifies:

🗑️ Unused variables and parameters
☠️ Dead/unreachable subroutines and functions
📊 Comprehensive cleanup recommendations with statistics

Perfect for refactoring large legacy FORTRAN codebases in aerospace, physics simulations, and scientific computing.

🎯 Goals

Modular: Small, focused crates that work together
Fast: Built with Rust for performance
Composable: Use what you need, combine as needed
Open Source: MIT licensed
Production Ready: Comprehensive error handling and testing

📦 Crates

fortran-lexer

Fast, modular lexer for FORTRAN source code supporting both fixed-format and free-format FORTRAN.

Features:

✅ Free-format FORTRAN lexing (FORTRAN 90+)
✅ Fixed-format FORTRAN lexing (FORTRAN 77 and earlier)
✅ Automatic format detection (fixed vs free format)
✅ Column-based parsing for fixed-format (labels, continuation, code sections)
✅ Comment line handling with any characters (c, C, *, !)
✅ Case-insensitive keyword recognition
✅ Comprehensive token types (keywords, identifiers, literals, operators, punctuation)
✅ Source location tracking (line, column, span)
✅ Error reporting with precise location information

Status: ✅ Production ready with full FORTRAN 77 support

📖 Documentation | Examples

fortran-ast

Abstract Syntax Tree (AST) data structures and analysis infrastructure for FORTRAN programs.

Features:

✅ Complete AST representation of FORTRAN program units (PROGRAM, SUBROUTINE, FUNCTION, MODULE)
✅ Declaration structures (variable declarations, type specifications, attributes)
✅ Expression trees (arithmetic, logical, comparison, function calls)
✅ Statement structures (IF, DO, SELECT CASE, I/O statements, etc.)
✅ Source span tracking for all nodes
✅ Visitor pattern for AST traversal
✅ Symbol table system for tracking definitions and usage
✅ Call graph builder for procedure dependency analysis
✅ AnalysisVisitor trait for extensible static analysis
✅ Comprehensive scope management (global, program, subroutine, function)
✅ Optional serialization support (serde)

Status: ✅ Production ready with analysis infrastructure

📖 Documentation

fortran-parser

Recursive descent parser that converts tokens into a structured AST with full FORTRAN 77 support.

Features:

✅ Automatic format detection and parsing (fixed-format and free-format)
✅ Parses FORTRAN program units (PROGRAM, SUBROUTINE, FUNCTION, MODULE)
✅ Parses declarations (variable declarations, type specifications, attributes)
✅ Parses executable statements (IF, DO, READ, WRITE, PRINT, RETURN, STOP, etc.)
✅ Parses expressions (arithmetic, logical, comparison, function calls)
✅ Error reporting with location information
✅ Handles whitespace and comments gracefully
✅ Legacy FORTRAN 77 support with fixed-format parsing
✅ Real-world compatibility with production numerical libraries

Status: ✅ Production ready with legacy FORTRAN support

📖 Documentation | Examples

ferrum-fmt

Auto-formatter for FORTRAN source code (like rustfmt or black).

Features:

✅ Configurable indentation (spaces/tabs, width)
✅ Keyword case normalization (UPPER, lower, preserve)
✅ Identifier case normalization
✅ Spacing around operators
✅ Column alignment for declarations
✅ Line length enforcement
✅ Comment preservation
✅ FORTRAN 77 and modern style presets

Status: ✅ Core functionality ready

📖 Documentation | Examples

🚀 Quick Start

Installation

# Clone the repository
git clone https://github.com/MarsZDF/ferrum.git
cd ferrum

# Build all crates
cargo build --all

Using the Lexer

use fortran_lexer::{tokenize, detect_format};

// Free-format FORTRAN (90+)
let modern_source = r#"
program hello_world
    implicit none
    print *, 'Hello, World!'
end program hello_world
"#;

// Fixed-format FORTRAN (77 and earlier)
let legacy_source = r#"
      SUBROUTINE HELLO
      PRINT *, 'Hello from FORTRAN 77!'
      END
"#;

// Automatic format detection and tokenization
let format = detect_format(modern_source);
let tokens = tokenize(modern_source, format)?;
for token in tokens {
    if !token.is_trivial() {
        println!("{:?} at line {}:{}", token.token_type, token.line, token.column);
    }
}

Dead Code Analysis

use fortran_parser::parse;
use fortran_ast::AnalysisVisitor;

// Legacy FORTRAN with dead code
let fortran_code = r#"
      SUBROUTINE MAIN  
      INTEGER USED_VAR, UNUSED_VAR
      REAL ANOTHER_UNUSED
      USED_VAR = 42
      CALL USED_SUBROUTINE(USED_VAR)
      END

      SUBROUTINE USED_SUBROUTINE(X)
      INTEGER X
      PRINT *, X
      END
      
      SUBROUTINE DEAD_SUBROUTINE
      INTEGER DEAD_VAR
      DEAD_VAR = 99
      END
"#;

let program = parse(fortran_code)?;
let mut analyzer = DeadCodeAnalyzer::new();
analyzer.analyze_program(&program);

// Outputs detailed analysis:
// 🗑️  UNUSED VARIABLES: UNUSED_VAR, ANOTHER_UNUSED
// ☠️  DEAD PROCEDURES: DEAD_SUBROUTINE  
// 📊 Dead code percentage: 50.0%
println!("{}", analyzer.generate_report());

Running Examples

# Run the lexer example
cd fortran-lexer && cargo run --example basic_tokenize

# Modernization analysis tools
cd fortran-parser && cargo run --example extract_signature your_fortran_file.f
cd fortran-parser && cargo run --example extract_docs your_fortran_file.f
cd fortran-parser && cargo run --example type_mapper your_fortran_file.f

# Convert fixed-format to free-format
cd fortran-parser && cargo run --example fixed_to_free legacy.f modern.f90

# Dead code analysis for legacy FORTRAN cleanup  
cd fortran-parser && cargo run --example dead_code_analyzer your_fortran_file.f

# Parse any FORTRAN file (auto-detects format)
cd fortran-parser && cargo run your_fortran_file.f

🏗️ Architecture

┌─────────────────────┐
│  FORTRAN Source     │
│  (.f, .f90, .f95)   │
└──────────┬──────────┘
           │
           v
┌─────────────────────┐
│  fortran-lexer      │ Tokenizes source code
│                     │ (fixed-format ✅, free-format ✅)
└──────────┬──────────┘
           │
           v
┌─────────────────────┐
│  fortran-parser     │ Parses tokens into AST
│                     │ (recursive descent)
└──────────┬──────────┘
           │
           v
┌─────────────────────┐
│   fortran-ast       │ AST + Symbol Tables + Call Graphs
│                     │ (visitor pattern + analysis)
└──────────┬──────────┘
           │
           v
┌─────────────────────┐
│   Analysis Tools    │ Dead code detection ✅
│                     │ Performance hints, migration tools
└─────────────────────┘

🧪 Development

Prerequisites

Rust 1.70+ (stable, beta, or nightly)
Cargo (comes with Rust)

Building

# Build all crates
cargo build --all

# Build a specific crate
cd fortran-lexer && cargo build

Testing

# Run all tests
cargo test --all

# Run tests for a specific crate
cd fortran-lexer && cargo test

# Run with output
cargo test --all -- --nocapture

Running Examples

# Run lexer example
cd fortran-lexer && cargo run --example basic_tokenize

Linting and Formatting

# Format code
cargo fmt --all

# Run clippy
cargo clippy --all -- -D warnings

🚦 CI/CD

We use GitHub Actions for continuous integration:

✅ Tests on stable, beta, and nightly Rust
✅ Tests on Linux, Windows, and macOS
✅ Linting with clippy and rustfmt
✅ Builds examples and documentation
✅ All crates tested in the pipeline

See .github/workflows/ci.yml for details.

📝 Contributing

Contributions are welcome! This project follows standard Rust conventions:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes
Add tests for new functionality
Ensure all tests pass (cargo test --all)
Run clippy and fix warnings (cargo clippy --all)
Format code (cargo fmt --all)
Update documentation as needed
Submit a pull request

Development Guidelines

Follow Rust naming conventions
Write comprehensive tests
Document public APIs with rustdoc
Handle errors explicitly (use Result types)
Keep crates focused and modular
Use workspace dependencies where appropriate

📄 License

Licensed under the MIT License - see LICENSE for details.

🗺️ Roadmap

Completed ✅

In Progress 🚧

fortran-parser - Full FORTRAN grammar support (remaining statements and expressions)

Planned 📋

fortran-analyzer-* - Additional analysis modules
- Performance analysis hints
- Code quality metrics
- Automated migration from fixed-format to free-format
- Modernization suggestions
Language server support (LSP)
Enhanced formatter with more configuration options
Refactoring tools
REPL for FORTRAN exploration

🤝 Acknowledgments

This project aims to modernize FORTRAN tooling using Rust's excellent performance and safety guarantees. Special thanks to:

The Rust community for excellent tooling and documentation
FORTRAN maintainers for keeping scientific computing systems running
Contributors and users of this project

📚 Additional Resources

💬 Community

Issues: GitHub Issues
Discussions: GitHub Discussions
Pull Requests: GitHub Pull Requests

🤖 AI Assistant Guide

This section is designed to help AI coding assistants (like Cursor, Claude Code, GitHub Copilot) quickly understand and work with the Ferrum codebase.

Project Structure

ferrum/
├── fortran-lexer/          # Tokenization layer
│   ├── src/
│   │   ├── lib.rs          # Main exports
│   │   ├── token.rs        # Token types and Token struct
│   │   ├── lexer.rs        # Lexer implementation (FreeFormatLexer)
│   │   └── error.rs        # LexError types
│   ├── tests/
│   └── examples/
├── fortran-ast/            # AST data structures + analysis
│   ├── src/
│   │   ├── lib.rs          # Main exports
│   │   ├── program.rs      # Program, ProgramUnit, MainProgram, etc.
│   │   ├── declaration.rs  # Declarations, TypeSpec, Attributes
│   │   ├── statement.rs    # Statements (IF, DO, SELECT CASE, etc.)
│   │   ├── expression.rs   # Expressions (arithmetic, logical, calls)
│   │   ├── span.rs         # Source location tracking
│   │   ├── visitor.rs      # Visitor pattern + AnalysisVisitor trait
│   │   └── analysis.rs     # Symbol tables, call graphs, analysis infra
│   └── tests/
├── fortran-parser/         # Parsing layer
│   ├── src/
│   │   ├── lib.rs          # Main exports and parse() function
│   │   └── parser.rs       # Recursive descent parser (large file ~1500 lines)
│   ├── tests/
│   │   └── integration_tests.rs  # Comprehensive test suite
│   └── examples/
│       └── dead_code_analyzer.rs # Production dead code analysis tool
└── Cargo.toml              # Workspace configuration

Key Architectural Patterns

Tokenization → Parsing → AST: Source code flows through lexer → parser → AST
Recursive Descent Parsing: The parser uses recursive descent with precedence climbing
Zero-Copy Where Possible: Uses &str references instead of owned Strings
Source Location Tracking: All AST nodes are wrapped in Spanned<T> for error reporting
Error Handling: Custom error types (LexError, ParseError) with precise location info

Common Tasks and Locations

Adding a New FORTRAN Keyword

Lexer: Add to KEYWORDS constant in fortran-lexer/src/token.rs
Token Type: Add variant to TokenType enum in fortran-lexer/src/token.rs
Parser: Add handling in fortran-parser/src/parser.rs (usually in parse_statement_opt() or similar)
AST: Add corresponding AST structure if needed (in fortran-ast/src/statement.rs or declaration.rs)

Adding a New Statement Type

AST: Define in fortran-ast/src/statement.rs (add to Statement enum)
Parser: Add parsing method in fortran-parser/src/parser.rs:
- Add detection in is_executable_statement() if needed
- Add case in parse_statement_opt()
- Implement parse_<statement_type>() method
Tests: Add test in fortran-parser/tests/integration_tests.rs

Adding a New Expression Operator

Lexer: Add operator to TokenType in fortran-lexer/src/token.rs
AST: Add to BinaryOp or UnaryOp in fortran-ast/src/expression.rs
Parser:
- Add to parse_binary_operator() or parse_unary_operator()
- Add precedence in get_operator_precedence()
- Handle in expression parsing logic

Fixing a Parsing Bug

Identify: Run cargo test --package fortran-parser --test integration_tests to see failing tests
Locate: Find the relevant parsing method in fortran-parser/src/parser.rs
Common Issues:
- Token position tracking: peek(), advance(), self.current management
- Whitespace handling: Use is_trivial() to skip whitespace/comments
- Expression precedence: Check parse_binary_expression() and precedence values
- Statement boundaries: Check is_executable_statement() detection logic

Parser Implementation Details

The parser (fortran-parser/src/parser.rs) is a large recursive descent parser:

Main entry point: parse() function creates Parser and calls parse_program()
Token management:
- peek() - get next non-trivial token (skips whitespace/comments)
- advance() - consume current token and return next non-trivial one
- check_token() - check if current token matches expected type
- self.current - index into self.tokens vector
Common patterns:
- parse_*_opt() methods return Option (for optional constructs)
- parse_*() methods return Result (for required constructs)
- Use Spanned::new() to wrap AST nodes with source location
- Use self.create_span() for span creation

Error Handling Conventions

Lexer errors: LexError with line/column info
Parser errors: ParseError enum with variants:
- UnexpectedToken { expected, found }
- UnexpectedEof { expected }
- InvalidSyntax { message, line, column }
Always include expected tokens and found token in error messages
Use self.current_line() and self.current_column() for error locations

Testing Approach

Integration tests: fortran-parser/tests/integration_tests.rs - comprehensive test suite
Unit tests: Inline #[cfg(test)] modules in source files
Test naming: test_parse_<feature> for parser tests

Test structure:

#[test]
fn test_parse_feature() {
    let source = r#"FORTRAN code here"#;
    let result = parse(source);
    assert!(result.is_ok(), "Failed to parse: {:?}", result.err());
    // ... assertions
}

Fixed-Format FORTRAN Support

Status: ✅ COMPLETE - Full FORTRAN 77 fixed-format support implemented

Implementation Details:

✅ FixedFormatLexer struct in fortran-lexer/src/lexer.rs
✅ Column-based parsing:
- Columns 1-5: Statement label (optional)
- Column 6: Continuation indicator (space/0 = new statement, other = continuation)
- Columns 7-72: FORTRAN code
- Columns 73-80: Comments/sequence numbers (ignored)
- Column 1 = C, c, *, !: Comment line
✅ Automatic format detection and parser integration
✅ Real-world compatibility with legacy numerical libraries

Usage:

use fortran_lexer::{tokenize, detect_format};

let legacy_fortran = r#"
c This is a comment
      SUBROUTINE HELLO
      PRINT *, 'Hello World'
      END
"#;

let format = detect_format(legacy_fortran); // Returns FixedFormat
let tokens = tokenize(legacy_fortran, format)?; // Works seamlessly

Code Style and Conventions

Error handling: Prefer Result<T, E> over panics
Ownership: Use references (&str) where possible, clone only when necessary
Documentation: Public APIs should have rustdoc comments
Naming: Follow Rust conventions (snake_case for functions, PascalCase for types)
Imports: Group by: std, external crates, workspace crates, local modules

Debugging Tips

Add debug prints: eprintln!("Current token: {:?}", self.peek());
Check token stream: eprintln!("Tokens: {:?}", self.tokens);
Verify position: eprintln!("Current index: {}", self.current);
Test incrementally: Run cargo test --package fortran-parser --test integration_tests <test_name> for specific tests
Use backtrace: RUST_BACKTRACE=1 cargo test ...

Quick Reference: File Locations

Token definitions: fortran-lexer/src/token.rs
Lexer logic: fortran-lexer/src/lexer.rs
AST definitions: fortran-ast/src/*.rs
Analysis infrastructure: fortran-ast/src/analysis.rs
Parser logic: fortran-parser/src/parser.rs (main file)
Dead code analyzer: fortran-parser/examples/dead_code_analyzer.rs
Parser tests: fortran-parser/tests/integration_tests.rs
Workspace config: Cargo.toml (root)
CI/CD: .github/workflows/ci.yml

When Adding New Features

Start with tests: Write a failing test first
Update AST: Add necessary data structures
Update lexer: Add token types if needed
Update parser: Implement parsing logic
Run tests: cargo test --all
Update docs: Add examples and documentation
Run linter: cargo clippy --all -- -D warnings

Built with ❤️ in Rust

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github/workflows		.github/workflows
ferrum-fmt		ferrum-fmt
fortran-ast		fortran-ast
fortran-lexer		fortran-lexer
fortran-parser		fortran-parser
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.toml		Cargo.toml
LICENSE		LICENSE
PRODUCTION_READINESS.md		PRODUCTION_READINESS.md
README.md		README.md

License

MarsZDF/ferrum

Folders and files

Latest commit

History

Repository files navigation

Ferrum

✨ NEW: Dead Code Analysis

🎯 Goals

📦 Crates

fortran-lexer

fortran-ast

fortran-parser

ferrum-fmt

🚀 Quick Start

Installation

Using the Lexer

Dead Code Analysis

Running Examples

🏗️ Architecture

🧪 Development

Prerequisites

Building

Testing

Running Examples

Linting and Formatting

🚦 CI/CD

📝 Contributing

Development Guidelines

📄 License

🗺️ Roadmap

Completed ✅

In Progress 🚧

Planned 📋

🤝 Acknowledgments

📚 Additional Resources

💬 Community

🤖 AI Assistant Guide

Project Structure

Key Architectural Patterns

Common Tasks and Locations

Adding a New FORTRAN Keyword

Adding a New Statement Type

Adding a New Expression Operator

Fixing a Parsing Bug

Parser Implementation Details

Error Handling Conventions

Testing Approach

Fixed-Format FORTRAN Support

Code Style and Conventions

Debugging Tips

Quick Reference: File Locations

When Adding New Features

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages