
Commit e51afec

CodingAnarchy and claude committed
feat: Add comprehensive Job Tracing & Correlation system
- Add distributed tracing fields to Job struct (trace_id, correlation_id, parent_span_id, span_context)
- Create database migration 009_add_tracing for PostgreSQL and MySQL with optimized indexes
- Add job builder methods for tracing configuration (.with_trace_id(), .with_correlation_id(), etc.)
- Implement OpenTelemetry integration with OTLP export support (feature-gated)
- Add worker event hooks for job lifecycle monitoring (on_job_start, on_job_complete, etc.)
- Create comprehensive tracing.rs module with TraceId, CorrelationId, and TracingConfig types
- Add automatic span creation and trace context propagation for job processing
- Update documentation with tracing examples and feature flag information
- Maintain backward compatibility - existing jobs work unchanged
- Add 22 new tests for complete tracing functionality coverage

This enables production-ready distributed tracing for debugging and monitoring job processing across service boundaries, with integration to observability platforms like Jaeger, Zipkin, and DataDog.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>
1 parent a6490ad commit e51afec

File tree: 14 files changed (+2048, −74 lines)

CHANGELOG.md

Lines changed: 73 additions & 0 deletions
@@ -5,6 +5,79 @@ All notable changes to this project will be documented in this file.
 
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+## [1.2.0] - 2025-06-29
+
+### Added
+- **🔍 Job Tracing & Correlation** - Comprehensive distributed tracing system for production observability
+  - **Core Tracing Fields**: Added `trace_id`, `correlation_id`, `parent_span_id`, and `span_context` fields to Job struct
+  - **Database Migrations**: New migration 009_add_tracing for PostgreSQL and MySQL with optimized indexes for trace/correlation ID lookups
+  - **Job Builder Methods**: Added `.with_trace_id()`, `.with_correlation_id()`, `.with_parent_span_id()`, and `.with_span_context()` for easy job tracing configuration
+  - **TraceId and CorrelationId Types**: New strongly-typed identifiers with generation, conversion, and validation methods
+  - **OpenTelemetry Integration**: Feature-gated OpenTelemetry support with OTLP export to Jaeger, Zipkin, DataDog, etc.
+  - **TracingConfig**: Complete OpenTelemetry configuration with service metadata, resource attributes, and endpoint configuration
+  - **Automatic Span Creation**: `create_job_span()` function creates spans with rich job metadata and trace context propagation
+  - **Span Context Management**: `set_job_trace_context()` for extracting and storing trace context from OpenTelemetry spans
+
+- **🎯 Worker Event Hooks** - Lifecycle event system for custom tracing and monitoring integration
+  - **JobHookEvent**: Event data structure with job metadata, timestamps, duration, and error information
+  - **JobEventHooks**: Configurable lifecycle callbacks for job start, completion, failure, timeout, and retry events
+  - **Builder Pattern**: Convenient `.on_job_start()`, `.on_job_complete()`, `.on_job_fail()`, `.on_job_timeout()`, `.on_job_retry()` methods
+  - **Worker Integration**: Event hooks integrated into job processing pipeline with automatic event firing
+  - **Automatic Span Management**: OpenTelemetry spans automatically created and updated throughout job lifecycle
+
+- **⚡ Production-Ready Tracing Infrastructure**
+  - **Feature Gated**: All tracing functionality behind optional `tracing` feature flag for minimal overhead
+  - **Backward Compatible**: Existing jobs and workers continue working unchanged
+  - **Database Optimized**: Indexed trace and correlation ID columns for efficient querying
+  - **OpenTelemetry Standards**: Full OTLP support with configurable exporters and sampling
+  - **Span Attributes**: Rich span metadata including job ID, queue name, priority, status, and custom business data
+  - **Error Tracking**: Automatic span status updates for success, failure, and timeout scenarios
+
+- **🧪 Comprehensive Testing** - 177 total tests including 22 new tracing-specific tests
+  - **Unit Tests**: Complete coverage of TraceId, CorrelationId, TracingConfig, and span creation functionality
+  - **Integration Tests**: Event hook testing with realistic job processing scenarios
+  - **Feature Testing**: Validation of tracing feature flag behavior and optional inclusion
+  - **Span Testing**: OpenTelemetry span creation and attribute validation
+
+### Enhanced
+- **📖 Documentation Updates**
+  - **README.md**: Added Job Tracing & Correlation feature to main features list with comprehensive example
+  - **Installation Guide**: Updated to show tracing feature installation options
+  - **Tracing Example**: Complete OpenTelemetry setup example with worker event hooks and correlation tracking
+  - **Feature Flags**: Updated to include `tracing` (optional) feature flag documentation
+  - **Database Schema**: Updated schema documentation to mention distributed tracing fields
+
+- **🗺️ ROADMAP.md**: Removed completed "Job Tracing & Correlation" feature from Phase 1 priorities
+
+### Technical Implementation
+- **OpenTelemetry Dependencies**: Added feature-gated dependencies: `opentelemetry`, `opentelemetry_sdk`, `opentelemetry-otlp`, `tracing-opentelemetry`
+- **Async Integration**: Full async/await support with tokio runtime integration
+- **Memory Efficient**: Trace context stored as optional strings with minimal memory overhead
+- **Type Safety**: Strongly typed trace and correlation IDs with comprehensive validation
+- **Database Agnostic**: Tracing works identically across PostgreSQL and MySQL backends
+- **Export Support**: OTLP export to all major observability platforms (Jaeger, Zipkin, DataDog, New Relic, etc.)
+
+### Usage Example
+```rust
+// Initialize tracing
+let config = TracingConfig::new()
+    .with_service_name("job-processor")
+    .with_otlp_endpoint("http://jaeger:4317");
+init_tracing(config).await?;
+
+// Create traced jobs
+let job = Job::new("email_queue".to_string(), json!({"to": "[email protected]"}))
+    .with_trace_id("trace-123")
+    .with_correlation_id("order-456");
+
+// Worker with event hooks
+let worker = Worker::new(queue, "email_queue".to_string(), handler)
+    .on_job_start(|event| { /* custom tracing logic */ })
+    .on_job_complete(|event| { /* success tracking */ });
+```
+
+This release provides comprehensive distributed tracing capabilities essential for debugging and monitoring job processing in production distributed systems.
+
 ## [1.1.0] - 2025-06-29
 
 ### Added
Cargo.toml

Lines changed: 7 additions & 2 deletions
@@ -8,7 +8,7 @@ members = [
 resolver = "2"
 
 [workspace.package]
-version = "1.1.0"
+version = "1.2.0"
 edition = "2024"
 license = "MIT"
 repository = "https://github.com/CodingAnarchy/hammerwork"
@@ -18,7 +18,7 @@ documentation = "https://docs.rs/hammerwork"
 rust-version = "1.86"
 
 [workspace.dependencies]
-hammerwork = { version = "1.0.0", path = "." }
+hammerwork = { version = "1.2.0", path = "." }
 tokio = { version = "1.0", features = ["full"] }
 sqlx = { version = "0.8", features = ["runtime-tokio-rustls", "chrono", "uuid", "json"] }
 chrono = { version = "0.4", features = ["serde"] }
@@ -77,13 +77,18 @@ reqwest = { version = "0.12", features = ["json"], optional = true }
 warp = { version = "0.3", optional = true }
 clap = { workspace = true }
 tracing-subscriber = { workspace = true }
+opentelemetry = { version = "0.22", optional = true }
+opentelemetry_sdk = { version = "0.22", features = ["rt-tokio"], optional = true }
+opentelemetry-otlp = { version = "0.15", features = ["tokio"], optional = true }
+tracing-opentelemetry = { version = "0.23", optional = true }
 
 [features]
 default = ["metrics", "alerting"]
 postgres = ["sqlx/postgres"]
 mysql = ["sqlx/mysql"]
 metrics = ["prometheus", "warp"]
 alerting = ["reqwest"]
+tracing = ["opentelemetry", "opentelemetry_sdk", "opentelemetry-otlp", "tracing-opentelemetry"]
 
 [dev-dependencies]
 tokio-test = { workspace = true }

README.md

Lines changed: 89 additions & 6 deletions
@@ -4,6 +4,7 @@ A high-performance, database-driven job queue for Rust with comprehensive featur
 
 ## Features
 
+- **🔍 Job Tracing & Correlation**: Comprehensive distributed tracing with OpenTelemetry integration, trace IDs, correlation IDs, and lifecycle event hooks
 - **🔗 Job Dependencies & Workflows**: Create complex data processing pipelines with job dependencies, sequential chains, and parallel processing with synchronization barriers
 - **Multi-database support**: PostgreSQL and MySQL backends with optimized dependency queries
 - **Advanced retry strategies**: Exponential backoff, linear, Fibonacci, and custom retry patterns with jitter
@@ -24,15 +25,18 @@ A high-performance, database-driven job queue for Rust with comprehensive featur
 ```toml
 [dependencies]
 # Default features include metrics and alerting
-hammerwork = { version = "1.0", features = ["postgres"] }
+hammerwork = { version = "1.2", features = ["postgres"] }
 # or
-hammerwork = { version = "1.0", features = ["mysql"] }
+hammerwork = { version = "1.2", features = ["mysql"] }
+
+# With distributed tracing
+hammerwork = { version = "1.2", features = ["postgres", "tracing"] }
 
 # Minimal installation
-hammerwork = { version = "1.0", features = ["postgres"], default-features = false }
+hammerwork = { version = "1.2", features = ["postgres"], default-features = false }
 ```
 
-**Feature Flags**: `postgres`, `mysql`, `metrics` (default), `alerting` (default)
+**Feature Flags**: `postgres`, `mysql`, `metrics` (default), `alerting` (default), `tracing` (optional)
 
 ## Quick Start
 
@@ -41,6 +45,7 @@ See the [Quick Start Guide](docs/quick-start.md) for complete examples with Post
 ## Documentation
 
 - **[Quick Start Guide](docs/quick-start.md)** - Get started with PostgreSQL and MySQL
+- **[Job Tracing & Correlation](docs/tracing.md)** - Distributed tracing, correlation IDs, and OpenTelemetry integration
 - **[Job Dependencies & Workflows](docs/workflows.md)** - Complex pipelines, job dependencies, and orchestration
 - **[Job Types & Configuration](docs/job-types.md)** - Job creation, priorities, timeouts, cron jobs
 - **[Worker Configuration](docs/worker-configuration.md)** - Worker setup, rate limiting, statistics
@@ -126,6 +131,83 @@ queue.enqueue_workflow(workflow).await?;
 
 Jobs will only execute when their dependencies are satisfied, enabling sophisticated data processing pipelines and business workflows.
 
+## Tracing Example
+
+Enable comprehensive distributed tracing with OpenTelemetry integration:
+
+```rust
+use hammerwork::{Job, JobQueue, Worker, tracing::{TracingConfig, init_tracing}, queue::DatabaseQueue};
+use serde_json::json;
+use std::sync::Arc;
+
+#[tokio::main]
+async fn main() -> Result<(), Box<dyn std::error::Error>> {
+    // Initialize distributed tracing
+    let tracing_config = TracingConfig::new()
+        .with_service_name("job-processor")
+        .with_service_version("1.0.0")
+        .with_environment("production")
+        .with_otlp_endpoint("http://jaeger:4317");
+
+    init_tracing(tracing_config).await?;
+
+    let pool = sqlx::PgPool::connect("postgresql://localhost/hammerwork").await?;
+    let queue = Arc::new(JobQueue::new(pool));
+
+    // Create traced jobs with correlation for business workflows
+    let trace_id = "trace-12345";
+    let correlation_id = "order-67890";
+
+    let payment_job = Job::new("payment_queue".to_string(), json!({
+        "order_id": "67890",
+        "amount": 299.99
+    }))
+    .with_trace_id(trace_id)
+    .with_correlation_id(correlation_id);
+
+    let email_job = Job::new("email_queue".to_string(), json!({
+        "order_id": "67890",
+        "template": "order_confirmation"
+    }))
+    .with_trace_id(trace_id)
+    .with_correlation_id(correlation_id)
+    .depends_on(&payment_job.id);
+
+    // Worker with lifecycle event hooks for observability
+    let handler = Arc::new(|job: Job| Box::pin(async move {
+        println!("Processing: {:?}", job.payload);
+        // Your business logic here
+        Ok(())
+    }));
+
+    let worker = Worker::new(queue.clone(), "payment_queue".to_string(), handler)
+        .on_job_start(|event| {
+            println!("Job {} started (trace: {}, correlation: {})",
+                event.job.id,
+                event.job.trace_id.unwrap_or_default(),
+                event.job.correlation_id.unwrap_or_default());
+        })
+        .on_job_complete(|event| {
+            println!("Job {} completed in {:?}",
+                event.job.id,
+                event.duration.unwrap_or_default());
+        })
+        .on_job_fail(|event| {
+            eprintln!("Job {} failed: {}",
+                event.job.id,
+                event.error.unwrap_or_default());
+        });
+
+    // Enqueue jobs - they'll be automatically traced
+    queue.enqueue(payment_job).await?;
+    queue.enqueue(email_job).await?;
+
+    Ok(())
+}
+```
+
+This enables end-to-end tracing across your entire job processing pipeline with automatic span creation, correlation tracking, and integration with observability platforms like Jaeger, Zipkin, or DataDog.
+
 ## Database Setup
 
 ### Using Migrations (Recommended)
@@ -160,12 +242,12 @@ queue.enqueue(job).await?;
 ### Database Schema
 
 Hammerwork uses optimized tables with comprehensive indexing:
-- **`hammerwork_jobs`** - Main job table with priorities, timeouts, cron scheduling, retry strategies, and result storage
+- **`hammerwork_jobs`** - Main job table with priorities, timeouts, cron scheduling, retry strategies, result storage, and distributed tracing fields
 - **`hammerwork_batches`** - Batch metadata and tracking (v0.7.0+)
 - **`hammerwork_job_results`** - Job result storage with TTL and expiration (v0.8.0+)
 - **`hammerwork_migrations`** - Migration tracking for schema evolution
 
-The schema supports all features including job prioritization, advanced retry strategies, timeouts, cron scheduling, batch processing, result storage with TTL, worker autoscaling, and comprehensive lifecycle tracking. See [Database Migrations](docs/migrations.md) for details.
+The schema supports all features including job prioritization, advanced retry strategies, timeouts, cron scheduling, batch processing, result storage with TTL, distributed tracing with trace/correlation IDs, worker autoscaling, and comprehensive lifecycle tracking. See [Database Migrations](docs/migrations.md) for details.
 
 ## Development
 
@@ -194,6 +276,7 @@ Working examples in `examples/`:
 - `retry_strategies.rs` - Advanced retry patterns with exponential backoff and jitter
 - `result_storage_example.rs` - Job result storage and retrieval
 - `autoscaling_example.rs` - Dynamic worker pool scaling based on queue depth
+- `tracing_example.rs` - Distributed tracing with OpenTelemetry and event hooks
 
 ```bash
 cargo run --example postgres_example --features postgres

ROADMAP.md

Lines changed: 1 addition & 53 deletions
@@ -2,60 +2,9 @@
 
 This roadmap outlines planned features for Hammerwork, prioritized by impact level and implementation complexity. Features are organized into phases based on their value proposition to users and estimated development effort.
 
-## ✅ Completed Features
-
-### 🔗 Job Dependencies & Workflows
-**Impact: Very High** | **Complexity: High** | **Status: ✅ COMPLETED**
-
-**Game-changing feature for complex data processing pipelines and business workflows.**
-
-**Core Implementation Complete:**
-- Job dependency tracking with `depends_on()` and `depends_on_jobs()` methods
-- `JobGroup` workflow orchestration with sequential and parallel job execution
-- Dependency graph validation with cycle detection
-- Database schema with dependency fields for PostgreSQL and MySQL
-- Dependency-aware job polling (only executes jobs with satisfied dependencies)
-- Failure policy configuration (`FailFast`, `ContinueOnFailure`, `Manual`)
-
-```rust
-// Sequential job chains
-let job1 = Job::new("process_data".to_string(), data1);
-let job2 = Job::new("transform_data".to_string(), data2)
-    .depends_on(&job1.id);
-let job3 = Job::new("export_data".to_string(), data3)
-    .depends_on(&job2.id);
-
-// Parallel job groups with barriers
-let job_group = JobGroup::new("data_pipeline")
-    .add_parallel_jobs(vec![job_a, job_b, job_c])
-    .then(final_job); // Runs after all parallel jobs complete
-```
-
-🚧 **Remaining Work:** Full workflow method implementations, completion triggers, and CLI integration.
-
 ## Phase 1: High Impact, Medium-High Complexity
 *Features that provide significant value but require more substantial implementation effort*
 
-### 🔍 Job Tracing & Correlation
-**Impact: High** | **Complexity: Medium-High** | **Priority: Medium-High**
-
-Essential for debugging and monitoring in distributed systems.
-
-```rust
-// Distributed tracing support
-let job = Job::new("process_order".to_string(), order_data)
-    .with_trace_id("trace-123")
-    .with_correlation_id("order-456")
-    .with_span_context(span_context);
-
-// Job lifecycle events
-worker.on_job_start(|job| tracing::info!("Job started: {}", job.id));
-worker.on_job_complete(|job, duration| {
-    metrics::histogram!("job.duration", duration, "queue" => job.queue_name);
-});
-```
-
 ### 🌐 Admin Dashboard & CLI Tools
 **Impact: High** | **Complexity: Medium-High** | **Priority: Medium**
 
@@ -240,8 +189,7 @@ Features are ordered within each phase by priority and should generally be imple
 
 **Phase 1 (Advanced Features) - CURRENT PRIORITIES**
 1. Job Dependencies & Workflows
-2. Job Tracing & Correlation
-3. Admin Dashboard & CLI Tools
+2. Admin Dashboard & CLI Tools
 
 **Phase 2 (Operational Features)**
 1. Job Archiving & Retention

src/error.rs

Lines changed: 3 additions & 0 deletions
@@ -37,6 +37,9 @@ pub enum HammerworkError {
 
     #[error("Workflow error: {message}")]
     Workflow { message: String },
+
+    #[error("Tracing error: {message}")]
+    Tracing { message: String },
 }
 
 #[cfg(test)]

0 commit comments