
STRATINT

Contact: jtdavis0492@gmail.com

An AI-powered Open Source Intelligence (OSINT) platform that continuously monitors RSS feeds, enriches data with AI analysis, and provides a real-time intelligence feed with event correlation and deduplication.

Features

🔍 Intelligent Data Pipeline

  • RSS Feed Monitoring - Track multiple news sources with configurable feed URLs
  • Simplified Architecture - Direct RSS content processing without scraping
  • AI-Powered Enrichment - OpenAI GPT-4 analysis for entity extraction and summarization
  • Event Correlation - Automatic deduplication and novel facts detection
  • Threshold-based Publishing - Configurable confidence and magnitude filters

📊 Admin Dashboard

  • Pipeline Funnel Visualization - Real-time bottleneck detection
  • Source Management - Track and configure RSS feeds
  • Event Moderation - Review and manage enriched events
  • System Monitoring - Activity logs, error tracking, and metrics
  • AI Configuration - OpenAI settings and threshold tuning

🎨 Brutalist Cyberpunk UI

  • Terminal-style event cards with real-time updates
  • Dark theme with scan line effects and glitch animations
  • Comprehensive filtering (magnitude, confidence, category, time range)
  • Responsive design with custom scrollbars

Screenshots

Signal Stream

Real-time intelligence feed with event categorization, confidence scoring, and entity extraction.


AI-Powered Forecasts

Probabilistic predictions with OHLC-style visualization showing prediction confidence over time.


Portfolio Strategies

AI-generated portfolio allocations based on emerging signals and market forecasts.


Architecture

┌─────────────────────────────────────────────────────────────────┐
│                        RSS FEED SOURCES                         │
└───────────────────────────┬─────────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────────┐
│                   1. RSS INGESTION                              │
│  • Fetch RSS feed content directly                              │
│  • Use feed descriptions as source content                      │
│  • Store to PostgreSQL with status="completed"                  │
└───────────────────────────┬─────────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────────┐
│               2. AI ENRICHMENT (OpenAI GPT-4)                   │
│  • Entity extraction (people, orgs, locations)                  │
│  • Event summarization and categorization                       │
│  • Confidence and magnitude scoring                             │
│  • Create events from RSS sources                               │
└───────────────────────────┬─────────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────────┐
│           3. EVENT CORRELATION & DEDUPLICATION                  │
│  • OpenAI-based similarity analysis                             │
│  • Merge duplicate events                                       │
│  • Detect and extract novel facts                               │
│  • Create "Additional Details" events                           │
└───────────────────────────┬─────────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────────┐
│              4. THRESHOLD FILTERING & PUBLISHING                │
│  • Configurable confidence threshold                            │
│  • Configurable magnitude threshold                             │
│  • Auto-publish qualifying events                               │
│  • Reject low-quality events                                    │
└───────────────────────────┬─────────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────────┐
│                    PUBLISHED EVENT FEED                         │
│  • REST API with filtering                                      │
│  • Real-time web interface                                      │
│  • Admin moderation dashboard                                   │
└─────────────────────────────────────────────────────────────────┘

Quick Start

Prerequisites

  • Go 1.21+ - Backend server
  • Node.js 18+ - Frontend build
  • PostgreSQL 15+ - Database

Installation

  1. Clone the repository

    git clone https://github.com/brutus-gr/STRATINT-ai.git
    cd STRATINT-ai
  2. Set up PostgreSQL

    createdb stratint
    export DATABASE_URL="postgres://user:password@localhost:5432/stratint?sslmode=disable"
  3. Run migrations

    # Migrations are auto-applied on startup. To apply manually:
    # for f in migrations/*.sql; do psql "$DATABASE_URL" -f "$f"; done
  4. Configure environment

    cp .env.example .env
    # Edit .env with your settings:
    # - DATABASE_URL
    # - OPENAI_API_KEY
    # - ADMIN_JWT_SECRET
  5. Build and run backend

    go build -o server ./cmd/server
    ./server
    # Server starts on http://localhost:8080
  6. Build and run frontend (separate terminal)

    cd web
    npm install
    npm run dev
    # Frontend runs on http://localhost:5173
  7. Access the application

    Open http://localhost:5173 in your browser. The admin dashboard is at http://localhost:5173/admin.

Using the Admin Dashboard

  1. Navigate to http://localhost:5173/admin
  2. Enter admin password
  3. Configure your first RSS source:
    • Go to "SOURCES" tab
    • Click "Add Source"
    • Enter feed URL and settings
  4. Trigger scraping:
    • Go to "SCRAPER" or "PIPELINE" tab
    • Click "Scrape Pending Sources"
  5. Monitor the pipeline:
    • Check "PIPELINE" tab for funnel visualization
    • Watch bottlenecks and conversion rates
  6. View enriched events on the public feed at http://localhost:5173

Configuration

Environment Variables

| Variable | Description | Default |
|----------|-------------|---------|
| DATABASE_URL | PostgreSQL connection string | Required |
| OPENAI_API_KEY | OpenAI API key for enrichment | Required |
| ADMIN_JWT_SECRET | Secret key for admin JWT tokens | change-this-secret |
| SERVER_PORT | HTTP server port | 8080 |
| LOG_LEVEL | Logging level (debug/info/warn/error) | info |
| LOG_FORMAT | Log format (json/text) | json |

Database Configuration

All configuration is stored in PostgreSQL and manageable via the admin UI:

  • OpenAI Settings - Model, temperature, max tokens
  • Threshold Config - Min confidence, min magnitude
  • RSS Sources - Feed URLs, fetch intervals, status
  • Scraper Config - Worker count, timeout settings

API Endpoints

Public API

| Endpoint | Method | Description |
|----------|--------|-------------|
| /api/events | GET | List published events with filtering |
| /api/events/:id | GET | Get single event by ID |
| /api/feed.rss | GET | RSS 2.0 feed of recent events |
| /api/stats | GET | System statistics |
| /healthz | GET | Health check |
| /metrics | GET | Prometheus metrics |
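A minimal Go client for the events endpoint might look like this. The query parameter names (category, min_confidence) and the JSON array response shape are assumptions; check the API handlers for the real contract.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
	"net/url"
)

// buildEventsURL assembles the query string for GET /api/events.
// Parameter names here are illustrative, not confirmed from the handlers.
func buildEventsURL(base, category string, minConfidence float64) string {
	q := url.Values{}
	if category != "" {
		q.Set("category", category)
	}
	if minConfidence > 0 {
		q.Set("min_confidence", fmt.Sprintf("%.2f", minConfidence))
	}
	return base + "/api/events?" + q.Encode()
}

// FetchEvents performs the request and decodes the JSON response,
// assumed here to be a plain array of event objects.
func FetchEvents(base, category string, minConfidence float64) ([]map[string]any, error) {
	resp, err := http.Get(buildEventsURL(base, category, minConfidence))
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return nil, fmt.Errorf("unexpected status %d", resp.StatusCode)
	}
	var events []map[string]any
	if err := json.NewDecoder(resp.Body).Decode(&events); err != nil {
		return nil, err
	}
	return events, nil
}

func main() {
	events, err := FetchEvents("http://localhost:8080", "geopolitics", 0.7)
	if err != nil {
		fmt.Println("request failed (is the server running?):", err)
		return
	}
	fmt.Printf("fetched %d events\n", len(events))
}
```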

Admin API

| Endpoint | Method | Description |
|----------|--------|-------------|
| /api/sources | GET/POST | Manage sources |
| /api/pipeline/metrics | GET | Pipeline funnel metrics |
| /api/scraper/scrape | POST | Trigger scraping |
| /api/scraper/status | GET | Scraping status |
| /api/openai-config | GET/PUT | OpenAI configuration |
| /api/thresholds | GET/POST | Threshold settings |
| /api/activity-logs | GET | Activity logs |
| /api/ingestion-errors | GET | Error tracking |

Key Features Explained

Split Scraping Architecture

The system separates RSS fetching from content scraping for better performance:

  1. Fast RSS Ingestion - Fetches feed metadata in seconds
  2. Async Scraping - Content scraped independently with worker pool
  3. Status Tracking - Sources have scrape_status: pending/in_progress/completed/failed/skipped
  4. Retry Logic - Failed scrapes can be retried without re-fetching RSS

See: SCRAPING_SPLIT_IMPLEMENTATION.md
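The scrape_status lifecycle can be sketched as a small state machine. This is a sketch only; the real transition rules live in the ingestion package, and the validNext map below is an assumption about which transitions are legal.

```go
package main

import "fmt"

// ScrapeStatus values match the statuses listed above.
type ScrapeStatus string

const (
	StatusPending    ScrapeStatus = "pending"
	StatusInProgress ScrapeStatus = "in_progress"
	StatusCompleted  ScrapeStatus = "completed"
	StatusFailed     ScrapeStatus = "failed"
	StatusSkipped    ScrapeStatus = "skipped"
)

// CanRetry reports whether a source may be re-scraped without
// re-fetching its RSS entry: only failed scrapes are retried.
func CanRetry(s ScrapeStatus) bool {
	return s == StatusFailed
}

// validNext enumerates an assumed set of legal transitions.
var validNext = map[ScrapeStatus][]ScrapeStatus{
	StatusPending:    {StatusInProgress, StatusSkipped},
	StatusInProgress: {StatusCompleted, StatusFailed},
	StatusFailed:     {StatusInProgress}, // the retry path
}

func main() {
	fmt.Println(CanRetry(StatusFailed), CanRetry(StatusCompleted))
	fmt.Println(validNext[StatusFailed]) // retry leads back to in_progress
}
```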

Event Correlation & Novel Facts

The system intelligently merges duplicate events while preserving new information:

  1. OpenAI Similarity - Compares new sources against existing events
  2. Smart Merging - Adds sources to existing events when similar
  3. Novel Facts Detection - Extracts new information from merged sources
  4. Additional Events - Creates separate events for novel details

See: NOVEL_FACTS_IMPLEMENTATION.md
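The merge decision above can be summarized as a three-way branch. A sketch: the similarity score stands in for the OpenAI-based comparison, and the 0.8 threshold is an illustrative assumption, not the configured value.

```go
package main

import "fmt"

// MergeDecision describes what the correlator does with a new source.
type MergeDecision int

const (
	CreateNewEvent      MergeDecision = iota // no sufficiently similar event exists
	MergeOnly                                // similar event found, nothing new to add
	MergeWithNovelFacts                      // similar event plus new details
)

// Decide applies the correlation logic described above. similarity would
// come from the OpenAI comparison; novelFacts from the extraction step.
func Decide(similarity float64, novelFacts []string) MergeDecision {
	if similarity < 0.8 {
		return CreateNewEvent
	}
	if len(novelFacts) > 0 {
		// Also spawns a separate "Additional Details" event.
		return MergeWithNovelFacts
	}
	return MergeOnly
}

func main() {
	fmt.Println(Decide(0.3, nil))                              // distinct story
	fmt.Println(Decide(0.9, nil))                              // pure duplicate
	fmt.Println(Decide(0.9, []string{"new casualty figure"}))  // duplicate with new facts
}
```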

Pipeline Funnel Visualization

Real-time monitoring of the processing pipeline:

  • Bottleneck Detection - Automatically identifies where processing is stuck
  • Conversion Metrics - Track scrape completion, enrichment, and publish rates
  • Status Breakdown - Detailed view of sources and events by status
  • Auto-refresh - Updates every 5 seconds
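The conversion metrics behind the funnel view amount to stage-to-stage ratios; a sharp drop between two adjacent stages marks the bottleneck. A sketch with assumed stage names:

```go
package main

import "fmt"

// StageCounts holds how many items reached each pipeline stage.
type StageCounts struct {
	Ingested, Enriched, Published int
}

// ConversionRates returns the stage-to-stage ratios the funnel view
// would plot. Zero denominators yield a rate of 0 rather than NaN.
func ConversionRates(c StageCounts) (enrichRate, publishRate float64) {
	if c.Ingested > 0 {
		enrichRate = float64(c.Enriched) / float64(c.Ingested)
	}
	if c.Enriched > 0 {
		publishRate = float64(c.Published) / float64(c.Enriched)
	}
	return enrichRate, publishRate
}

func main() {
	e, p := ConversionRates(StageCounts{Ingested: 200, Enriched: 150, Published: 30})
	fmt.Printf("enrich %.0f%%, publish %.0f%%\n", e*100, p*100) // the publish stage is the bottleneck here
}
```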

Development

Running Tests

# Run all tests
go test ./...

# Run with coverage
go test -cover ./...

# Run specific package
go test ./internal/enrichment/

Database Migrations

Migrations are in migrations/ and auto-applied on startup. Manual application:

psql $DATABASE_URL -f migrations/001_initial_schema.sql
psql $DATABASE_URL -f migrations/002_tracked_accounts.sql
# ... etc

Frontend Development

cd web
npm run dev      # Development server with hot reload
npm run build    # Production build
npm run preview  # Preview production build
npm run lint     # Lint code

Project Structure

stratint/
├── cmd/
│   └── server/          # Main server entry point
├── internal/
│   ├── api/             # REST API handlers
│   ├── config/          # Configuration management
│   ├── database/        # PostgreSQL repositories
│   ├── enrichment/      # AI enrichment (OpenAI)
│   ├── eventmanager/    # Event lifecycle management
│   ├── ingestion/       # RSS + scraping pipeline
│   ├── logging/         # Structured logging
│   ├── metrics/         # Prometheus metrics
│   ├── models/          # Data models
│   └── server/          # HTTP server
├── migrations/          # Database migrations
├── web/                 # React + TypeScript frontend
│   ├── src/
│   │   ├── admin/       # Admin dashboard
│   │   ├── components/  # Shared components
│   │   └── pages/       # Main UI pages
│   └── public/
├── docs/                # Design documents
├── archive/             # Historical documentation
└── README.md            # This file

Documentation

Module Documentation

Deployment

The application can be deployed to various platforms:

  • Google Cloud Run - See docs/GOOGLE_CLOUD_DEPLOYMENT.md
  • Docker - Dockerfile included (under development)
  • Traditional VPS - Binary + PostgreSQL + reverse proxy

Performance Considerations

  • Scraping Speed: ~5 concurrent workers, ~5-10 seconds per article
  • Enrichment: ~2-8 seconds per source (OpenAI API dependent)
  • Database: Indexed for fast querying, supports 10k+ events efficiently
  • Bottleneck: Typically scraping or OpenAI API rate limits
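The scraping figures above imply a rough throughput ceiling: 5 workers at 5-10 seconds per article works out to roughly 30-60 articles per minute. A sketch of the arithmetic:

```go
package main

import "fmt"

// Throughput estimates articles per minute for a worker pool,
// given the per-article scrape latency in seconds.
func Throughput(workers int, secondsPerArticle float64) float64 {
	return float64(workers) * 60.0 / secondsPerArticle
}

func main() {
	// Figures match the performance notes above: 5 workers, 5-10 s/article.
	fmt.Printf("best case: %.0f/min, worst case: %.0f/min\n", Throughput(5, 5), Throughput(5, 10))
}
```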

Troubleshooting

Sources stuck in "pending"

  • Check Playwright installation: npx playwright install
  • Verify scraper service is running
  • Check network connectivity
  • Review error logs

High scraping failure rate

  • Check scrape_error field in sources table
  • Verify target sites are accessible
  • Consider adding domains to skip list
  • Increase timeout/retry settings

Low enrichment rate

  • Verify OpenAI API key is valid
  • Check OpenAI API quota/rate limits
  • Review enrichment prompt effectiveness
  • Check event correlation thresholds

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for detailed guidelines on:

  • Setting up your development environment
  • Code style and standards
  • Submitting pull requests
  • Testing requirements

Quick start:

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes with tests
  4. Submit a pull request

License

MIT License

Copyright (c) 2025 Jacob Tyler Davis (jtdavis0492@gmail.com)

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

See the LICENSE file for details.

Contact & Support

  • Author: Jacob Tyler Davis (jtdavis0492@gmail.com)
  • Issues: Report bugs or request features via GitHub Issues
  • Discussions: Join community discussions on GitHub Discussions
  • Documentation: Full docs available in the docs/ directory
  • Security: Report security vulnerabilities privately via GitHub Security Advisories

For deployment and production support, see docs/DEPLOYMENT.md.
