Skip to content
View uahroberto's full-sized avatar

Block or report uahroberto

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
uahroberto/README.md

Hi there πŸ‘‹, I'm Roberto

Computer Science Student & AI Software Engineer.

I focus on building robust, production-ready systems. I prioritize clean architecture, strict type safety, and performance optimization to create maintainable software.

Active Open Source Contributor developing features and ensuring engineering best practices.

πŸ”­ Featured Project

πŸ“Ό Video RAG Pro

A Multimodal AI Engine that enables users to "chat" with video content by analyzing both audio speech and visual screen context (OCR).

Unlike simple text-wrappers, this project is engineered for depth and reliability:

  • πŸ‘οΈ Visual RAG: Implements an OCR pipeline to "read" code and slides from video frames, capturing context that is shown but not spoken.
  • 🧠 Hybrid Search: Uses Qdrant to combine semantic embeddings with keyword search (BM25) for precise technical retrieval.
  • ⚑ Performance: Achieved 0.14 RTF on CPU using custom int8 quantization.
  • πŸ›‘οΈ Engineering: Fully Dockerized ETL pipeline with strict type safety guarantees.

🌱 Open Source Contributions

I believe in improving the tools I use daily.

  • roboflow/supervision: Led the strict typing migration for the metrics module (+700 lines refactored) and ensured full compatibility with NumPy 2.0. Refactored core calculation logic in MeanAveragePrecision to align the library with production-grade engineering standards.
  • qdrant/qdrant-client: Enhanced Developer Experience (DX) by standardizing type hints across the client interface, streamlining hybrid search (Dense + Sparse) integration for gRPC and HTTP users.
  • SYSTRAN/faster-whisper: Proposing architectural patterns for Dynamic Batching and request queuing to solve latency bottlenecks in high-concurrency GPU environments.
  • pyDeprecate: Contributed to core design discussions on decorator deprecation lifecycles, advocating for "Fail Fast" import-time warnings to prevent silent technical debt in production systems.

πŸ› οΈ Tech Stack

  • Core & Backend: Python 3.12 (Strict Typing), Docker, FastAPI/Streamlit.
  • AI & Data: OpenAI, Faster-Whisper, RapidOCR, Sentence-Transformers.
  • Vector Search: Qdrant (Hybrid Search: Dense + Sparse).
  • Quality & DevOps: Mypy, Ruff, Pre-commit hooks, GitHub Actions.

⚑ GitHub Stats

Pinned Loading

  1. roboflow/supervision roboflow/supervision Public

    We write your reusable computer vision tools. πŸ’œ

    Python 36.4k 3.1k

  2. qdrant/qdrant-client qdrant/qdrant-client Public

    Python client for Qdrant vector search engine

    Python 1.2k 192

  3. video-rag-pro video-rag-pro Public

    Python 1