Skip to content

This project is a multimodal, voice-enabled assistant designed for the civil engineering domain. It uses Azure AI services and a Retrieval-Augmented Generation (RAG) pipeline, all integrated into a simple web interface.

Notifications You must be signed in to change notification settings

Manoj632004/Civil-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multimodal Voice-Enabled Civil Engineering Assistant (Using by Azure AI Services)

What?

This project is a multimodal, voice-enabled assistant designed for the civil engineering domain. It uses Azure AI services and a Retrieval-Augmented Generation (RAG) pipeline, all integrated into a simple web interface.

  • Azure Speech Service for voice-to-text: Ask questions via voice input, with Azure Speech Service handling speech-to-text conversion for a seamless experience.
  • Multimodal Input Support: Upload and analyze multiple PDFs and images (e.g., structural drawings, charts). The system extracts relevant information using Azure Form Recognizer and Computer Vision.
  • A custom RAG pipeline for domain-specific contextual answers: Retrieve contextual answers using a custom Retrieval-Augmented Generation (RAG) pipeline built on top of Azure OpenAI (GPT-4o mini).

Why?

This assistant is designed as a proof-of-concept to explore how generative AI can support various tasks in civil and structural engineering. While not fully tested for production use, it shows potential in aiding the following areas:

1. Design Optimization and Generative Design

  • Assists in creating reports and documentation
  • Helps explain design concepts, constraints, and material properties

2. Structural Analysis and Simulation

  • Offers basic natural language interaction to discuss simulation inputs and interpret summary-level results.

3. Material Selection and Research

  • Suggests materials based on application context
  • Retrieves material property data from uploaded docs or external sources

4. Seismic Design and Retrofitting

  • Can assist in early-stage brainstorming for retrofitting strategies and simulate conversational walkthroughs for damage scenarios.

5. Smart Structures and Sensor Integration

  • Demonstrates how sensor readings could be interpreted conversationally and how trends or anomalies might be discussed for proactive maintenance.

How?

GPT-4o mini

Acts as the main conversational model, handling natural language understanding, generation, and document-based QA using RAG.

RAG

Documents used in the Retrieval-Augmented Generation (RAG) component include:

  • PDF textbooks and manuals on structural engineering
  • Seismic design standards
  • Research papers and datasets on material properties
  • Uploaded reports and technical documentation

Azure AI Search

Indexes uploaded documents (PDFs, images) and retrieves relevant text snippets using vector search during user queries

Azure AI Vision

Extracts text from uploaded images using OCR and document layout understanding (used in RAG pipeline).

Azure AI Document Intelligence

Parses complex documents like PDFs, design specs, and scanned engineering drawings to make them queryable.

Azure AI Speech

Converts user speech input into text for voice-enabled chat, and optionally converts responses back to audio for accessibility.

📚 Reference

image image image

About

This project is a multimodal, voice-enabled assistant designed for the civil engineering domain. It uses Azure AI services and a Retrieval-Augmented Generation (RAG) pipeline, all integrated into a simple web interface.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published