Multi-Modal RAG Application for Document Analysis

Application Summary

This application is a sophisticated Retrieval-Augmented Generation (RAG) system that allows users to interact with and analyze PDF documents from the CFA Institute Research Foundation Publications. It features a multi-modal approach, combining text and image analysis for comprehensive document exploration.

Links to Resources:

Key Features

Data Ingestion and Storage: Automated scraping of CFA Institute publications, including titles, images, summaries, and PDF files.
Document Exploration: User-friendly interface for browsing and selecting documents.
On-the-Fly Summary Generation: Utilizes NVIDIA services for dynamic document summarization.
Multi-Modal RAG: Advanced querying system integrating both text and image data.
Q&A Interface: Interactive system for document-specific queries.
Report Generation: Produces research notes with links to relevant graphs, tables, and pages.

Frontend (Streamlit)

The frontend provides an intuitive interface with multiple pages:

User Management: Registration and login functionality.
Document Selection: Grid view and dropdown list for document browsing.
Summary Generation: On-demand document summarization.
Q&A Interface: Interactive querying system for selected documents.
Report Generation: Creation of detailed research notes.
Search Functionality: Comprehensive search across documents and research notes.

Backend (FastAPI)

The backend handles core functionalities:

Data Processing: Integration with Airflow for data scraping and S3 uploads.
Database Management: Snowflake integration for efficient data storage and retrieval.
Multi-Modal RAG: Implements advanced querying capabilities.
Authentication: Secure JWT-based user authentication.
API Endpoints: For document exploration, summary generation, and Q&A interactions.

Deployment

Containerized using Docker for easy deployment and scalability.
Publicly accessible API and Streamlit application.

Usage Instructions

Access the Streamlit frontend via the provided URL.
Create an account or log in to an existing one.
Explore documents using the grid view or dropdown list.
Generate summaries, ask questions, and create research notes for selected documents.
Use the search functionality to find specific information across documents and notes.

Installation

Clone the repository

  git clone https://github.com/BigDataIA-Fall2024-TeamA2/Assignment3 && cd Assignment3

Setup local environment by creating a virtual environment and the .env file (For *unix systems)

python3 -m venv venv
./venv/bin/activate.sh
 poetry install
 cp .env.template .env

Fill in the relevant secrets in .env file.
The application is dockerized and doesn't depend on external dependencies. Using the following command the frontend, backend applications can be started:

docker compose up -d



## Resources

- LLAMA Multimodal Report Generation Example
- Multimodal RAG Slide Deck Example
- NVIDIA Multimodal RAG Example

## Attestation

WE ATTEST THAT WE HAVEN’T USED ANY OTHER STUDENTS’ WORK IN OUR ASSIGNMENT AND ABIDE BY THE POLICIES LISTED IN THE STUDENT HANDBOOK

Contribution:

    a. Gopi Krishna Gorle: 33%
    b. Pranali Chipkar: 33%
    c. Mubin Modi: 33%

Repository Overview

.
├── README.md
├── airflow.Dockerfile
├── app.py
├── architecture
│   ├── diagrams
│   │   └── assignment3_architecture.png
│   ├── generate_diagrams.py
│   ├── images.png
│   ├── streamlit-logo-primary-colormark-darktext.png
│   └── v1.drawio
├── backend
│   ├── _init_.py
│   ├── config.py
│   ├── data
│   │   └── Lorem_ipsum.pdf
│   ├── database
│   │   ├── _init_.py
│   │   ├── articles.py
│   │   ├── init_db.py
│   │   ├── qa.py
│   │   ├── research_notes.py
│   │   ├── summary.py
│   │   └── users.py
│   ├── logging.conf
│   ├── main.py
│   ├── schemas
│   │   ├── _init_.py
│   │   ├── articles.py
│   │   ├── auth.py
│   │   ├── qa.py
│   │   └── users.py
│   ├── services
│   │   ├── _init_.py
│   │   ├── articles.py
│   │   ├── auth.py
│   │   ├── auth_bearer.py
│   │   ├── qa.py
│   │   ├── rag.py
│   │   ├── summary_generation.py
│   │   └── users.py
│   ├── test.py
│   ├── utilities
│   │   ├── _init_.py
│   │   ├── base_utils.py
│   │   └── nvidia_utils.py
│   └── views
│       ├── _init_.py
│       ├── articles.py
│       ├── auth.py
│       ├── qa.py
│       └── users.py
├── dags
│   ├── _init_.py
│   ├── articles.py
│   ├── data_indexer
│   │   ├── _init_.py
│   │   ├── document_processors.py
│   │   ├── pdf_indexer.py
│   │   └── utils.py
│   ├── data_ingestion
│   │   ├── _init_.py
│   │   ├── scraper.py
│   │   ├── uploader.py
│   │   └── utils.py
│   ├── pipeline.py
│   └── updated_articles_data.json
├── docker-compose.yml
├── frontend
│   ├── _init_.py
│   ├── config.py
│   ├── pages
│   │   ├── _init_.py
│   │   ├── chat.py
│   │   ├── document_viewer.py
│   │   ├── list_docs.py
│   │   ├── reports.py
│   │   ├── summary_generation.py
│   │   ├── user_creation.py
│   │   └── user_login.py
│   └── utils
│       ├── _init_.py
│       ├── api_utils.py
│       ├── auth.py
│       └── chat.py
├── poetry.lock
└── pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Modal RAG Application for Document Analysis

Application Summary

Key Features

Frontend (Streamlit)

Backend (FastAPI)

Deployment

Usage Instructions

Installation

Repository Overview

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
architecture		architecture
backend		backend
dags		dags
frontend		frontend
.dockerignore		.dockerignore
.env.template		.env.template
.gitignore		.gitignore
README.md		README.md
airflow.Dockerfile		airflow.Dockerfile
app.py		app.py
backend.Dockerfile		backend.Dockerfile
docker-compose-airflow.yml		docker-compose-airflow.yml
docker-compose-app.yaml		docker-compose-app.yaml
frontend.Dockerfile		frontend.Dockerfile
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

BigDataIA-Fall2024-TeamA2/Assignment3-Multi-Modal-RAG-Application-for-Document-Analysis-

Folders and files

Latest commit

History

Repository files navigation

Multi-Modal RAG Application for Document Analysis

Application Summary

Key Features

Frontend (Streamlit)

Backend (FastAPI)

Deployment

Usage Instructions

Installation

Repository Overview

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages