★ Featured 🏢 Caisse des Dépôts (ICDC)

Archi+ - AI Chatbot

Technologies & tools

Python FastAPI

Angular

LLM / RAG / LangChain

Azure AI Search

MongoDB

Overview

Conversational multimodal AI chatbot for the Caisse des Dépôts group, anchored in the company’s internal data.

Multi-model architecture with intelligent orchestration (Mistral for conversational, Codestral for code, Flux for image generation)
Advanced RAG pipeline with semantic chunking, RAG Fusion and autonomous agents to process complex internal documents
ReAct (Reasoning + Acting) agent system with access to external tools (web search, image generation)
Scalable architecture supporting high load with async optimizations and multi-threading

Complete design and implementation of backend architecture, with successive iterations to meet evolving business needs.

Challenge: Most execution time is spent waiting for LLM responses (network latency).

Fully async FastAPI architecture with:

Complete async management (asyncio) to maximize throughput
Concurrency of multiple requests without blocking
Optimized streaming pipeline for fluid user experience
Multi-threading and multi-processing for CPU-intensive tasks (embedding, document parsing)

Implementation of SSE (Server-Sent Events) instead of WebSockets to optimize simplicity and performance in unidirectional communication.

Results:

Anchoring chatbot in company’s internal data (technical documentation, processes, knowledge base).

Complete RAG pipeline:

Development of a multi-format ingestion system (PDF, DOCX, TXT) with parsing and normalization
Integration with Azure AI Search for indexing and vector search
Orchestration of entire pipeline: ingestion → chunking → embedding → retrieval → generation

Advanced optimizations:

Semantic chunking - Intelligent splitting algorithm based on document structure and meaning. Context preservation and significant relevance improvement.
RAG Fusion - Retrieval strategy generating multiple question reformulations, parallel searches and result fusion with re-ranking. Robustness against imperfect formulations.
Multi-query retrieval - Automatic generation of query variations to maximize recall and cover different semantic angles.
Autonomous agents - Agent architecture to dynamically orchestrate search strategies and iteratively refine results.