Shruti
← Back to Projects

Agentic RAG Clinical Trial Discovery Chatbot

A patient- and clinician-facing clinical trials search assistant that uses agentic RAG over 113k ClinicalTrials.gov records to surface grounded, safety-filtered trial options for 14 major diseases.

Domain: Healthcare • Clinical trials Tech: Python • Gemini 2.0 Flash • all-MiniLM-L6-v2 • Qdrant • Streamlit • Docker • Google Cloud Run Data: 113k ClinicalTrials.gov trials (14 diseases) with NCT IDs, status, phase, enrollment, and summaries UI: Streamlit chat interface for trial search

Abstract

This project develops a multi-agent, retrieval-augmented chatbot that lets patients and clinicians query ClinicalTrials.gov in natural language and receive ranked, explainable trial matches. The system embeds 113,247 trials into a Qdrant vector database using all-MiniLM-L6-v2 and applies a disease-aware hybrid ranking function that combines semantic similarity, disease alignment, and recruitment status. Top-5 trials are summarized with NCT IDs and PubMed-grounded explanations, while an ActiveSafetyFilter agent blocks or rewrites unsafe medical advice before responses reach the user. The assistant is delivered through a Streamlit web app deployed on Google Cloud Run, backed by a managed Qdrant cluster.

What I did (methods)

Key findings

What this shows about me