arXiv RAG — Semantic Research Q&A System
A RAG pipeline with a production-style structure that answers questions over 120 arXiv ML papers. It benchmarks three dense embedding models (BGE, MPNet, MiniLM) against a BM25 baseline across chunk sizes, measuring both retrieval and generation quality. BGE achieves a perfect MRR of 1.000, and its Answer Relevance is 7.7× that of BM25 (0.910 vs 0.118).
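
For readers unfamiliar with the retrieval metric, here is a minimal sketch of how Mean Reciprocal Rank (MRR) is conventionally computed, assuming one relevant chunk per query. The function name, the chunk IDs, and the toy data are hypothetical and are not taken from this project's evaluation code. The last line only reproduces the arithmetic behind the 7.7× figure from the two reported scores.

```python
from typing import Hashable, Sequence


def mean_reciprocal_rank(
    rankings: Sequence[Sequence[Hashable]],
    relevant: Sequence[Hashable],
) -> float:
    """Average over queries of 1/rank of the relevant item (0 if not retrieved)."""
    if len(rankings) != len(relevant):
        raise ValueError("rankings and relevant must have the same length")
    if not rankings:
        raise ValueError("at least one query is required")
    total = 0.0
    for ranked_ids, target in zip(rankings, relevant):
        # Ranks are 1-based: a hit in first position contributes 1.0.
        for rank, doc_id in enumerate(ranked_ids, start=1):
            if doc_id == target:
                total += 1.0 / rank
                break
    return total / len(rankings)


# Hypothetical toy data: three queries, each with one relevant chunk ID.
rankings = [["c1", "c7", "c3"], ["c4", "c2", "c9"], ["c5", "c8", "c6"]]
relevant = ["c1", "c2", "c0"]

# Hit at rank 1, hit at rank 2, and a miss: (1 + 1/2 + 0) / 3.
mrr = mean_reciprocal_rank(rankings, relevant)

# Ratio of the two reported Answer Relevance scores (BGE vs BM25).
relevance_ratio = 0.910 / 0.118
```

Under this definition, an MRR of 1.000 means the relevant chunk was ranked first for every evaluated query.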