AI-Paper Classifier and Summarizer · v1.0

Read papers, faster.

ResearchPilot turns any PDF into a structured analysis, domain classification across 7 fields and 42 sub-fields, an academic-register summary, ranked keywords, and ten recommendations sorted by cosine similarity.

researchpilot.app/analyze · attention-is-all-you-need.pdf

Classification

Primary domainComputer Science
Sub-domainMachine Learning
Confidence0.972

Keywords · YAKE

attention transformer self-attention encoder-decoder positional encoding

Summary · Gemini

Introduces the Transformer, an architecture relying solely on attention mechanisms, dispensing with recurrence and convolutions. Achieves state-of-the-art translation quality with a fraction of the training cost.

Recommendations · Semantic Scholar (top 3 of 10)

BERT: Pre-training of Deep Bidirectional…0.91
Layer Normalization0.84
An Image is Worth 16×16 Words0.79
01 — Capabilities

Four primitives,
one upload.

Every capability is independently inspectable: classifications, embeddings, and rankings are exposed as raw values, not just rendered pixels.

01 · Ingestion
PDF parsing with structural fidelity.

PyMuPDF reconstructs title, abstract, body and references from any uploaded PDF. No manual annotation, no template matching. Clean inputs ready for the linguistic models downstream.

02 · Classification
SciBERT, fine-tuned.

7 domains. 42 sub-categories.

03 · Summarisation
Academic register, by design.

Gemini is constrained to a fixed schema: objective, methodology, principal findings.

04 · Recommendation
Semantic Scholar, re-ranked semantically.

20 candidates retrieved from the live Semantic Scholar API; Sentence-BERT embeddings re-rank by cosine similarity to surface the 10 most intellectually proximate works to the paper at hand.

02 — Pipeline

From upload to insight
in a single pass.

A compact, audit-friendly inference graph. Each node fails gracefully, uncertainty is surfaced rather than papered over.

01 · UPLOAD PDF PyMuPDF 02 · CLASSIFY SciBERT 7 / 42 labels 03 · SUMMARISE Gemini academic register 04 · KEYWORDS YAKE semantic · unsupervised 05 · RETRIEVE Semantic Scholar 20 candidates 06 · RANK Sentence-BERT cosine similarity → OUTPUT Classification domain + confidence → OUTPUT Summary → OUTPUT Recommendations top 10
7/42
Domains / Sub-domains
10 of 20
Recs surfaced
~3s
End-to-end
100%
Open source
03 — Stack

Built on tools
we trust in production.

Every dependency is open source. Every component is replaceable.

Frontend
React
Streaming UI shell
Styling
Tailwind CSS
Design tokens
Backend
FastAPI
Async inference API
Database
PostgreSQL
User libraries
Classification
SciBERT
Fine-tuned model
Summarisation
Gemini API
Constrained prompt
Retrieval
Semantic Scholar API
Live candidate pool
Embeddings
Sentence-BERT
Cosine similarity
04 — Team

A senior project,
three engineers.

University of Bahrain · 2026

Abdelrahman Adel
Team Leader, Backend, Data collection and preparation, SciBERT training
Mohammed Al Jariri
Backend, SciBERT training
Mohammed Yaser Al Yusuf
Frontend, SciBERT training

Upload a paper.
Read it differently.

The repository is public. The demo is live. The pipeline is yours to inspect.

Open the demo → GitHub ↗