# Systematic Literature Reviews with AI

## Overview
Systematic reviews constitute a critical foundation for evidence-based decision-making across disciplines. However, the labor-intensive nature of traditional systematic literature reviews (SLRs), which typically require weeks to months of manual work, has driven significant interest in AI-assisted automation.
## Key Statistics
| Metric | Value |
|---|---|
| Workload reduction with AI screening | 40-95% |
| otto-SR: Cochrane reviews completed | 12 reviews in 2 days (vs ~12 work-years manually) |
| GPT-4 PICO-element extraction accuracy (population, intervention, comparison, outcome) | >85% median |
## AI Tools for Systematic Reviews

### Open Source Tools
| Tool | Description | Features | Link |
|---|---|---|---|
| ASReview | Active learning for systematic reviews | Open source, up to 95% screening-workload reduction, Python-based | Visit GitHub |
| RobotReviewer | ML system for RCT assessment | Free, web-based, bias assessment | Visit |
| Colandr | Open-source screening tool | Free, collaborative | Visit |
| FAST2 | Active learning screening | Open source, Python 3 | Visit GitHub |
### Commercial & Freemium Tools
| Tool | Description | Pricing | Link |
|---|---|---|---|
| Rayyan | AI-powered review management | Free tier available | Visit |
| Elicit | AI research assistant | Free: basic / Pro: $42/mo | Visit |
| Covidence | Cochrane-recommended tool | Free for Cochrane reviews | Visit |
| DistillerSR | Enterprise review software | Subscription-based | Visit |
| Laser AI | Living systematic reviews | Commercial | Visit |
| otto-SR | End-to-end LLM workflow | Web platform | Visit |
| EPPI-Reviewer | Comprehensive review tool | Subscription | Visit |
### Specialized LLM Applications
| Tool/Method | Application | Model |
|---|---|---|
| Systematic Review Extractor Pro | Data extraction | Custom GPT |
| otto-SR Screening Agent | Abstract/full-text screening | GPT-4.1 |
| otto-SR Extraction Agent | Data extraction | o3-mini-high |
## Key Research Papers

### Foundational Papers

- **ASReview framework**: "An open source machine learning framework for efficient and transparent systematic reviews", *Nature Machine Intelligence* 3, 125-133 (2021)
- **Rayyan original paper**: "Rayyan - a web and mobile app for systematic reviews", *Systematic Reviews* 5, 210 (2016)
### Recent LLM Research (2024-2025)

- **otto-SR**: automation of systematic reviews with LLMs; demonstrated 96.7% sensitivity and 97.9% specificity in screening (*medRxiv* preprint)
- **Scoping review**: "Large language models for conducting systematic reviews: on the rise, but not yet ready for use", *Journal of Clinical Epidemiology*
- **GPT-4 evaluation**: "Can large language models replace humans in systematic reviews?", *Research Synthesis Methods*
- **LLM-assisted SLR system**: "Enhancing systematic literature reviews with generative AI", *JAMIA* 32, 616
### Methodology & Guidelines

- **PRISMA-AI guidelines**: "PRISMA AI reporting guidelines for systematic reviews and meta-analyses on AI in healthcare", *Nature Medicine*
- **Practical guide to ML in research synthesis**: "Toward systematic review automation: a practical guide", *Systematic Reviews*
## Methodological Guidelines

### PRISMA-AI Framework
The PRISMA-AI extension provides standardized reporting for AI-related systematic reviews:
- Search strategy documentation
- Quality assessment with AI-specific criteria
- Transparent result reporting
- Technical reproducibility requirements
### LLM Integration Guidelines

When integrating LLMs into systematic reviews:

#### 1. Screening Phase
- Use zero-shot or few-shot classification
- Define clear inclusion/exclusion criteria in prompts
- Maintain human oversight for borderline cases
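The screening guidelines above can be sketched as a small prompt-building and decision-parsing pair. The template wording, the example criteria, and the INCLUDE/EXCLUDE/UNSURE labels are illustrative assumptions, not the prompt of any particular tool:

```python
# Hypothetical zero-shot screening prompt with explicit criteria, and a
# parser that routes anything unclear to a human (guideline 3).

SCREENING_TEMPLATE = """You are screening abstracts for a systematic review.

Inclusion criteria:
{inclusion}

Exclusion criteria:
{exclusion}

Abstract:
{abstract}

Answer with exactly one word: INCLUDE, EXCLUDE, or UNSURE."""


def build_screening_prompt(abstract: str, inclusion: list[str], exclusion: list[str]) -> str:
    """Fill the zero-shot template with explicit inclusion/exclusion criteria."""
    return SCREENING_TEMPLATE.format(
        inclusion="\n".join(f"- {c}" for c in inclusion),
        exclusion="\n".join(f"- {c}" for c in exclusion),
        abstract=abstract.strip(),
    )


def parse_decision(model_output: str) -> str:
    """Map the model's reply to a decision; unclear replies go to a human reviewer."""
    reply = model_output.strip().upper()
    if reply.startswith("INCLUDE"):
        return "include"
    if reply.startswith("EXCLUDE"):
        return "exclude"
    return "human_review"
```

Defaulting unparseable output to `human_review` keeps the human in the loop for exactly the borderline cases the guideline flags.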
#### 2. Data Extraction
- Use structured prompts (RISEN framework)
- Validate extracted data against source documents
- Document prompt versions for reproducibility
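A structured extraction prompt in the RISEN style (Role, Instructions, Steps, End goal, Narrowing) could be assembled like this; the section wording, field names, and example content are hypothetical:

```python
# Sketch of a RISEN-style prompt builder. Keeping the five sections as a
# fixed tuple makes prompt versions easy to document and reproduce.

RISEN_FIELDS = ("role", "instructions", "steps", "end_goal", "narrowing")


def build_risen_prompt(**sections: str) -> str:
    """Join the five RISEN sections in a fixed order; fail loudly if one is missing."""
    missing = [f for f in RISEN_FIELDS if f not in sections]
    if missing:
        raise ValueError(f"missing RISEN sections: {missing}")
    return "\n\n".join(
        f"{field.replace('_', ' ').title()}:\n{sections[field]}" for field in RISEN_FIELDS
    )


prompt = build_risen_prompt(
    role="You are a data-extraction assistant for a systematic review.",
    instructions="Extract the study design, sample size, and primary outcome.",
    steps="1. Read the full text. 2. Locate each field. 3. Quote the source sentence.",
    end_goal="Return a JSON object with keys: design, n, primary_outcome, quotes.",
    narrowing="If a field is not reported, use null. Do not guess values.",
)
```

Asking the model to quote the source sentence for each field supports the validation step above: each extracted value can be checked against the quoted passage.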
#### 3. Quality Assurance
- Dual verification (AI + human) recommended
- Report sensitivity and specificity metrics
- Document AI model versions and parameters
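Reporting sensitivity and specificity only takes four counts from a comparison against a human gold standard; a minimal sketch (the example counts are invented for illustration):

```python
# Sensitivity and specificity of an AI screener, measured against a
# human-adjudicated gold standard.

def sensitivity(tp: int, fn: int) -> float:
    """Share of truly relevant studies the screener kept (recall)."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Share of truly irrelevant studies the screener excluded."""
    return tn / (tn + fp)


# Example: 30 relevant studies with 1 missed; 950 irrelevant with 20 wrongly kept.
print(f"sensitivity = {sensitivity(29, 1):.1%}")   # 96.7%
print(f"specificity = {specificity(930, 20):.1%}")  # 97.9%
```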
## Performance Benchmarks

### Screening Accuracy
| Tool/Method | Sensitivity | Specificity | Notes |
|---|---|---|---|
| otto-SR | 96.7% | 97.9% | GPT-4.1 based |
| Human dual review | 81.7% | 98.1% | Traditional approach |
| Rayyan AI | 97-99% | 19-58% | At a relevance-rating threshold of <2.5 |
| ASReview | Variable | Variable | Depends on dataset |
### Data Extraction
| Model | Precision | Recall | Notes |
|---|---|---|---|
| GPT-based (pooled) | 83.0% | 86.0% | Mean across studies |
| BERT-based | Lower | Lower | Compared to GPT |
| otto-SR extraction | - | - | 93.1% overall accuracy (o3-mini-high) |
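The pooled precision and recall figures above are ratios of simple counts against a gold standard; a minimal sketch with invented example numbers:

```python
# Precision and recall for LLM data extraction. "Correct" means the
# extracted value matches the gold-standard value from the source document.

def precision(correct: int, extracted: int) -> float:
    """Of all values the model extracted, the share that are correct."""
    return correct / extracted


def recall(correct: int, relevant: int) -> float:
    """Of all values actually present in the sources, the share extracted correctly."""
    return correct / relevant


# Example: 100 fields extracted, 90 correct, out of 120 fields in the sources.
print(f"precision = {precision(90, 100):.0%}, recall = {recall(90, 120):.0%}")
```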
### Time Savings
| Stage | Traditional | AI-Assisted | Reduction |
|---|---|---|---|
| Screening | 8-12 weeks | 2-3 weeks | ~75% |
| Data extraction | 10-16 weeks | 3-5 weeks | ~70% |
| Per-paper extraction | 36 min | 27 sec + 13 min review | ~60% |
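The reduction column follows from the quoted figures (using range midpoints); a sketch of the arithmetic:

```python
# How the "Reduction" column is derived: 1 - assisted/traditional,
# using the midpoints of the quoted ranges.

def reduction(traditional: float, assisted: float) -> float:
    """Fractional time saved when moving from traditional to AI-assisted work."""
    return 1 - assisted / traditional


# Screening: midpoint 10 weeks -> 2.5 weeks
print(f"{reduction(10, 2.5):.0%}")            # 75%
# Per-paper extraction: 36 min -> 27 s + 13 min of review
print(f"{reduction(36, 27 / 60 + 13):.0%}")   # 63%, i.e. roughly 60%
```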
## Additional Resources

### Getting Started

**For beginners:**
- Start with **Rayyan**: free tier, user-friendly interface
- Try **ASReview**: open source, well documented
- Read the **PRISMA guidelines** to understand the methodological requirements

**For advanced users:**
- Explore **otto-SR**: state-of-the-art LLM automation
- Build custom GPT extractors using the RISEN framework
- Combine tools: ASReview for screening plus ChatGPT for extraction
### Key GitHub Repositories

- `asreview/asreview`: active learning for systematic reviews
- `asreview/synergy-dataset`: ML dataset for study selection
- The `systematic-reviews` GitHub topic: a catalog of SLR tools
### Library Guides
- King’s College London - AI in Evidence Synthesis
- Purdue University - AI Tools for Systematic Review
- Harvard Library - Systematic Reviews Software
- Lancaster University - Systematic Reviews Tools
### Python Quick Start

```shell
# Install ASReview
pip install asreview
```

```python
# Basic usage: project API
from asreview import ASReviewProject

# See documentation: https://asreview.readthedocs.io/
```
(c) Joerg Osterrieder 2025