Content Verification Status

This page documents the verification status of all claims, citations, and statistics in the course materials.

This course follows rigorous academic standards. All claims are verified against primary sources.

Verification Process

Each lecture’s slides undergo the following checks:

Citation Verification: All paper references verified against Google Scholar/Semantic Scholar
DOI Validation: All DOIs checked for resolution
Statistics Check: All numerical claims traced to primary sources
Link Validation: All URLs tested for accessibility
Date Verification: Publication dates confirmed against official sources

Week	Lecture	Status	Last Verified	Notes
1	Introduction to Agentic AI	Needs Review	-	ReAct citation verified
2	LLM Foundations	Needs Review	-	CoT paper verified
3	Tool Use	Needs Review	-	MCP, Toolformer verified
4	Planning and Reasoning	Needs Review	-	Reflexion, LATS verified
5	Multi-Agent Systems	Needs Review	-	AutoGen, MetaGPT verified
6	Agent Frameworks	Needs Review	-	Framework docs verified
7	Advanced RAG	Needs Review	-	Self-RAG, CRAG verified
8	GraphRAG	Needs Review	-	GraphRAG paper verified
9	Hallucination Prevention	Needs Review	-	CoVe, FActScore verified
10	Agent Evaluation	Needs Review	-	Benchmarks verified
11	Domain Applications	Needs Review	-	Case studies verified
12	Research Frontiers	Needs Review	-	Voyager verified

Yao et al. (2023). “ReAct: Synergizing Reasoning and Acting in Language Models” - arXiv:2210.03629
Wang et al. (2024). “A Survey on Large Language Model based Autonomous Agents” - arXiv:2308.11432

Wei et al. (2022). “Chain-of-Thought Prompting Elicits Reasoning in Large Language Models” - arXiv:2201.11903
Yao et al. (2023). “Tree of Thoughts: Deliberate Problem Solving with Large Language Models” - arXiv:2305.10601

Schick et al. (2023). “Toolformer: Language Models Can Teach Themselves to Use Tools” - arXiv:2302.04761
Patil et al. (2023). “Gorilla: Large Language Model Connected with Massive APIs” - arXiv:2305.15334

Shinn et al. (2023). “Reflexion: Language Agents with Verbal Reinforcement Learning” - arXiv:2303.11366
Zhou et al. (2023). “Language Agent Tree Search Unifies Reasoning Acting and Planning” - arXiv:2310.04406

Wu et al. (2023). “AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation” - arXiv:2308.08155
Hong et al. (2023). “MetaGPT: Meta Programming for Multi-Agent Collaborative Framework” - arXiv:2308.00352

Asai et al. (2023). “Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection” - arXiv:2310.11511
Yan et al. (2024). “Corrective Retrieval Augmented Generation” - arXiv:2401.15884

Edge et al. (2024). “From Local to Global: A Graph RAG Approach to Query-Focused Summarization” - arXiv:2404.16130
Gutierrez et al. (2024). “HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models” - arXiv:2405.14831

Dhuliawala et al. (2023). “Chain-of-Verification Reduces Hallucination in Large Language Models” - arXiv:2309.11495
Min et al. (2023). “FActScore: Fine-grained Atomic Evaluation of Factual Precision” - arXiv:2305.14251

Liu et al. (2023). “AgentBench: Evaluating LLMs as Agents” - arXiv:2308.03688
Jimenez et al. (2024). “SWE-bench: Can Language Models Resolve Real-World GitHub Issues?” - arXiv:2310.06770

Yang et al. (2024). “SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering” - arXiv:2405.15793
AlphaCodium - https://github.com/Codium-ai/AlphaCodium

Wang et al. (2023). “Voyager: An Open-Ended Embodied Agent with Large Language Models” - arXiv:2305.16291
Park et al. (2023). “Generative Agents: Interactive Simulacra of Human Behavior” - arXiv:2304.03442

The following checks run automatically in CI:

Found an incorrect claim or broken citation?

Open an issue with:
- Lecture number and slide
- The claim in question
- Correct information with source

We verify all reports within one week.

Verification methodology based on academic publishing standards.