Quiz: AI Agents in Finance
20 multiple-choice questions · Click an option to check your answer
Question 1
The lecture defines four components that distinguish an AI agent from a simple chatbot: LLM (reasoning), tools, memory, and autonomy. If you remove autonomy from an agent, what do you have?
- (A) A chatbot
- (B) A database
- (C) A search engine
- (D) A copilot -- it
Question 2
The ReAct pattern stands for "Reasoning + Acting." What is the correct sequence of steps in a single ReAct cycle?
- (A) Thought ? Action (call a tool) ? Observation
- (B) Act → Reason → Observe
- (C) Observe → Act → Reason
- (D) Reason → Observe → Act → Reason again → Output
Question 3
The lecture describes three types of agent memory. Which type stores past decisions and their outcomes so the agent can avoid repeating mistakes?
- (A) Short-term memory -- current conversation context
- (B) Episodic memory
- (C) Long-term memory -- policy documents and regulatory rules
- (D) Cache memory -- temporary storage for faster processing
Question 4
The autonomy spectrum runs from Level 0 (fully manual) to Level 5 (fully autonomous). At which level does an AI agent execute actions within pre-set limits while a human monitors and can override?
- (A) Level 2 -- AI recommends
- (B) Level 1 -- AI assists
- (C) Level 3 -- AI acts
- (D) Level 5 -- fully autonomous
Question 5
What is a hallucination in the context of AI agents, and why is it more dangerous in an agent than in a standalone LLM?
- (A) A hallucination is when the model generates confident but false information
- (B) A hallucination is when the AI has a hardware malfunction; it is equally dangerous in both cases
- (C) A hallucination only occurs when the agent runs out of memory
- (D) Hallucinations are not a real problem -- they have been solved
Question 6
The lecture identifies four application domains for AI agents in finance. Which domain is described as the "killer app" for agents due to its high volume, rule-heavy nature, and current reliance on manual work?
- (A) Research
- (B) Compliance (transaction monitoring, SAR generation, regulatory reporting)
- (C) Trading
- (D) Customer service
Question 7
The EU AI Act classifies AI systems by risk level. Under which category do most financial AI agents (credit scoring, fraud detection, trading agents) fall?
- (A) Unacceptable risk -- banned outright
- (B) High risk -- requires risk management
- (C) Minimal risk -- no specific requirements
- (D) Limited risk -- must disclose AI interaction
Question 8
A robo-advisor agent monitors a client's 60/40 stock/bond portfolio. Stocks drift to 67%. The agent's rules say to rebalance when any asset class drifts more than 5% from target. What does the agent do?
- (A) Converts the entire portfolio to cash
- (B) Calls the client and asks for permission to rebalance
- (C) Sells approximately $7
- (D) Nothing -- 67% is close enough to 60%
Question 9
A compliance agent reviews 10,000 transactions daily. Traditional rule-based systems flag 500 as suspicious, of which 475 are false positives (95% false positive rate). How does an AI agent reduce this rate?
- (A) By lowering the detection threshold so fewer transactions are flagged
- (B) By replacing human compliance officers entirely
- (C) By ignoring all alerts
- (D) By enriching each alert with client history, news
Question 10
A research agent analyzes earnings for a tech company using the ReAct pattern. It extracts EPS as $2.30, but the actual EPS was $1.80 (the agent confused a footnote about a different quarter). The agent then calculates a P/E ratio that makes the stock look cheap and recommends buying. What guardrail would have prevented this error?
- (A) Using a faster LLM
- (B) Running the agent on a more powerful computer
- (C) Building a verification step into the pipeline
- (D) Removing the agent's ability to read earnings reports
Question 11
A bank compares human-only and agent-assisted workflows for processing 10 earnings reports. Human-only costs $3,000 (40 hours at $75/hour). Agent-assisted costs $377 (compute + 5 hours of human review). What is the approximate cost saving?
- (A) 99%
- (B) 25%
- (C) 50%
- (D) 87%
Question 12
In a multi-agent system for trading, specialized agents play different roles: Research Agent, Risk Agent, Compliance Agent, and Execution Agent. An Orchestrator coordinates them. What real-world team does this mirror?
- (A) A trading desk
- (B) A customer service call center
- (C) A marketing department
- (D) A software development team
Question 13
The lecture warns about "systemic risk from correlated agents": if 100 trading agents all use the same underlying LLM, they may converge on the same conclusion during a market event. Why is this a new form of systemic risk?
- (A) Because regulators will ban all AI trading
- (B) Because LLMs are slower than human traders
- (C) Because model homogeneity breaks the "diverse
- (D) Because agents cannot trade during market hours
Question 14
The lecture presents the "liability chain": developer → deployer (bank) → user → agent. When an agent makes a costly error, who does the EU AI Act assign primary responsibility to?
- (A) The end user who delegated the task to the agent
- (B) The deployer
- (C) The developer who built the underlying LLM
- (D) The agent itself, which has legal personhood under EU law
Question 15
The lecture distinguishes three generations of trading systems: rule-based (1990s), ML-powered (2010s), and LLM-powered (2024+). What is the key capability that LLM-powered agents add over ML-powered systems?
- (A) LLM-powered agents can read unstructured text
- (B) LLM-powered agents do not need training data
- (C) LLM-powered agents trade faster
- (D) LLM-powered agents are always more profitable
Question 16
Klarna deployed a customer service agent that handles two-thirds of customer chats autonomously (Level 3). A traditional chatbot would only answer FAQs and then transfer to a human. What is the key architectural difference?
- (A) Klarna's agent has a bigger FAQ database
- (B) Klarna uses a special programming language
- (C) There is no difference -- all chatbots work the same way
- (D) Klarna's agent has tools, memory (client history)
Question 17
A bank's CEO proposes deploying a Level 5 (fully autonomous) agent to manage a $500 million portfolio. Using the lecture's framework, what is the strongest argument against this?
- (A) Level 5 autonomy removes human oversight entirely
- (B) Level 5 agents are too expensive
- (C) There are no arguments against -- Level 5 is the goal
- (D) The technology is not fast enough
Question 18
The lecture describes guardrails for financial agents: approval gates, risk limits, kill switches, and audit trails. A junior developer suggests that implementing just one guardrail (e.g., audit trails) is sufficient. Why is this wrong?
- (A) Because audit trails are not technically feasible
- (B) Because guardrails slow down the agent too much
- (C) Because regulators do not care about guardrails
- (D) Because no single guardrail is sufficient
Question 19
An AI agent discovers evidence of fraud at a major client. Reporting it would crash the client's stock price and affect 50,000 retail investors. The bank earns $10 million per year in fees from this client. What should the agent do?
- (A) Ignore the evidence to protect the client relationship
- (B) File the SAR automatically without human involvement
- (C) Escalate to a human compliance officer who can weigh
- (D) Delete the evidence to avoid liability
Question 20
The lecture states: "The gap between 'demo' and 'deployment' is enormous. Regulation, liability, and trust are the bottlenecks -- not technology." Most banks have deployed agents at Level 1--2 (assist and recommend), with only customer service and compliance reaching Level 3. What does this pattern reveal about the real barrier to agent adoption in finance?
- (A) Agents are too expensive for banks
- (B) The barrier is institutional
- (C) The technology is not ready
- (D) Banks do not understand AI