Data Analysis Agent (data processing and extraction) |
Text Summarization (TS) |
ECT-Sum, LCFNS |
Text (earnings-call transcripts, financial reports, news articles) |
ROUGE, BERTScore, Num-Prec., SummaC |
FinMA, FinTral, InvestLM, BloombergGPT, FinGPT |
Limited structured data integration, high computational cost, lack of real-time updates. |
Name-Entity Recognition (NER) |
FIN, FiNER-ORD |
Text (SEC filings, financial news articles) |
Precision, Recall, F1-score |
FinMA, InvestLM, ICE-INTERN |
Small-scale coverage, weak entity linking, limited numeric reasoning. |
Financial Relation Extraction (FRE) |
FinRED, FIRE, KPI-EDGAR |
Text (EDGAR filings, earnings-call transcripts, KPI mentions) |
Precision, Recall, F1-score |
FinTral, ICE-INTERN, Xuanyuan 2.0 |
Difficulty detecting event-based relationships, lack of domain-specific pretraining. |
Investment Research Agent (asset evaluation and market prediction) |
Event Classification (EC) |
FOMC, FedNLP, Headlines |
Text (policy statements, news headlines, earnings-call transcripts) |
Accuracy, Precision, Recall, F1-score |
BloombergGPT, FinLLaMA, Temporal meets LLM, FinMA, FinGPT |
No real-time market data, insufficient domain-specific pretraining. |
Sentiment Analysis (SA) |
FPB, FiQA-SA, StockEmotions |
Text (news articles, microblogs, StockTwits) |
Accuracy, Precision, Recall, F1-score, MSE |
FinGPT, FinMA, BloombergGPT, FinLLaMA |
Short-text limitations, oversimplified sentiment classification, lack of multi-modal context. |
Time Series Forecasting (TSF) |
StockNet, Bigdata22, CIKM18 |
Text (tweets, microblogs), Time Series (stock prices) |
Accuracy, MCC |
Temporal meets LLM, FinLLaMA, FinGPT, FinMA |
No real-time data, weak asset-specific feature integration. |
Trading Agent (strategy execution and decision-making) |
Strategy Execution (SE) |
GPT-InvestAR, FinTrade |
Text (earnings reports, sentiment), Tables (historical prices) |
Profitability, Sharpe Ratio (SR) |
GPT-3.5-Turbo, FinBen |
Narrow market coverage, lack of real-time data, overlook portfolio diversification. |
Support Decision-Making (SDM) |
InvestorBench, STRUX, FinBen |
Text (financial reports), Tables (crypto market data), Time Series (stock prices) |
Cumulative Return (CR), Sharpe Ratio (SR), Annualized Volatility (AV), Maximum Drawdown (MDD) |
FinMEM, STRUX, CFGPT |
Narrow real-world asset coverage, over-reliance on simplistic reward signals. |
Investment Management Agent (portfolio optimization and allocation) |
Question-Answering (QA) |
FiQA-QA, FinQA, ConvFinQA |
Text (financial news, social media posts, earnings statements), Tables (S&P 500 market tables) |
nDCG, MRR, Execution Accuracy, Program Accuracy |
FinQANet, Alphafin, FinMA, InvestLM |
Limited multi-modal support, struggle with long multi-hop reasoning. |
Risk Management Agent (fraud detection and compliance) |
Fraud Detection (FD) |
Credit Card Fraud, ccFraud |
Text (credit card transactions), Tables (financial logs) |
Accuracy, Precision, Recall, F1-score, AUC-ROC |
Finbench, FinGPT, CALM |
Class imbalance, evolving fraud patterns, lack of real-time tracking. |
Default Risk Prediction (DRP) |
Finbench-CD, Finbench-LD |
Text (home equity loans, vehicle loans), Tables (credit card client records) |
Accuracy, Precision, Recall, F1-score |
Finbench, FinGPT, CALM |
Highly imbalanced data, poor interpretability for credit decisions. |
Multi-Agent Collaboration (MAC) |
Multi-Agent Collaboration (MAC) |
FinCon, Tradingagents, Cryptoagents |
Text (financial news), Tables (crypto market data), Audio (ECC recordings) |
CoT Accuracy, Profitability, CR, SR, MDD |
StockGPT, FinCon, Tradingagents |
Lack of real-time trading support, prompt engineering sensitivity. |