Do LLMs really understand causality?
Evaluated LLM-generated simulations of causal outputs with algorithmic results across real-world, causal and synthetic datasets using metrics such as Precision, Recall, F1-score and Structural Hamming Distance.
Research Project Details
Add detailed information about your research project here.
Abstract
Brief summary of the research.
Methodology
Describe your approach.
Results
Across all experiments, LLMs produced plausible yet inaccurate estimates, showing that they rely on heuristics rather than actual computation.