Do LLMs really understand causality?

Evaluated LLM-generated simulations of causal outputs with algorithmic results across real-world, causal and synthetic datasets using metrics such as Precision, Recall, F1-score and Structural Hamming Distance.

Research Project Details

Add detailed information about your research project here.

Abstract

Brief summary of the research.

Methodology

Describe your approach.

Results

Across all experiments, LLMs produced plausible yet inaccurate estimates, showing that they rely on heuristics rather than actual computation.