Breaking the Chains of Probability: Neutrosophic Logic as a New Framework for Epistemic Uncertainty in Large Language Models
Abstract
Neutrosophic Logic is applied to large language models to better represent epistemic uncertainty and internal conflicts, revealing that hyper-truth states emerge spontaneously in ethical and logical contexts.
Large Language Models (LLMs) are predominantly governed by probabilistic frameworks in which the sum of outcome probabilities is constrained to unity. This architectural limitation, often imposed by Softmax layers, leads to a collapse of uncertainty that makes it difficult to differentiate between epistemic uncertainty, paradox, and vagueness. We present an empirical investigation of the application of Neutrosophic Logic, a framework that treats Truth (T), Indeterminacy (I), and Falsity (F) as three independent dimensions, to model epistemic states in LLMs. We conducted experiments on a family of four OpenAI GPT models across five linguistic phenomena: logical paradoxes, epistemic ignorance, vagueness, ethical contradictions, and future contingencies, under three prompting strategies: neutrosophic, probabilistic, and entropy-derived. Our findings reveal that the neutrosophic approach, by allowing T+I+F > 1, a state we term hyper-truth, provides a richer representation of a model's internal state. In 35% of evaluations, hyper-truth emerged spontaneously, predominantly under ethical contradiction and logical paradox. We demonstrate that this approach preserves truth values in fuzzy contexts and offers a robust method for identifying and quantifying internal model conflict. We conclude that the integration of neutrosophic evaluation layers is a critical step toward more transparent, reliable, and ethically aware AI systems.
Community
TL;DR
This paper argues that the Softmax layer at the end of an LLM isn't just an implementation choice — it's an epistemological constraint. By forcing the probabilities of mutually exclusive outcomes to sum to 1, we collapse four distinct phenomena into the same format: ignorance, paradox, vagueness, and genuine moral conflict. The authors propose evaluating LLMs with neutrosophic logic (Truth, Indeterminacy, and Falsity as three independent dimensions in [0,1], with no sum constraint) and report that in 66% of unconstrained evaluations T + I + F > 1 — a regime they call hyper-truth that is structurally unreachable under probabilistic prompting (Proposition 1).
What I find interesting
The experiment is well controlled: same JSON output format under Strategy 1 and Strategy 2, the only thing that varies is the sum constraint communicated to the model. That cleanly isolates the effect of normalization from the effect of output format.
The largest representational gap appears exactly where it matters most for alignment: ethical contradiction (95% hyper-truth rate, ΔT = +0.267) and epistemic ignorance (ΔI = +0.383 — i.e., 38 percentage points of indeterminacy that normalization reroutes into confident assertions).
Mason (2026) independently replicated the effect across 5 additional vendors at 84%, so it doesn't look like an OpenAI-specific artifact.
The authors are explicit that this is a representational affordance, not a latent variable measured inside the model — I think that's the right framing and it avoids overclaiming.
Limitations worth flagging
Effective N = 100, with 5 stochastic replicates per cell — these are not independently sampled stimuli, so the Wilson CI and χ² should be read at the cell × repetition level.
Only 5 phenomena, one per class. Downstream validation is still open: does declared hyper-truth predict performance on trolley-problem reasoning, legal dilemma resolution, or paradox-tolerance benchmarks?
The plithogenic extension isn't pursued here, which is where Mason has been pushing and where Proposition 2 (non-injectivity of the scalar projection π) naturally points.
Question for the community
Does it make sense to think about a neutrosophic head as an optional post-Softmax evaluation layer — something a model could activate when it detects a domain sensitive to epistemic conflict (ethics, law, forecasting) — rather than replacing Softmax globally? Or does the audit/calibration cost of having two coexisting output formats make that impractical in production?
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Structured Multi-Criteria Evaluation of Large Language Models with Fuzzy Analytic Hierarchy Process and DualJudge (2026)
- Are You the A-hole? A Fair, Multi-Perspective Ethical Reasoning Framework (2026)
- Structured Abductive-Deductive-Inductive Reasoning for LLMs via Algebraic Invariants (2026)
- When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis (2026)
- Escaping the Agreement Trap: Defensibility Signals for Evaluating Rule-Governed AI (2026)
- Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure (2026)
- It's Not Always Sycophancy: Measuring LLM Conformity as a Function of Epistemic Uncertainty (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2605.24053 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper